[TOC]
PFGI is a bioinformatics pipeline for fungal genome identification and annotation.
- Download: clone from github: git clone https://github.com/lyotvincent/PFGI.git
- Configuration file: User could set software parameters in pfgi_configuration_template.xlsx and use "-conf_file_path pfgi_configuration.xlsx" to let program know where the configuration file is.
- Download fungal genomes & build database.
python download_fungi_data.py
. Next, the database path should be written in the configuration file. - run this command to get 'help': python fungi_pipeline.py -h
help information:
-f #input single end file (fastq format)
-1 and -2 #input paired end files (fastq format)
-ngs or -3gs #choose which mode to run (Required)
-conf_file_path #the configuration file path (Required)
-o #out directory name
-h or -help or --h or --help #get help information
a example:
python ../PFGI/fungi_pipeline.py -1 SRR7530142_1.fastq -2 SRR7530142_2.fastq -ngs -conf_file_path ../PFGI/pfgi_configuration_template.xlsx -o SRR7530142
These software/tools respectively support part of the entire pipeline. If you want to use all the functions of the pipeline, all these software in the table should be installed.
The ✔ in 'conda' column means that the software cound install by conda.
An example for installing these software :
conda install -y fastqc