-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Roshan edited this page Jan 3, 2020
·
6 revisions
is a set of scripts for whole genome comparison. It requires python3 and few modules.
should be compatible with python3
- pandas
- numpy
- biopython
for installing all the modules you can use an Anaconda distribution of python3
- Emboss
- ProteinOrtho [ tested with v4 ]
- Blastall for running ProteinOrtho
- Prodigal for generating inputs for ProteinOrtho
- Inputs
Input is a directory containing all pep and nuc files
pep and nuc files can be generated using another script called runProdigal.py
File name extension should be Absolutely .pep and .nuc
If you are giving your own .pep and .nuc files be sure that they both have same sequence ids inside Other files in the directory will not be considered
- Output
Output is a csv file containing AGIOS values and number of orthologus pairs between each pair of genomes.
usage: magi.py [-h] -g GENOME_FILES -p PROTEINORTHO_RESULT_FILE -o OUT_PATH [-c {0,1}]
The main MAGI script.
Inputs are ProteinOrtho result file, path to the cds files , output directory path and calculate agios for core genes
optional arguments:
-h, --help show this help message and exit
-g GENOME_FILES path to input directory
-p PROTEINORTHO_RESULT_FILE path to ProteinOrtho result file
-o OUT_PATH Out put directory path
-c {0,1} calculate agios for core genes (0 for no cores 1 for yes)
Running Prodigal
usage: runProdigal.py [-h] -g GENOME_LIST [GENOME_LIST ...]
This is a suplimentary script which runs prodigal and parse the file and give correct filenames
Requirements: prodigal
optional arguments:
-h, --help show this help message and exit
-g GENOME_LIST [GENOME_LIST ...] path to the genome files
PreProcessing
need to update this script
- working on threading for improving speed
- A polyphasic strategy incorporating genomic data for the taxonomic description of novel bacterial species.International Journal of Systematic and Evolutionary Microbiology (2014), 64 ,384-391. [DOI: 10.1099/ijs.0.057091-0] .