-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can I use or add a different translation table for the genetic code (working with parasites)? #200
Comments
Greetings @azmigueldario, Thank you for your interest. Most modules in the latest version, v3.3.6, allow users to select or autodetect the genetic code. The CreateSchema and AlleleCall modules allow you to specify the genetic code through the Kind regards, Rafael |
Thank you for the quick reply. I will use the standard code then (#1). I saw that the tables are restricted to a few most commonly used and related to bacterial pathogens, see error output below. I believe the table is used in a function imported from Thanks again, Authors: Rafael Mamede, Pedro Cerqueira, Mickael Silva, João Carriço, Mário Ramirez
Github: https://github.com/B-UMMI/chewBBACA
Documentation: https://chewbbaca.readthedocs.io/en/latest/index.html
Contacts: [email protected]
==================================
chewBBACA - PrepExternalSchema
==================================
Started at: 2024-06-27T17:06:30
Invalid genetic code value.
Value must correspond to one of the accepted genetic codes
Accepted genetic codes:
1: Standard
4: The mold, protozoan, and coelenterate mitochondrial code and the mycoplasma/spiroplasma code
11: The Bacterial, Archaeal and Plant Plastid code
25: Candidate division SR1 and gracilibacteria code |
Hello @azmigueldario, I must be playing Jedi mind tricks on myself since I forgot about that step to validate the genetic code. It should accept more than those four genetic codes. I will add more genetic codes to the dictionary with the accepted values so that it still validates the value passed. This does not guarantee that it will work for any organism; it still depends on Pyrodigal/Prodigal, which was designed for Bacteria and Archaea. What is the genetic code that you would like to use? Best regards, Rafael |
Hi Rafael, thank you for your reply.
I am interested in code number 6 and you are right that it may not work
properly, I am working with Giardia which seems to have a somewhat similar
genome structure to bacteria so I hope it works.
…On Fri, Jun 28, 2024 at 2:45 AM Rafael Mamede ***@***.***> wrote:
Hello @azmigueldario <https://github.com/azmigueldario>,
I must be playing Jedi mind tricks on myself since I forgot about that
step to validate the genetic code. It should accept more than those four
genetic codes. I will add more genetic codes to the dictionary with the
accepted values so that it still validates the value passed. This does not
guarantee that it will work for any organism; it still depends on
Pyrodigal/Prodigal, which was designed for Bacteria and Archaea. What is
the genetic code that you would like to use?
Best regards,
Rafael
—
Reply to this email directly, view it on GitHub
<#200 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ANF5JEHQO2CAI3BLO4TZUQ3ZJUWEHAVCNFSM6AAAAABJ7CHTUGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDCOJWGUZDKNJTHA>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
--
*Miguel D. Prieto-Gaez, MD, MSc *
*(He, Him)*
|
Hello @azmigueldario, We released chewBABCA v3.3.8, which adds support for the remaining genetic codes supported by Prodigal (complete list here), including genetic code 6. I tested the new options with Giardia genomes available on the NCBI. I downloaded the reference genome for Giardia intestinalis (GCF_000002435.2) and created Prodigal training files based on that genome and genetic codes 1 and 6. I used the following commands: Genetic code 1:
Genetic code 6:
I then used the reference genome and the training files to create a schema for each genetic code with the CreateSchema module. After that, I downloaded all the Giardia genomes (n=38, 36 Giardia intestinalis, 1 Giardia muris, 1 Giardia lamblia) from the NCBI and performed allele calling with the AlleleCall module to identify new alleles to add to the schemas.
To get an idea about the number of loci that Prodigal might be predicting well, I used the UniprotFinder module to compare the schema loci against the Giardia reference proteomes available on UniProt (n=3, UP000001548, UP000315496, UP000000350). It found annotations for loci in both schemas. Still, the loci in the schema created with genetic code 1 were more similar to what's in the reference proteome for Giardia intestinalis (found proteome annotations for 4,594 loci in the schema created with genetic code 1 and for 3,540 loci in the schema created with genetic code 6). The Giardia muris genome seems to differ considerably from the reference genome for Giardia intestinalis, so the schemas could not classify most CDSs predicted for Giardia muris. Several Giardia intestinalis available on the NCBI seem to be of low quality (e.g. highly fragmented or scaffolded), which can lead to high numbers of missing/non-identified loci for those genomes and a small core genome if you determine the core loci from results including those genomes. Best regards, Rafael |
Thank you very much @rfm-targa for adding the table and taking the time to look into the functionality for Giardia. I will likely stay with the standard table or run both to compare. Thank you very much for all the help. |
I want to use the ideal table for the parasite I am working on but it is not supported. I could not find where the functions inherit the table to modify the code.
Would it be possible and simple to add custom translation tables?
Great tool by the way.
The text was updated successfully, but these errors were encountered: