Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Making database for sugarcane ends without warning or error #101

Open
Windyxia11 opened this issue Nov 6, 2024 · 0 comments
Open

Making database for sugarcane ends without warning or error #101

Windyxia11 opened this issue Nov 6, 2024 · 0 comments

Comments

@Windyxia11
Copy link

Hi Pauline,
When I used SIFT4G_Create_Genomic_DB to create the sugarcane database, I encountered a problem. While creating databases for each chromosome in sugarcane, some chromosomes successfully created databases, but others did not. There were no warning or error messages for the failures. I don't know why this is happening. Here are the outputs when the process succeeded and failed:
the succeeded output message:

** Checking query data and substitutions files **

  • processing queries: 100.00/100.00% *

** Searching database for candidate sequences **

  • processing database part 364 (size ~0.25 GB): 100.00/100.00% *

** Aligning queries with candidate sequences **

  • processing database part 91 (size ~1.00 GB): 100.00/100.00% *

** Selecting alignments with median threshold: 2.75 **

  • processing queries: 100.00/100.00% *

** Generating SIFT predictions with sequence identity: 100.00% **

  • processing queries: 100.00/100.00% *

the failed output message:

** Checking query data and substitutions files **

  • processing queries: 100.00/100.00% *

** Searching database for candidate sequences **

  • processing database part 364 (size ~0.25 GB): 100.00/100.00% *

Uniref90 was utilized for the database creation.
The config file:

GENETIC_CODE_TABLE=1
GENETIC_CODE_TABLENAME=Standard
MITO_GENETIC_CODE_TABLE=11
MITO_GENETIC_CODE_TABLENAME=Plant Plastid Code

PARENT_DIR=/xtdisk/apod/xiehx/Deleterious_variants/SIFT/Saccharum/SIFT_Database/Chr8D
ORG=Saccharum_spontaneum
ORG_VERSION=Np-X

#Running SIFT 4G
SIFT4G_PATH=/gpfs/biosoft/app2/python2024/envs/sift4g/bin/sift4g
PROTEIN_DB=/xtdisk/apod/xiehx/Deleterious_variants/SIFT/Saccharum/SIFT_Database/config/uniref90.fasta

Sub-directories, don't need to change

GENE_DOWNLOAD_DEST=gene-annotation-src
CHR_DOWNLOAD_DEST=chr-src
LOGFILE=Log.txt
ZLOGFILE=Log2.txt
FASTA_DIR=fasta
SUBST_DIR=subst
ALIGN_DIR=SIFT_alignments
SIFT_SCORE_DIR=SIFT_predictions
SINGLE_REC_BY_CHR_DIR=singleRecords
SINGLE_REC_WITH_SIFTSCORE_DIR=singleRecords_with_scores
DBSNP_DIR=dbSNP

Doesn't need to change

FASTA_LOG=fasta.log
INVALID_LOG=invalid.log
PEPTIDE_LOG=peptide.log
ENS_PATTERN=ENS
SINGLE_RECORD_PATTERN=:change:_aa1valid_dbsnp.singleRecord

Could you give me some suggestions and help?
Thank you very much for your advice and time!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant