Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Invalid genotypes annotate_ase #2

Open
illarionovaanastasia opened this issue Apr 29, 2021 · 3 comments
Open

Invalid genotypes annotate_ase #2

illarionovaanastasia opened this issue Apr 29, 2021 · 3 comments

Comments

@illarionovaanastasia
Copy link

illarionovaanastasia commented Apr 29, 2021

Hi!
I am trying to run annotation of calc_ase output. The command I am using for this:
annotate_ase -f sample.vcf.gz -i calc_ASE/sample.tsv -b gencode.bed -o annot_ase.tsv
The run fails during a Fetching genotypes step with a following error:

Traceback (most recent call last):
  File "/home/ubuntu/miniconda3/bin/annotate_ase", line 33, in <module>
    sys.exit(load_entry_point('LORALS==0.0.0', 'console_scripts', 'annotate_ase')())
  File "/home/ubuntu/miniconda3/lib/python3.8/site-packages/LORALS-0.0.0-py3.8.egg/lorals/scripts.py", line 604, in annotate_ase
  File "/home/ubuntu/miniconda3/lib/python3.8/site-packages/LORALS-0.0.0-py3.8.egg/lorals/annotate.py", line 223, in annotate_genotypes
  File "/home/ubuntu/miniconda3/lib/python3.8/site-packages/LORALS-0.0.0-py3.8.egg/lorals/features.py", line 173, in fromvcf
  File "/home/ubuntu/miniconda3/lib/python3.8/site-packages/LORALS-0.0.0-py3.8.egg/lorals/features.py", line 193, in __init__
  File "/home/ubuntu/miniconda3/lib/python3.8/site-packages/LORALS-0.0.0-py3.8.egg/lorals/features.py", line 165, in _valid_genotype
ValueError: invalid genotypes

The genotype VCF was obtained with a process_vcf.sh script from a correctly formatted phased VCF.
I would really appreciate any feedback.

Best regards,

Anastasia

@Marijn-Schipper
Copy link

Hi Anastasia,

I had a similar issue. Does your VCF file contain genotypes other than 0|1, 1|0, 1|1?
1|2 for example? Removing these solved the issue for me.
You could always temporarily reannotate 2|1 and 1|2 variants as 1|0 and 0|1, and revert this in the output.

Kind regards,
Marijn

@DafniG
Copy link
Contributor

DafniG commented May 20, 2021

Hi both,
Sorry for the delay in responding. Thank you Marjin for the suggestion. Could you confirm Anastasia whether the problem was fixed by changing the genotypes? If so, I can updated both the process_vcf.sh script and I will update the instructions to reflect it.
Thank you,
Dafni

@illarionovaanastasia
Copy link
Author

Hi both,

Thank you for the responses. I also have figured out that the issue is in the unexpected format of genotypes in the input VCF and edited the script. I confirm that it solved the problem.
Thank you!

Best regards,

Anastasia

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants