-
Notifications
You must be signed in to change notification settings - Fork 0
Create a "Diploid Genome" and "Diploid GTF GFF" file (method A)
Kiran N' Bishwa edited this page Jun 11, 2018
·
1 revision
$ bcftools view -s MA625 phasedVCF-short.vcf.gz -U > phasedVCF-short-MA625.vcf
# bgzip the VCF (bcftools needs this exclusively)
$ bgzip phasedVCF-short-MA625.vcf
$ tabix -f phasedVCF-short-MA625.vcf.gz
$ bcftools consensus -c MA625-left.chain -f reference.fasta phasedVCF-short-MA625.vcf.gz -s MA625 -H 1 > MA625-left.fa
$ bcftools consensus -c MA625-right.chain -f reference.fasta phasedVCF-short-MA625.vcf.gz -s MA625 -H 2 > MA625-right.fa
liftOver is separate standalone tool provided by UCSC can can be downloaded as single binary executable file http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/
$ liftOver -gff reference.gff3 MA625-left.chain MA625-left.genome.gff3 MA625-left.unmapped
$ liftOver -gff reference.gff3 MA625-right.chain MA625-right.genome.gff3 MA625-right.unmapped
Step 04 : The new strain based reference fasta and gff files will need additional parsing to add appropriate flags.
for eg:
from
to
This can be done using awk or more simply using python.
bedtools getfasta [OPTIONS] -fi -bed <BED/GFF/VCF> -fo
Competitive alignment can be do on either Reference Transcriptome, Reference Genome or Both.
- Alignment to Diploid Transcriptome can be done using
Bowtie
- Alignment to Diploid Genome can be done using using any reference genome base RNAseq alignment tool.
- Alignment to both Genome and Transcriptome can be done using rnaSTAR.