-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Help with using bioconda version of jaffa #57
Comments
Hi, If you let me know how long your reads are, I'm also happy to advise on which "mode" of jaffa to use. Assembly is usually not the best option for most data. Hope some of this is helpful. Cheers, |
Thank you Nadia I'll let you know how your suggestions go. If you don't mind what mode of jaffa should I be using if I am dealing with RNA-seq from FFPE? the reads are normally 125 bp long but after trimming they might average around 100 with some being less. |
I think the direct pipeline should work best for your data. You could also try it without trimming as the pipeline is pretty robust to bad sequence at the ends. Cheers, |
Hi Nadia I am trying to install JAFFA directly but its having trouble installing the following files: WARNING: extract_seq_from_fasta could not be found!!!! You will need to download and install extract_seq_from_fasta manually, then add its path to tools.groovy WARNING: minimap2 could not be found!!!! You will need to download and install minimap2 manually, then add its path to tools.groovy |
Actually I just the program anyway despite the warnings and it looks like there is an output but I can't see the csv file. I only see bam files, fasta files and paf files. While this does show what reads are discordant pairs it is difficult for me to see what fusions are found from running the program. I was wondering am I supposed to get a summary csv file? |
These were the error messages: =========================== Stage align_reads_to_annotation (CC65CANXX) ============================ real 17m52.671s =============================== Stage filter_transcripts (CC65CANXX) =============================== /home/danwoo/bin/JAFFA-version-2.1/tools/bin/process_transcriptome_align_table CC65CANXX/CC65CANXX.paf 1000 /home/danwoo/bin/JAFFA-version-2.1/hg38_genCode22.tab > CC65CANXX/CC65CANXX.txt ========================================= Pipeline Failed ========================================== One or more parallel stages aborted. The following messages were reported: -------------------------------- filter_transcripts ( CC65CANXX ) -------------------------------- Command in stage filter_transcripts failed with exit status = 134 : /home/danwoo/bin/JAFFA-version-2.1/tools/bin/process_transcriptome_align_table CC65CANXX/CC65CANXX.paf 1000 /home/danwoo/bin/JAFFA-version-2.1/hg38_genCode22.tab > CC65CANXX/CC65CANXX.txt |
Hi, Cheers, |
Hi Nadia Unfortunately I'm not getting any hits with the command you sent. Regards, |
Oh perhaps it's Can you also paste the result of Cheers, |
head CC65CANXX/CC65CANXX.paf HWI-D00119:248:CC65CANXX:3:1101:10000:74092/1 125 24 113 minus hg38_wgEncodeGencodeCompV22_ENST00000469930.1__range=chr7:140834061-140924709__5'pad=0__3'pad=0__strand=-__repeatMasking=none 1058 1058 969 90 90 167 |
tail CC65CANXX/CC65CANXX.paf HWI-D00119:248:CC65CANXX:3:2316:9999:41567/1 125 11 89 minus hg38_wgEncodeGencodeCompV22_ENST00000494445.1__range=chr3:186787314-186789291__5'pad=0__3'pad=0__strand=+__repeatMasking=none 629 415 337 79 79 147 |
grep -C4 "HWI-D00119:24" CC65CANXX/CC65CANXX.fasta results in many lines of sequence reads so I think it would be too long to copy here. |
Thanks, these look okay to me. Can you do a little experiment for me: Also, which version of gcc do you have (gcc --version). |
From running the command I get: also gcc version is 4.8.5 |
Thanks for you quick replies. How big is your whole CC65CANXX/CC65CANXX.paf file? Would it be too big to email? Also, were you able to run the demo data okay? Thanks for your patience with all the questions! |
It doesn't look too big to email so if you provide me with an address I can send it to you. I have yet to actually run the demo data |
email is: [email protected] |
Thanks, I was able to reproduce the error with your file. It looks like one the libraries that JAFFA uses fails for gcc version 4.8. If you can switch to a higher gcc version, I think this will resolve the problem. I’ve tested with gcc 6.3 and 8.2 and it works okay with both. You would need to remove the “bin” directory under “tools” where you installed JAFFA and then rerun the script, install_linux64.sh. Let me know how you go with it. Hopefully you have a newer version of gcc available? |
hi JAFFA team, thanks for outstanding job, ======================= Stage filter_transcripts (merged_enrichment_sAML1b) ======================== by running which gcc : Running gcc -v: thanks in advance |
Thanks for reporting this. Do you also get this error on other datasets (e.g. the example data)? Other versions of gcc (e.g. 5 or 6)? Did you reinstall after upgrading gcc? To reinstall you'd need to remove /tools/bin/process_transcriptome_align_table and then rerun /install_linux64.sh Cheers, |
@nadiadavidson Thanks |
Hi, I suspect you'd get the same error with the example dataset, but it could be useful to know that for sure, otherwise it would be difficult for me to reproduce the error. If you are running on a server/cluster could you use the gcc version installed on there rather than conda's version? If all else fails I can probably add a static binary to the repository/next release to avoid compilation. Cheers, |
Hi, |
Overlaps with Issue #74 , so will close this one. |
Hi
So far I have had no success installing JAFFA directly based on the instructions since I am on a shared server and do not have root access or privileges. So I tried installing it through anaconda but after the installation the files appear to be named different than if you were to download the package from the github wiki.
So to take a wild guess I thought the jaffa-assembly script would do what I wanted. Turns out the main problem I having here is that it can't seem to find the reference annotation files. Now since this is anaconda I am not sure which directory to download these files into. Do you think you can guide me on usage of jaffa-assembly?
here is the error message I get:
echo "Running JAFFA version 2.1" ; echo "Checking for required data files..." ; for i in null/hg38_genCode22.fa null/hg38_genCode22.tab /scratch_local/708254.1.short.q/tmp.pLyPvY
R68t/known_fusions.txt null/hg38.fa null/Masked_hg38.1.bt2 null/hg38_genCode22.1.bt2 ; do ls $i 2>/dev/null || { echo "CAN'T FIND $i..." ; echo "PLEASE DOWNLOAD and/or FIX
PATH... STOPPING NOW" ; exit 1 ; } ; done ; echo "All looking good" ; echo "running JAFFA version 2.1.. checks passed" > checks
echo "Running JAFFA version 2.1" ; echo "Checking for required data files..." ; for i in null/hg38_genCode22.fa null/hg38_genCode22.tab /scratch_local/708254.1.short.q/tmp.pLyPvY
R68t/known_fusions.txt null/hg38.fa null/Masked_hg38.1.bt2 null/hg38_genCode22.1.bt2 ; do ls $i 2>/dev/null || { echo "CAN'T FIND $i..." ; echo "PLEASE DOWNLOAD and/or FIX
PATH... STOPPING NOW" ; exit 1 ; } ; done ; echo "All looking good" ; echo "running JAFFA version 2.1.. checks passed" > checks
The text was updated successfully, but these errors were encountered: