-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fail to assembly de-novo genome #2
Comments
Hi, Thanks for trying VStrains and sorry for the late reply,
Feel free to let me know if there are further questions, |
I've also tried --rnaviral option of spades and there are two types of outcome with VStrains in case kmer=0:
Attaching logs and files for both cases: Second: |
I think it is likely due to the input graph is already disconnected since inferred k-mer size=0 and only 3 rows in the GFA file, and VStrains does not handle disconnected graphs (no edges) since no further graph simplification can be done post SPAdes --rnaviral. I've pushed the changes to make sure VStrains exit when such case exists. The key reason for running forever is due to paired-link inference attempts to infer (k+1)-mer linkage from paired-end reads, when k=0, inference step infers all 1-mer linkage, such search space is infeasible and won't provide any useful information. |
Hello VStrains team, thank you for developing such a great tool, but while using it, I faced the following problem.
I attempted to assemble the complete HIV genome from this sample: SRR29407826. I used corona-spades and it worked fine.
However VStrains crashes when assembling it.
First, there was an error related to rev_dict in VStrains_PE_Inference.py. It doesn't had lowercase nucleotides in it and therefore raised KeyError. I fixed it replacing:
rev_dict = {"A": "T", "T": "A", "C": "G", "G": "C"}
with this:
rev_dict = { "A": "T", "T": "A", "C": "G", "G": "C", "a": "t", "t": "a", "c": "g", "g": "c" }
But new issue occurred, after messages in CLI log:
It freezes forever and do not proceed any further.
Worth mentioning details
In the same log there is a suspicious message:
Also VStrains can't read assembly_graph_after_simplification.gfa file (which is the output of spades) without changing its version in header from 1.2 to 1.0 manually.
Steps to reproduce
Files with reads:.
SRR29407826.zip
VStrains log:
vstrains.log
Spades log:
spades.log
The text was updated successfully, but these errors were encountered: