Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Read mapping is very slow on diploid human genome assembly #28

Open
jeizenga opened this issue Jun 15, 2023 · 0 comments
Open

Read mapping is very slow on diploid human genome assembly #28

jeizenga opened this issue Jun 15, 2023 · 0 comments

Comments

@jeizenga
Copy link

I tried to use VerityMap to validate a diploid human genome assembly using HiFi reads, but on my data it was too slow to be practical. I let it run for >3 weeks one 16 threads, and it only mapped up to about 4x. Is this speed expected? Are there any tweaks I can make to increase it?

The command I ran was

python3 main.py --reads reads.fastq.gz -o verity_map_output -t 16 -d hifi-diploid \
    assembly.haplotype1.fasta assembly.haplotype2.fasta

Another question/request: I understand from the paper that VerityMap also includes analysis modules to detect the location of misassemblies. As far as I can see, these can only be accessed after read mapping concludes (I believe the relevant code is here). Is this correct? It would be useful if the interface allowed a more modular option that could be run independently of mapping, especially since it seems like I will need to troubleshoot the mapping stage.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant