-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use of paired end illumina data for trio bining with meryl #122
Comments
You can run Note there is a slight mistake in the child count command - the output should be |
Thank you for your quick reply.
And I don't have illumina for F1. I was confused because as per the manual below
Child data was generated using So is it necessary to generate child k-mer count file for trio. |
As it says in the readme, You could also use the HiFi data as the child, definitely not the simplex ONT data. |
I ran the command as mentioned in the read me. But the input for verkko is missing . But two files were not produced It produced |
Yes, that is normal, as I said before, the file names are only.meryl in the case of no child input. |
Thank you for the quick reply and support. Is the command wrong .Because when i run the command it shows the help page with |
The command is fine but it appears you didn't build the initial maternal/paternal databases with the compressed option. This is done before you run hapmers.sh. How did you build the initial meryl databases? Try running |
I used suggested command to generate the file
|
Yeah those are definitely not compressed databases, there shouldn't be any homopolymers runs. What's the version of meryl? Did you first remove all previous output meryl databases before running the commands? Try removing all your meryl databases and running the command for just one of the parents and post the full log it generates here. |
log.txt |
Unfortunately it seems the meryl log doesn't report whether it's doing compressed or uncompressed counts. Older versions of meryl that didn't support compression would complain that the parameter is invalid. Newer versions all worked correctly for me as long as That is:
and
as two separate commands. meryl --version will report the version. The compressed DB should look the same just not have any homopolymers tracts, e.g.:
|
I traced the problem to be with the tool verkko. It has nothing to do with the meryl . |
Your conclusion is incorrect, verkko can work with our without any child data. Since the input to it is just two meryl databases, it has no way to know if they are constructed from just the parents, the parents + children, or even reads binned using HiC or Strand-seq data as we did in the manuscript. The initial databases you had were not homopolymer compressed which verkko definitely cannot work with. It is likely your second run was properly compressed and thus the run worked. We did recently see a bug with meryl when empty files are passed that databases are not compressed: marbl/meryl#31, perhaps this caused your issue building DBs. |
Hello, I'm also having the same issue with meryl-1.4. Running
I tried removing everything and rebuilding them, but it did not help |
Compiling meryl from the sources helped to solve this issue, before I was using the released binaries. |
PROCESSING TREE #1 using 128 threads. |
Hi,
i wanted to use the tool for de novo assembly using trio binning approach.
I have paired end parental reads from illumina , F1 hifi reads and F1 nanopore reads .
I was wondering how to use the below commands mentioned in the manual of verkko.
meryl count compress k=30 threads=XX memory=YY maternal.*fastq.gz output maternal_compress.k30.meryl
meryl count compress k=30 threads=XX memory=YY paternal.*fastq.gz output paternal_compress.k30.meryl
So should i just cat the R1 and R2 of paired end reads to genetare a single file for both the parents and then run the command.
like cat R1 R2 > Maternal.fastq.gz
cat R1 R2 > Paternal.fastq.gz
and the use the above command.
Or should i use count forward and count reverse and the union command to generate the maternal_compress.k30.meryl and
Paternal_compress.k30.meryl.
The text was updated successfully, but these errors were encountered: