
Advice on memory usage? #2090

Open
bl24 opened this issue Mar 4, 2025 · 3 comments

Comments


bl24 commented Mar 4, 2025

Hi, I'm working on a single-end amplicon dataset with three libraries. Here are the results of the filtering step:

```
                            reads.in  reads.out
mimhigh_S4_R2_001.fastq.gz   7081626    6963273
mimlib_S5_R2_001.fastq.gz   10966462   10890100
mimlow_S5_R2_001.fastq.gz    6390801    6305400
```
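(For reference, output in this shape is what `dada2::filterAndTrim()` returns; a minimal single-end sketch follows, in which the file names come from the table above but the `truncLen`/`maxEE` values are placeholder assumptions, not the settings actually used here:)

```r
library(dada2)

# Single-end inputs named as in the table above; "filtered/" is a
# hypothetical output directory.
fns   <- c("mimhigh_S4_R2_001.fastq.gz",
           "mimlib_S5_R2_001.fastq.gz",
           "mimlow_S5_R2_001.fastq.gz")
filts <- file.path("filtered", fns)

# Placeholder filtering parameters -- truncLen and maxEE are assumptions.
out <- filterAndTrim(fns, filts,
                     truncLen = 200, maxEE = 2, truncQ = 2,
                     multithread = TRUE)
out  # matrix with reads.in / reads.out columns, as above
```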

For the samples labeled mimhigh and mimlow, I was able to run the error-rate learning and dada steps fine; they only took a few hours using my normal HPC settings (28 CPUs / 4GB per CPU). I've been trying for days now to process mimlib with different memory settings, with no luck. At one point I ran an HPC job with 6 CPUs / 32GB per CPU for 24 hours, and that wasn't enough to finish the analysis.

I know mimlib has a lot more data, but is it normal for it to require so much more computing power?


benjjneb (Owner) commented Mar 5, 2025

I'm not sure exactly what your "CPUs" number means, but DADA2 is not internally parallelized across compute nodes. It will use all the threads available to it (if multithread=TRUE) within a single compute node, but it cannot and does not use multiple compute nodes or physically discrete CPUs.

Perhaps this addresses your question? It also relates to the memory issue: what you want is a single compute node with more memory, perhaps 64GB. Adding more nodes with 4GB apiece won't help.
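In practice that means requesting one node and pointing DADA2's `multithread` argument at the cores on that node; a minimal sketch (the file path is a placeholder, not from this thread):

```r
library(dada2)

filt <- "filtered/mimlib_S5_R2_001.fastq.gz"  # placeholder path

# multithread = TRUE uses every core visible on the current node;
# passing an integer instead caps the thread count.
err <- learnErrors(filt, multithread = TRUE)
dd  <- dada(filt, err = err, multithread = TRUE)
```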


bl24 (Author) commented Mar 5, 2025

I'm submitting the analysis as a SLURM job, so by "CPUs" I'm referring to ntasks, which I think corresponds to the number of cores my job runs on. I've only been submitting to a single compute node, because I saw you mention in a previous issue that DADA2 doesn't compute across nodes.
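(One way to keep the SLURM request and DADA2 in agreement is sketched below. It assumes the job asks for one task with --cpus-per-task, since ntasks counts tasks rather than cores; the allocation numbers and file path are illustrative only:)

```r
# Sketch assuming an allocation along the lines of:
#   sbatch --nodes=1 --ntasks=1 --cpus-per-task=28 --mem=64G
# SLURM exports SLURM_CPUS_PER_TASK inside the job.
library(dada2)

n_threads <- as.integer(Sys.getenv("SLURM_CPUS_PER_TASK", unset = "1"))

err <- learnErrors("filtered/mimlib_S5_R2_001.fastq.gz",  # placeholder path
                   multithread = n_threads)
```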


benjjneb (Owner) commented Mar 6, 2025

Try a 64GB allowance then? The memory requirements of the core DADA2 algorithm scale roughly with the square of the number of unique sequences in the data. So your 10M-read library versus the 6M-read ones, if it's the same kind of sample, would be expected to require about (10/6)^2 ≈ 3x more memory.
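(Plugging the filtered read counts from the table above into that quadratic rule of thumb, a rough back-of-the-envelope calculation that assumes unique-sequence counts scale in proportion to read counts:)

```r
# Filtered read counts (reads.out) from the table above.
reads_mimlib <- 10890100
reads_mimlow <-  6305400

# If memory scales with (unique sequences)^2, and unique sequences
# grow roughly with reads, the expected memory ratio is:
(reads_mimlib / reads_mimlow)^2  # ~3x the memory of the smaller library
```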
