Need unit tests for calc...() popgen functions #474

bhaller · 2024-09-25T17:28:25Z

The need for unit tests for SLiM's popgen functions has been underlined by another discovery of a bug with them (https://groups.google.com/g/slim-discuss/c/Yacfk9EIYeU/m/bc72wVUzBAAJ). I'm not sure how to test them, though. I suppose a test could construct a population with known mutations, placed into the genomes at known positions/frequencies, and then test that the value calculated by the function matches the expected value calculated independently from first principles or by other software. If someone can supply me with a test scenario and an expected value, I can construct a corresponding SLiM test, but I don't have the knowledge necessary to come up with appropriate scenarios and expected values. These test scenarios wouldn't need to be large/complex; even a test with a genome of say, ten base positions long with, say, five mutations present and four diploid individuals (eight genomes) would be quite sufficient to test that the math and logic are correct, I would think. It would be good to have such tests for all of the calc...() functions. Perhaps @npb596 or @petrelharp or @philippmesser could help me with this?

bhaller · 2024-09-25T17:30:27Z

This could take the form of a VCF file and an expected value. My SLiM test could simply load the VCF and check for a match (within reasonable numerical tolerance) to the expected value.

petrelharp · 2024-09-25T18:52:48Z

My recommendation is to not do expected values from theory (if that's what you meant); instead compare to the value calculated independently - either by other software or by a separate, first-principles implementation.

I don't want to take this on right now though - maybe a good student project?

bhaller · 2024-09-25T18:57:36Z

OK. Why not expected values from theory?

petrelharp · 2024-09-25T19:14:14Z

Because that is so much more complicated - you have to worry about statistical power; how close is "close enough"; etcetera. That sort of thing is good for validation, but not so good for unit tests (for one thing you end up having to run a lot of simulatiosn to make sure). What we do in tskit, for instance, is usually just pull up the definition of the thing, then code up some real simple implementation that doesn't worry about efficiency; and compare to that. msprime does have a whole validation.py script that does statistical comparisons to other simulation software; but that's a much messier thing.

bhaller · 2024-09-25T19:32:05Z

Because that is so much more complicated - you have to worry about statistical power; how close is "close enough"; etcetera. That sort of thing is good for validation, but not so good for unit tests (for one thing you end up having to run a lot of simulatiosn to make sure). What we do in tskit, for instance, is usually just pull up the definition of the thing, then code up some real simple implementation that doesn't worry about efficiency; and compare to that. msprime does have a whole validation.py script that does statistical comparisons to other simulation software; but that's a much messier thing.

Aha, I see. Yes, there are certainly problems with doing statistical tests for validation. SLiM already does tons of them, though. But if a precise comparison to the "right answer" is possible, that's certainly better!

bhaller added bug help wanted labels Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need unit tests for calc...() popgen functions #474

Need unit tests for calc...() popgen functions #474

bhaller commented Sep 25, 2024

bhaller commented Sep 25, 2024

petrelharp commented Sep 25, 2024

bhaller commented Sep 25, 2024

petrelharp commented Sep 25, 2024

bhaller commented Sep 25, 2024

Need unit tests for calc...() popgen functions #474

Need unit tests for calc...() popgen functions #474

Comments

bhaller commented Sep 25, 2024

bhaller commented Sep 25, 2024

petrelharp commented Sep 25, 2024

bhaller commented Sep 25, 2024

petrelharp commented Sep 25, 2024

bhaller commented Sep 25, 2024