Skip to content

Should I run geosketch on samples separately? #14

Answered by brianhie
ayeTown asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @ayeTown,

My primary recommendation would be to do DE on the full data, without sketching. Many standard DE tests are efficient with the number of datapoints, so there shouldn't be any needed to perform sketching beforehand.

If you do need to downsample, I would uniformly downsample within each class, since one concern is that data-dependent downsampling will change the distribution of cells, leading to potentially misleading DE results. I think in the case of DE, you should probably just stick to more standard downsampling techniques.

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Answer selected by ayeTown
Comment options

You must be logged in to vote
1 reply
@brianhie
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants