Hi @brianhie
Replies: 2 comments 1 reply
Hi @ayeTown,
My primary recommendation would be to do DE on the full data, without sketching. Many standard DE tests scale well with the number of data points, so there shouldn't be any need to sketch beforehand.
If you do need to downsample, I would downsample uniformly at random within each class. One concern is that data-dependent downsampling will change the distribution of cells, which could lead to misleading DE results. For DE, you should probably just stick to more standard downsampling techniques.
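For reference, uniform downsampling within each class could look roughly like the sketch below. It is only an illustration, not part of geosketch: the function name `uniform_downsample_per_class`, the `n_per_class` cap, and the toy labels are all made up for the example, and "class" is whatever grouping label you pick (cell type, condition, sample ID).

```python
import numpy as np

def uniform_downsample_per_class(labels, n_per_class, seed=0):
    """Return sorted indices of cells drawn uniformly at random within each class.

    Hypothetical helper for illustration; classes with fewer than
    n_per_class cells are kept in full.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    keep = []
    for cls in np.unique(labels):
        idx = np.flatnonzero(labels == cls)   # positions of cells in this class
        n = min(n_per_class, idx.size)        # never ask for more cells than exist
        keep.append(rng.choice(idx, size=n, replace=False))
    return np.sort(np.concatenate(keep))

# Toy labels (made up): three classes of very different sizes.
labels = np.array(["T cell"] * 500 + ["B cell"] * 300 + ["NK"] * 40)
kept = uniform_downsample_per_class(labels, n_per_class=100)
```

The returned indices can then be used to subset the expression matrix and metadata (e.g. `X[kept]`) before running the DE test. Because the draw is uniform within each class, it does not depend on where cells sit in expression space, which is the property that data-dependent sketching lacks.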
Can you clarify what you mean by "within each class"? Do you mean within each cell type? I was thinking of using geosketch on each sample ID (each a different biological replicate) separately, so that the distribution of cell types would not change significantly within each sample.