Create t-test comparison across clustered features. #56

EricMartin827 · 2020-04-01T18:34:55Z

To evaluate how well a clustering algorithm produces classifiable features, we need a way to test for statistically significant differences in cluster assignments across classes. Write a t_test function which measures the difference in cluster assignment over all features between healthy control patients and schizophrenic positive patients.

If there are X healthy and Y schizophrenic patients with D features (clusters assigned to time window/interval), then this function will produce a 1-D array of p_values comparing cluster assignment means between the two sets of patients.

bbradt · 2020-04-02T12:49:06Z

looks good!

It would be cool if you could generalize the T-Test so that we can also apply it to the cluster-centers between classes. For clustering, I get out a set of K cluster centers in COMPONENT x COMPONENT space. If I take instances belonging to only one class within one cluster, and do a two-tailed t-test between these class-specific instances, I should get backed a COMPONENT x COMPONENT significance matrix, that will show us differences within the clusters themselves.

This isn't necessarily useful for informing supervised learning, but it's something we do to evaluate differences between the populations, so it's worth doing if it's not too difficult.

EricMartin827 self-assigned this Apr 1, 2020

bbradt mentioned this issue Apr 2, 2020

Visualization of Cluster Centers across methods #60

Open

bbradt added enhancement New feature or request supervised unsupervised labels Apr 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create t-test comparison across clustered features. #56

Create t-test comparison across clustered features. #56

EricMartin827 commented Apr 1, 2020

bbradt commented Apr 2, 2020

Create t-test comparison across clustered features. #56

Create t-test comparison across clustered features. #56

Comments

EricMartin827 commented Apr 1, 2020

bbradt commented Apr 2, 2020