meta-features store vector of numbers instead of aggregates #16

Open
janvanrijn opened this issue Oct 5, 2018 · 0 comments

Currently we store (for numeric columns):

  • Mean X of numeric atts
  • Stdev of X of numeric atts
  • Quartile {1, 2, 3} of X of numeric atts
  • Min of X of numeric atts
  • Max of X of numeric atts

where X is one of {mean, stdev, kurtosis, skewness}, each computed per attribute. Something similar is stored for the information-theoretic measures of nominal atts.

This selection of aggregates is arbitrary and not well supported in the literature.

It would be much better to store, for each X, the full vector of per-attribute values, so that researchers can calculate whichever aggregates they need client-side.
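As a rough illustration of what client-side calculation could look like, the sketch below rebuilds the currently stored aggregates from one per-attribute vector. The function name `aggregate`, the dictionary keys, and the example vector are all hypothetical. The choice of sample standard deviation and inclusive quartile interpolation is also an assumption, since the issue does not say which definitions the server uses.

```python
import statistics


def aggregate(values):
    """Compute the aggregates listed above from a per-attribute vector.

    `values` holds one number per numeric attribute, e.g. the skewness
    of each attribute. Needs at least two values.
    """
    # Assumption: inclusive interpolation; the server-side method is not stated.
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "mean": statistics.mean(values),
        # Assumption: sample standard deviation (n - 1 in the denominator).
        "stdev": statistics.stdev(values),
        "quartile1": q1,
        "quartile2": q2,
        "quartile3": q3,
        "min": min(values),
        "max": max(values),
    }


# Hypothetical stored vector: skewness of each numeric attribute in a dataset.
skewness_per_attribute = [0.0, 1.0, 2.0, 3.0, 4.0]
summary = aggregate(skewness_per_attribute)

# With the raw vector available, any other aggregate is equally easy,
# for example the interquartile range, which is not stored today.
iqr = summary["quartile3"] - summary["quartile1"]
```

With the vector stored, adding or changing an aggregate becomes a client-side change rather than a change to what the server computes.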
