Subsample features #235

Closed

Justin-J-Miller
Collaborator

Fixes #232 by:

  • Adding reassignment for features.
  • Removing the code in cluster.py that prevented passing both subsample and features.
  • Changing the math around expected array lengths when subsampling features (this was hacky; I would appreciate alternative suggestions).
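
For reference, the length arithmetic in the last point can be sketched as follows. This is only an illustration, assuming that subsampling means plain strided slicing (`a[::stride]`); the helper name `subsampled_lengths` is hypothetical and is not taken from this PR's diff. Under that assumption, a trajectory of `n` frames yields `ceil(n / stride)` frames.

```python
import numpy as np


def subsampled_lengths(lengths, stride):
    """Expected number of frames per trajectory after taking every
    `stride`-th frame (hypothetical helper, not from the PR)."""
    lengths = np.asarray(lengths)
    # Ceiling division using integers only: ceil(n / stride) == -(-n // stride)
    return -(-lengths // stride)


# Two ragged feature trajectories with 10 and 7 frames, 3 features each.
features = [np.zeros((10, 3)), np.zeros((7, 3))]
stride = 3

expected = subsampled_lengths([len(f) for f in features], stride)
actual = np.array([len(f[::stride]) for f in features])

# The closed-form lengths agree with what slicing actually produces.
assert (expected == actual).all()
```

Computing the lengths in closed form avoids materializing the subsampled arrays just to size an output buffer, which matters when the trajectories are ragged.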

I wonder if it's also worth looking at file handling for features files? I noticed the loader expects an h5 file if only one set of features is passed, but a series of numpy files if multiple are passed. @justinrporter do you know why this was preferred? I could try to add some file extension logic, as it might be nice to pass multiple h5 files (e.g. multiple ragged datasets) or a single numpy file (a single homogeneous dataset). Happy to do this in the same PR or a separate one.
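
A rough sketch of what the extension logic could look like, independent of how many files are passed. The function name `feature_file_kind` and the accepted extensions are assumptions made for illustration; this is not the project's existing loader, and it only classifies paths without reading them.

```python
import os

# Assumed extension sets; adjust to whatever the loader actually supports.
H5_EXTS = {".h5", ".hdf5"}
NPY_EXTS = {".npy"}


def feature_file_kind(paths):
    """Return 'h5' or 'npy' for a list of feature file paths.

    Hypothetical helper: the format is chosen from the file extension
    rather than from the number of files passed.
    """
    kinds = set()
    for path in paths:
        ext = os.path.splitext(path)[1].lower()
        if ext in H5_EXTS:
            kinds.add("h5")
        elif ext in NPY_EXTS:
            kinds.add("npy")
        else:
            raise ValueError(f"Unrecognized feature file extension: {path!r}")
    if len(kinds) != 1:
        raise ValueError("Expected one or more feature files of a single type.")
    return kinds.pop()
```

With this, `feature_file_kind(["a.h5", "b.h5"])` and `feature_file_kind(["x.npy"])` would both be accepted, and a mixed list would be rejected early with a clear error.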

@Justin-J-Miller
Collaborator Author

Temporarily closing this as there are more changes that should be added.

Development

Successfully merging this pull request may close these issues.

Allow for subsampling of featured trajectories