-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing data #99
Comments
With my MDF hat on: With my IBI hat on: |
But what if the field is non-optional?
Its actually two problems:
|
That case is somewhat hypothetical. We tried to make only those fields mandatory that are really necessary to use the data. Regarding the concrete field: Maybe we should just add |
We have |
From my experience I can say that we occasionally have to deal with missing data, e.g. when converting measurements of our Bruker system, which e.g. does not store the size of the delta sample, but just its volume. Our current approach is to then assign some non nonsensical value like e.g. 0x0x0 mm³ to the size. However, would it not be better to have a standardized and documented way of dealing with missing data?
In case where a whole data set is unknown or missing we could use empty data sets as proposed here https://docs.h5py.org/en/stable/high/dataset.html and here https://support.hdfgroup.org/HDF5/Tutor/crtdat.html.
In case where only single elements of a data set are unknown we could use
NaN
https://stackoverflow.com/questions/33656043/hdf5-how-to-handle-empty-rows.I think former or later every group using the format will likely run into this issue and this could help keep the number of undocumented solutions at bay.
The text was updated successfully, but these errors were encountered: