Major refactoring for upstream changes / blocks #92

Lestropie · 2024-04-26T05:13:33Z

Replaces #90.

Resolving the BEP against the absence of both complex inheritance and suffix distinction is not as simple as removing complex inheritance and swapping suffixes; attempting to do so just breaks everything around it. But given precedent, I'm pessimistic about my ability to convey this convincingly. So I'm better off making the changes necessary to conform to the main structural constraints (no changes to the inheritance principle, single fixed suffix) in a way that will both retain compatibility with extension beyond the most trivial of diffusion models and hopefully not be objectionable on other bases.

I've ended up making a much broader scope of changes than just "doing #90 differently", as there are too many moving parts to handle in isolation. I'll try to break up my description here into aspects that address those core structural constraints vs. other things that, with greater or lesser necessity, got changed along the way. I can't finish this off right now as I have other urgent tasks, but hopefully this is enough to elicit opinions.

Primary structure

One sidecar file per model parameter data file; no advanced inheritance principle required (and no smuggling in complex inheritance by other means).
Distinguish between "model metadata" and "parameter metadata".
(Serves a similar purpose to the former MFP / MDP distinction, but applies exclusively in the context of metadata, not image data.)
Duplicate model metadata across all sidecars.
Use a metadata sub-dictionary to separate metadata applicable to a model as a whole from metadata applicable to specific parameters.
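
As a rough illustration of the layout described above, the sketch below builds one sidecar per parameter image, copying the model metadata into each and keeping parameter-specific fields in their own sub-dictionary. All key names (`Model`, `Parameter`, `FitMethod`, `NonNegativity`), their values, the `_model-` entity in the filename pattern, and the helper names are hypothetical placeholders, not the fields the BEP defines.

```python
import copy
import json

# Hypothetical metadata; key names and values are placeholders, not BEP-defined fields.
MODEL_METADATA = {
    "Description": "Diffusion tensor",
    "FitMethod": "ols",
}

PARAMETER_METADATA = {
    "fa": {"Description": "Fractional anisotropy", "NonNegativity": "constrained"},
    "md": {"Description": "Mean diffusivity", "NonNegativity": "regularized"},
}


def build_sidecar(param):
    """Return the complete sidecar contents for one parameter image."""
    return {
        # Model metadata is duplicated verbatim into every sidecar.
        "Model": copy.deepcopy(MODEL_METADATA),
        # Parameter metadata is specific to this one image.
        "Parameter": copy.deepcopy(PARAMETER_METADATA[param]),
    }


def sidecar_name(subject, model, param):
    """Assumed filename pattern using a single fixed suffix."""
    return f"sub-{subject}_model-{model}_param-{param}_dwimap.json"


def build_all(subject, model):
    """Map each sidecar filename to its JSON text."""
    return {
        sidecar_name(subject, model, p): json.dumps(build_sidecar(p), indent=2)
        for p in PARAMETER_METADATA
    }
```

Because every sidecar is complete on its own, a reader never has to resolve inheritance to interpret a file; the cost is the duplication discussed in this PR.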

Other changes

More extensive use of metadata sub-dictionaries.
In particular, those metadata fields specific to the interpretation of data across a NIfTI image axis as encoding some anisotropic information have been collated into their own sub-dictionary.
More explicit about image axes used for orientation encoding vs. bootstrapping
Having to try to make the bedpostx example more mature, and thinking about future orientation encoding types, I deemed it necessary to be explicit about which image axes encode orientation information vs. different bootstrap realisations of a model. There are some images that have orientation information but no bootstrapping, some that have bootstrap realisations but no orientation information. Then, in the future, there are orientation encoding types that will require multiple image axes (I'm thinking SHARD and DSI PDFs). So I'm not convinced that saying "bootstrapping comes after orientation stuff" is adequate, and would prefer to be explicit about both.
Removal of fields about which gradients / shells were utilised in the model fit (DWI directions utilised in model #48).
I don't think it makes sense to specify which volumes / shells were utilised, but not what image data were used, especially when the former was already incomplete. I tried adding the latter, but it started to look an awful lot like provenance.
Replace "data representations" / "orientation representation" with "orientation encoding" ("OrientationRepresentation" rename #91).
Non-negativity is not a constraint of a model as a whole; it is a constraint that may be applied to individual parameters of that model. Therefore I've stored it on an individual parameter basis, rather than being a single field applicable to the model as a whole.
Removed reserved labels of models and parameters.
I think these do more harm than good: they make the whole ecosystem inflexible, might be misinterpreted as not supporting anything outside of that scope, and run into a weird situation where the corresponding entities are neither arbitrary labels nor a finite restricted set of options. If the demonstrative examples are moved out of the document, then their scope could be increased to exemplify some of these.
The counter to this would be that by controlling the entity labels, specific parametric maps generated by one pipeline could be robustly identified by another. While I'm interested in this myself, I'm not sure that partial control of entity labels solves that problem without consequence. Elsewhere there's discouragement of placing too much automated interpretation on entity labels (eg. _dir- to discover phase encoding information). And for such abbreviated labels, it is hard to dictate that a dataset would be non-conformant if some App / user were to store an image with a two-character label that encodes a parameter other than the one that the specification attributes to that two-character label.
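
To make the orientation-versus-bootstrap point above concrete, here is a minimal sketch of a consistency check in which every non-spatial image axis must be given an explicit role. The function name, argument names, and the rule that an undeclared non-singleton axis is rejected are assumptions for illustration only; the BEP's actual metadata fields may differ.

```python
def check_axis_roles(shape, orientation_axes=(), bootstrap_axis=None):
    """Return a dict mapping each declared non-spatial axis index to its role.

    Axes 0-2 are treated as spatial. Any later axis may encode orientation
    information, bootstrap realisations, or neither (if its extent is 1).
    """
    roles = {}
    for axis in orientation_axes:
        if not 3 <= axis < len(shape):
            raise ValueError(f"orientation axis {axis} is not a non-spatial axis")
        roles[axis] = "orientation"
    if bootstrap_axis is not None:
        if not 3 <= bootstrap_axis < len(shape):
            raise ValueError(f"bootstrap axis {bootstrap_axis} is not a non-spatial axis")
        if bootstrap_axis in roles:
            raise ValueError("an axis cannot encode both orientation and bootstrap")
        roles[bootstrap_axis] = "bootstrap"
    # A non-singleton axis with no declared role would be ambiguous to a reader.
    for axis in range(3, len(shape)):
        if shape[axis] > 1 and axis not in roles:
            raise ValueError(f"axis {axis} has extent {shape[axis]} but no declared role")
    return roles


# Orientation only, bootstrap only, both, and a two-axis orientation encoding.
orientation_only = check_axis_roles((96, 96, 60, 3), orientation_axes=(3,))
bootstrap_only = check_axis_roles((96, 96, 60, 50), bootstrap_axis=3)
both = check_axis_roles((96, 96, 60, 3, 50), orientation_axes=(3,), bootstrap_axis=4)
two_axis = check_axis_roles((96, 96, 60, 45, 10), orientation_axes=(3, 4))
```

Declaring both roles explicitly means a reader never has to infer them from axis order, which is the concern with a rule like "bootstrapping comes after orientation stuff".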

Outstanding questions

For FSL bedpostx orientations when stored in spherical coordinates:
- Is the interpretation of polar angles conformant with the ISO specification here, or is a transformation required?
- Are they truly defined with respect to the image axes, or do they conform to the same convention as bvecs, which has a flip along the first axis in the case of a positive determinant?
For spherical deconvolution with multiple kernels, should metadata encoding the response functions be defined only with respect to the parameter corresponding to each, or should all response functions be defined within the scope of model metadata?
The demonstrative examples section becomes even longer when metadata are duplicated across the contents of every JSON file. It's now more than half the document. I would prefer to defer this entirely to an external repository where exemplar data are explicitly stored. This would also better facilitate demonstrating scenarios where there are multiple possible solutions; eg. bedpostx concatenating all stick orientations / volume fractions into images versus having each parameter for each stick stored as its own image.
For metadata, it is possible that the software implementation, the model description, and the description of each individual encoded parameter could all have their own URLs.
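
The two bedpostx questions above (the polar angle convention, and a possible first-axis flip) could be probed with something like the following sketch. It assumes the ISO 80000-2 convention, in which theta is the polar angle measured from the +z axis and phi is the azimuth measured from the +x axis in the xy-plane; whether bedpostx output actually follows this is exactly what remains open. The flip rule mirrors the bvecs behaviour described above, and the function names are hypothetical.

```python
import math


def spherical_to_cartesian(theta, phi, r=1.0):
    """Convert ISO-convention spherical coordinates to an (x, y, z) tuple."""
    if r < 0:
        # A negative radius is rejected rather than silently reinterpreted.
        raise ValueError("radius must be non-negative")
    return (
        r * math.sin(theta) * math.cos(phi),
        r * math.sin(theta) * math.sin(phi),
        r * math.cos(theta),
    )


def determinant3(m):
    """Determinant of a 3x3 matrix given as nested sequences."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def image_axes_to_bvec_convention(vec, affine):
    """Flip the first component if the affine's 3x3 block has positive determinant."""
    linear = [row[:3] for row in affine[:3]]
    x, y, z = vec
    if determinant3(linear) > 0:
        return (-x, y, z)
    return (x, y, z)
```

Comparing orientations converted this way, with and without the flip, against a dataset whose fibre directions are known would be one way to settle both questions empirically.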

- Reject suffixes "mfp" (or earlier "_model") / "mdp" in favour of "_dwimap". Instead adopt distinction of "model metadata" vs. "parameter metadata".
- Reject use of advanced inheritance in favour of listing all relevant metadata for each data file in the sidecar JSON.
- Greater use of sub-dictionaries in JSON files to assist in separating metadata relevant to a model as a whole vs. only that particular parameter of the model.

arokem

Thanks for all the work on this. I think that it presents a good and viable way forward. I think that I'll defer discussion on some of the finer points for a PR against the main spec (as suggested in #24 (comment)). For now, if I understand correctly, the main question to present to others is whether we can reach broad consensus that duplication of metadata across files is the "least of evils" among our various viable options. At the risk of stating the obvious, the main tension here is that this is in contradiction to the RECOMMENDED corollary 2 of the inheritance principle, but in compliance with the more strictly defined MUST NOT rule 4.

src/derivatives/05-diffusion-derivatives.md

arokem · 2024-04-30T16:33:19Z

+
+        -   WOULD SPLITTING STICK COMPONENTS ACROSS NIFTIS
+            REQUIRE A NEW ENTITY BY WHICH TO INDEX THEM?
+            OR JUST GIVE THEM EG. `_param-spherical1`, `_param-spherical2`?


I haven't used bedpostx in a really long while, and even then only very superficially. I would love to get input from users of the tool who can tell us more about how they anticipate using these files and what they'd prefer.

Perhaps defer dedicated discussions to #93 / #94.

arokem · 2024-05-10T15:24:53Z

Any objections to merging this into #24 so that we can fix up things there and keep moving towards a PR on bids-standard/bids-specification? If I get no objections by the middle of next week, I will go ahead with that and we can follow up with more PRs on top of this one (specifically to address the "notes" and "todo" sections of the current PR).

Lestropie · 2024-05-14T04:38:39Z

I'm eager to merge it to start chipping away at other things. I've been writing code to verify appropriate interpretation of fibre orientations (not at the scope of #23, mostly trying to get an answer to #96), and have already identified one aspect where the spec will need to be augmented to support FSL data.

Lestropie added 3 commits April 26, 2024 13:37

BEP016: Fix JSON formatting in demonstrative examples

a2c35b8

Diffusion models: Forbid negative spherical coordinate radius

8672cbe

arokem mentioned this pull request Apr 30, 2024

Current state of the BEP (NOT FOR MERGING) #24

Draft

arokem reviewed Apr 30, 2024

arokem approved these changes Apr 30, 2024

Lestropie mentioned this pull request May 1, 2024

bedpostx stick component concatenation #93

Open

Diffusion model derivatives: Spelling fixes

7a123d5

arokem merged commit 1b50e3e into bep-016 May 14, 2024
10 of 11 checks passed

Lestropie deleted the dwimap_and_no_inheritance branch May 15, 2024 03:19

This was referenced May 15, 2024

Addition of new entities / suffixes #55

Open

BEP016: Forced data representation specification #67

Closed

Lestropie mentioned this pull request May 15, 2024

DWI derivatives: Provide diffusivity units #53

Closed
