Generalize well organization in high-content screening: field of view => image #137

jluethi · 2022-08-29T13:12:45Z

I would like to suggest a change to the wording of the OME-NGFF HCS plate specification and add some recommendations about performance for visualization vs. structure of image pyramids per well. Specifically, I propose that we explicitly allow for whole wells being saved as a single image as part of the OME-NGFF spec. As a conclusion of this, the components of the wells would be images, not field of views (because the image could consist of multiple field of views stitched together already).

Motivation

We would like to use OME-Zarr files to store TB-sized multi-channel, 3D high content imaging data in the HCS format. We are building an open-source image processing pipeline to process data in HCS OME-Zarr called Fractal. One of the benefits of saving such large datasets in OME-Zarrs is the possibility of interactive image visualization, e.g. in the napari viewer. When we were testing the scalability of this approach to large HCS plates, we discovered issues with saving all the field of views of the microscope as separate field of views in each well of the OME-Zarr file.
We started the discussion about this topic here: ome/ome-zarr-py#200
The discussion on the approach of saving single images per well starts here in more detail: ome/ome-zarr-py#200 (comment)

To very briefly summarize it:
By saving many field of views (FOVs) per well as separate images with the whole pyramid hierarchy leads to very suboptimal IO challenges. To visualize plates at low resolution, a tiny pyramid file needs to be loaded for each field of view. When a plate has >1000 field of views across all its wells, this becomes very, very slow. Even for a case with just 72 field of views and just 3 pyramid levels, loading was already 8 times slower with the FOVs saved as separate image pyramids vs. a single image pyramid. This seems to be quite a fundamental issue of how fast many small files vs. a single large file can be accessed and would likely get worse when using object storage vs classical file systems. See further details in the issues above

Thus, our solution to this has been to store our wells as a single, fused images for each well. In discussions on this issue, there was an openness to this approach being part of the spec. Thus, I have created this PR to suggest a change that would explicitly allow this and mentions the trade-offs. I hope this PR can be the place to discuss this further and see whether it can make it into the ome-ngff spec.

Open questions

How should we specify the trade-offs? I'm proposing a "Note" here, but open to other implementations. Also, is this specification of Note correct? Does it work for multi-line paragraphs?

Is the explanation of the trade-offs understandable? See here: 20261ac

Note: Trade-offs on how data is structured per well:
Field of views of the microscope MAY be saved as individual images in each
well to allow for maximal flexibility regarding translations between field of views.
Having wells with many individual images does not scale for visualisation of
large plates. Visualisation tools would then need to read all the tiny pyramid
files for each field of view to create overviews and this IO performance becomes
a big limiting factor. In that case, all the field of views SHOULD be saved as
a single, combined image. In that way, the pyramid chunks can be kept at a
reasonable size for low-resolution representations of a well.

I think it is important to get away from the field of view naming in the spec when wells can be collections of images. But there are two keys in the plate metadata that contain the name field. How should one proceed with these?
Specifically, maximumfieldcount (does it describe max field of views per well? Or in total? ⇒ is the wording of images per well correct? Or would it be images in the whole plate (though then what is “max”, isn’t that just a count)?) and field_count (is that per well or per plate? It says “fields per view” ⇒ what is a view?)

github-actions · 2022-08-29T13:12:57Z

Automated Review URLs

will-moore · 2022-08-29T13:56:18Z

Thanks for that.
I feel that MAY and SHOULD terms are about the rules of the Spec itself and probably shouldn't be used in this context? I think you can drop 1 or 2 sentences and be a bit less explicit, and users will still understand. How about this:

"Field of views of the microscope may be saved as individual images in each
well to allow for maximal flexibility regarding translations between field of views.
However, having wells with many individual images does not scale well for visualisation of
large plates. In that case, combining the fields and saving as a single image per Well is likely to
improve performance."

will-moore · 2022-08-29T13:59:08Z

maximumfieldcount is the largest number of fields in any single Well. Please feel free to modify the description of this term to clarify this in the spec. Comes from OME model: https://www.openmicroscopy.org/Schemas/Documentation/Generated/OME-2016-06/ome.html

jluethi · 2022-08-29T17:03:06Z

Thanks @will-moore

How about this

Sounds great, I shortened it that way

modify the description

Thanks for the confirmation. In that case, I guess it needs to remain being called maximumfieldcount & my wording change should be correct. I slightly updated the field_count to be (hoepfully) more clear as well

will-moore

Looks good, thx 👍

jluethi · 2022-10-26T17:03:00Z

@will-moore Just checking in: What is the process or timeline to get this change into the OME-NGFF spec? Is there a chance it will be part of the 0.5 spec? Do I need to talk to some people or convince someone else first that this would be a good idea?
No stress at all, just wanted to check in whether I should be doing something about this PR :)

will-moore · 2022-11-03T10:19:18Z

I would expect this to be included in v0.5 spec, especially since it's more like advice than a change in spec.
Anything else needed here @sbesson?

sbesson

No objection from my side. We might want whoever will be driving the 0.5 roadmap to also quickly sign-off.

From a naming perspective:

the usage of images increases the consistency with the terminology used in the well specification
from the closest equivalent model, a WellSample in the OME model is defined as an image captured within a well

Regarding the discussion between alternative layouts and their suitability for different application contexts, I do not have a better suggestion than the note. Two comments:
1- this discussion applies outside the context of HCS data i.e. storing unstitched vs stitched images,
2- other decisions have similar trade-offs (chunking size, chunk dimensions, resolution granularity).
I anticipate the information about these trade-offs might be reworked as the specification evolves.

imagesc-bot · 2023-03-27T08:42:34Z

This pull request has been mentioned on Image.sc Forum. There might be relevant details there:

https://forum.image.sc/t/faim-hcs-functions-to-work-with-hcs-data/78868/11

imagesc-bot · 2023-10-03T19:43:57Z

This pull request has been mentioned on Image.sc Forum. There might be relevant details there:

https://forum.image.sc/t/using-naparis-new-not-yet-released-async-functionality-to-browse-large-ome-zarr-hcs-plates/86984/1

imagesc-bot · 2023-11-23T12:08:26Z

This pull request has been mentioned on Image.sc Forum. There might be relevant details there:

https://forum.image.sc/t/best-approach-for-appending-to-ome-ngff-datasets/89070/3

imagesc-bot · 2024-02-22T15:51:16Z

This pull request has been mentioned on Image.sc Forum. There might be relevant details there:

https://forum.image.sc/t/fractal-framework-zarr-compatibility/92536/2

psobolewskiPhD · 2025-01-30T20:42:24Z

Gentle bump here as I just ran into this and found it quite surprising.
Have HCS plates with 25 FoV per well, so previewing a whole plate is brutally bad.

jluethi · 2025-01-31T08:49:40Z

Thanks for pinging this @psobolewskiPhD !

We actually fully switched to the approach of stitching multi-FOV wells into single arrays and do this by default for all of our current converters now (see converter packages for Fractal here: https://fractal-analytics-platform.github.io/fractal_tasks/).

For us, that makes visualizing large plates with many FOVs per well work quite well. Happy to have an exchange on that in case you're interested.

And we should find a good way to push this. I'll certainly include it in upcoming discussions on collections user stories for the HCS part.

will-moore · 2025-01-31T08:57:41Z

@sbesson and I have approved this PR, so it's looking good, but it does need rebasing onto main.
However latest/* is now a symlink to the v0.5 spec and any changes there are effectively a release of the spec - see #292 (comment).

latest/index.bs

@imagejan

Improve syntax based on proposal from @imagejan Co-authored-by: Jan Eglinger <[email protected]>

jluethi added 2 commits August 29, 2022 14:37

Change field of view to image in well content

55e28bf

Add note about trade-offs for single- vs multi-image wells

20261ac

jluethi mentioned this pull request Aug 29, 2022

Read multiple fields of view on a grid, for HCS dataset ome/ome-zarr-py#200

Open

Improve wordingwith feedback from @will-moore

03e2812

will-moore previously approved these changes Aug 31, 2022

View reviewed changes

jluethi mentioned this pull request Sep 8, 2022

Handling “acquisitions” in plate & well reading ome/ome-zarr-py#225

Open

sbesson previously approved these changes Nov 8, 2022

View reviewed changes

This was referenced Jan 19, 2023

Plate loading for wells with varying zyx dimensions ome/ome-zarr-py#240

Open

[WIP] Support plate loading for varying well sizes (ref #240) ome/ome-zarr-py#241

Open

jluethi mentioned this pull request Oct 25, 2023

Define table specifications fractal-analytics-platform/fractal-tasks-core#582

Merged

1 task

jluethi mentioned this pull request Sep 19, 2024

Vizarr: issues about well viewer and resolution fractal-analytics-platform/fractal-vizarr-viewer#24

Open

imagejan reviewed Jan 31, 2025

View reviewed changes

latest/index.bs Outdated Show resolved Hide resolved

Update latest/index.bs

7f7fa21

Improve syntax based on proposal from @imagejan Co-authored-by: Jan Eglinger <[email protected]>

jluethi dismissed stale reviews from sbesson and will-moore via 7f7fa21 February 4, 2025 16:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize well organization in high-content screening: field of view => image #137

Generalize well organization in high-content screening: field of view => image #137

jluethi commented Aug 29, 2022 •

edited

Loading

github-actions bot commented Aug 29, 2022 •

edited

Loading

will-moore commented Aug 29, 2022

will-moore commented Aug 29, 2022 •

edited

Loading

jluethi commented Aug 29, 2022

will-moore left a comment

jluethi commented Oct 26, 2022

will-moore commented Nov 3, 2022

sbesson left a comment

imagesc-bot commented Mar 27, 2023

imagesc-bot commented Oct 3, 2023

imagesc-bot commented Nov 23, 2023

imagesc-bot commented Feb 22, 2024

psobolewskiPhD commented Jan 30, 2025

jluethi commented Jan 31, 2025

will-moore commented Jan 31, 2025

Generalize well organization in high-content screening: field of view => image #137

Are you sure you want to change the base?

Generalize well organization in high-content screening: field of view => image #137

Conversation

jluethi commented Aug 29, 2022 • edited Loading

Motivation

Open questions

github-actions bot commented Aug 29, 2022 • edited Loading

Automated Review URLs

will-moore commented Aug 29, 2022

will-moore commented Aug 29, 2022 • edited Loading

jluethi commented Aug 29, 2022

will-moore left a comment

Choose a reason for hiding this comment

jluethi commented Oct 26, 2022

will-moore commented Nov 3, 2022

sbesson left a comment

Choose a reason for hiding this comment

imagesc-bot commented Mar 27, 2023

imagesc-bot commented Oct 3, 2023

imagesc-bot commented Nov 23, 2023

imagesc-bot commented Feb 22, 2024

psobolewskiPhD commented Jan 30, 2025

jluethi commented Jan 31, 2025

will-moore commented Jan 31, 2025

jluethi commented Aug 29, 2022 •

edited

Loading

github-actions bot commented Aug 29, 2022 •

edited

Loading

will-moore commented Aug 29, 2022 •

edited

Loading