Clean up reader-mapper-parser infrastructure, use multiformat reader #19

lukaspie · 2024-02-23T13:43:24Z

Currently, the reader implements a three-layer structure:

there is the XpsReader(BaseReader)
then there's a layer called "mappers" (see here for an example)
these mappers themselves actually call "parsers" (example).

Each mapper is used for one file format (like the sle format from SPECS), and then I have logic that calls a parser for a specific subset of such files, e.g. depending on the software version that was used for this file. That allows me to keep functionality across different, yet very similar, versions of a format through inheritance and abstract base classes.
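To illustrate the mapper/parser split described above, here is a minimal sketch. All class names, version strings, and the dict-based "raw file" layout are hypothetical and chosen only for illustration; they are not the actual classes in this repository.

```python
from abc import ABC, abstractmethod


class SleParserBase(ABC):
    """Hypothetical base class holding logic shared by all format versions."""

    supported_versions: tuple = ()

    def parse(self, raw: dict) -> dict:
        # Shared steps live here; only the version-specific step is abstract.
        return {"version": raw["version"], "spectra": self._read_spectra(raw)}

    @abstractmethod
    def _read_spectra(self, raw: dict) -> list:
        """Extract the spectra in a version-specific way."""


class SleParserV1(SleParserBase):
    supported_versions = ("1.0", "1.1")  # invented version strings

    def _read_spectra(self, raw: dict) -> list:
        return list(raw["data"])


class SleParserV2(SleParserV1):
    supported_versions = ("2.0",)  # invented version string

    def _read_spectra(self, raw: dict) -> list:
        # Assumption for this sketch: newer files nest the data one level deeper.
        return list(raw["spectra"]["data"])


class SleMapper:
    """Hypothetical mapper: one file format, several version-specific parsers."""

    parsers = (SleParserV1, SleParserV2)

    def select_parser(self, version: str) -> SleParserBase:
        for parser_cls in self.parsers:
            if version in parser_cls.supported_versions:
                return parser_cls()
        raise ValueError(f"Unsupported version: {version}")

    def parse_file(self, raw: dict) -> dict:
        return self.select_parser(raw["version"]).parse(raw)
```

With this layout, supporting a new software version means adding one small subclass that overrides only what changed.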

All of those sub-classes could be readers themselves, inheriting from our BaseReader class (or the MultiFormatReader developed in FAIRmat-NFDI/pynxtools#250).

The file extension is often not unique, i.e., many vendors have a .txt export, but all the files are actually different. But this logic could probably be handled by passing a function in the extensions dict of the MultiFormatReader that does this. This is already being handled similarly. And finally, there should be a check that the file comes from the list of supported vendors.
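A rough sketch of the idea of mapping an extension to a function that then decides on the vendor. This is a standalone illustration and does not use the real MultiFormatReader; the `extensions` dict, the `# vendor:` header marker, the vendor names, and all function names are assumptions made up for this example.

```python
from pathlib import Path
from typing import Callable, Dict, Optional

# Hypothetical list of supported vendors.
SUPPORTED_VENDORS = ("specs", "phi")


def detect_vendor(text: str) -> Optional[str]:
    """Guess the vendor from the first line (the marker format is invented)."""
    lines = text.splitlines()
    first_line = lines[0].strip().lower() if lines else ""
    prefix = "# vendor:"
    if first_line.startswith(prefix):
        return first_line[len(prefix):].strip()
    return None


def handle_txt_file(file_path: str) -> dict:
    """Handler for the ambiguous .txt extension: inspect the content first."""
    text = Path(file_path).read_text(encoding="utf-8", errors="replace")
    vendor = detect_vendor(text)
    if vendor not in SUPPORTED_VENDORS:
        raise ValueError(f"{file_path}: vendor {vendor!r} is not supported")
    # A real reader would now hand over to the vendor-specific sub-reader.
    return {"vendor": vendor, "path": file_path}


# Extension -> handler function, in the spirit of the extensions dict.
extensions: Dict[str, Callable[[str], dict]] = {".txt": handle_txt_file}


def read(file_path: str) -> dict:
    handler = extensions.get(Path(file_path).suffix.lower())
    if handler is None:
        raise ValueError(f"No handler for {file_path}")
    return handler(file_path)
```

The point of the design is that the extension only picks a handler, while the handler itself looks into the file to decide which vendor-specific code should run.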

lukaspie · 2024-11-22T09:06:23Z

Closed as sufficiently addressed

lukaspie added the enhancement and sub-reader labels Feb 23, 2024

lukaspie self-assigned this Feb 23, 2024

lukaspie changed the title ~~Clean up reader-mapper-parser infrastructure~~ Clean up reader-mapper-parser infrastructure, use multiformat reader Feb 28, 2024

lukaspie mentioned this issue Mar 27, 2024

Add sub-reader for PHI Versaprobe data #5

Merged

2 tasks

lukaspie closed this as completed Nov 22, 2024
