Parse TRIAL_VARs #990

saeub · 2025-03-05T18:47:50Z

Description of the problem

EDF files usually contain DataViewer-compatible messages with trial variables (https://www.sr-research.com/support/thread-83.html). Example from RaCCooNS:

MSG     5141156 TRIALID 8
...
MSG     5149993 !V TRIAL_VAR TRIAL_INDEX 8
MSG     5149994 !V TRIAL_VAR itemnr 199
MSG     5149995 !V TRIAL_VAR sentence Zodra de weg weer vrij is, hebben tient
allen bussen met soldaten voorrang.
MSG     5149996 !V TRIAL_VAR question Is de weg geblokkeerd?
MSG     5149997 !V TRIAL_VAR answer .
MSG     5149997 !V TRIAL_VAR KEY_PRESSED .
MSG     5149998 !V TRIAL_VAR BLOCK 2
MSG     5149999 !V TRIAL_VAR sequence E
...
MSG     5150000 TRIAL_RESULT 0

TRIALID and TRIAL_RESULT denote the start and end of a trial, this can be parsed with the patterns argument in from_asc() without problems. But currently the TRIAL_VAR messages cannot easily be parsed and associated with the corresponding trial, since they can occur at any point between TRIALID and TRIAL_RESULT. In RaCCooNS and MultiplEYE, they occur at the end of the trial (right before TRIAL_RESULT).

Since this is a standard way of encoding trial starts, ends, and variables, I think it would be nice if these would be parsed by default, and trial variables are added as additional_columns automatically.

Description of a solution

By default (or optionally?), samples between TRIALID and TRIAL_RESULT messages should get additional column values:

The trial ID from the TRIALID message
The trial result from the TRIAL_RESULT message
Any trial variables set using !V TRIAL_VAR messages

The values defined by the TRIAL_RESULT and !V TRIAL_VAR messages would have to be "retroactively" added to previous samples as the ASC file is being parsed -- this is probably the main challenge, but shouldn't be too difficult.

Minimum acceptance criteria

Trial IDs, trial results, and trial variables from DataViewer messages are parsed (either by default or optionally) into additional columns of the gaze (and event) dataframe

The text was updated successfully, but these errors were encountered:

SiQube · 2025-03-05T21:59:53Z

looks like an excellent addition to pymovements, would you be interested in providing an initial PR?

dkrako · 2025-03-06T09:19:16Z

Autoparsing trial vars is a great idea!

But wouldn't be adding these as trial columns overkill for the memory consumption? This would add a string like sentence Zodra de weg weer vrij is, hebben tient allen bussen met soldaten voorrang. from your example to every sample. I fear this would blow up.

Instead, let's separate these two functionalities:

auto parsing trial vars into a message data structure
implement functionality to add specific messages to gaze or event frames according to their timestamp

(2.) would be quite helpful outside of parsing and would give the user some flexibility without adding many additional arguments to the from_asc() signature.

saeub · 2025-03-08T00:12:47Z

Thanks for your feedback! I agree that adding all trial variables by default would be too much. But I also think that adding another data structure and a method for adding trial variables after loading might just make it more confusing to use, because it's not obvious and people would have to look for it in the documentation. Maybe we could have an optional argument like from_asc(trial_vars=["TRIAL_INDEX", "itemnr"]) so that we can at least load the most essential trial variables by default in the dataset definitions?

Data files exported with SMI's BeGaze also have built-in messages for trial IDs (but no other trial variables?), so I personally think adding trial IDs by default would be nice.

I'll come back to this after dealing with #945 and #1013.

dkrako · 2025-03-08T10:25:20Z

I think your proposal is a great compromise between usability and performance.
Even the name of the argument could be used as is.

Probably the reason why I'm continually drawn towards a message data structure is the eyelinker package. It can fit data that does not belong anywhere else. It won't be the last time I propose this, but next time I'll consider better alternatives first.

saeub · 2025-03-08T11:28:51Z

Probably the reason why I'm continually drawn towards a message data structure is the eyelinker package.

@dkrako I just looked it up, and that does look like a nice and simple structure for arbitrary messages -- much easier to handle than the custom metadata patterns introduced in #767 (I kind of regret that PR now 😅 I think the behavior is too specific and unintuitive, and #907 would just make it more complex). Maybe not a good fit for the trial variables, since they apply to an entire block and not a single timestamp, but I think a simple way to store user-defined messages would indeed be useful. 🤔

dkrako · 2025-03-12T08:22:40Z

Ah don't worry, we're always smarter in hindsight. We will take care of the metadata dict step by step.

I think both approaches can be supported.

Your idea with the trial_vars argument would be probably most intuitive and would cover most use cases.

A message datastructure and a way to add specific messages to columns would then be a complementing feature.

saeub added the enhancement New feature or request label Mar 5, 2025

saeub mentioned this issue Mar 5, 2025

dataset: add RaCCooNS by Frank and Aumeistere #961

Open

saeub self-assigned this Mar 5, 2025

saeub added the parsing label Mar 6, 2025

dkrako added the essential important label Mar 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parse TRIAL_VARs #990

Parse TRIAL_VARs #990

saeub commented Mar 5, 2025 •

edited

Loading

SiQube commented Mar 5, 2025

dkrako commented Mar 6, 2025 •

edited

Loading

saeub commented Mar 8, 2025

dkrako commented Mar 8, 2025

saeub commented Mar 8, 2025

dkrako commented Mar 12, 2025

Parse TRIAL_VARs #990

Parse TRIAL_VARs #990

Comments

saeub commented Mar 5, 2025 • edited Loading

Description of the problem

Description of a solution

Minimum acceptance criteria

SiQube commented Mar 5, 2025

dkrako commented Mar 6, 2025 • edited Loading

saeub commented Mar 8, 2025

dkrako commented Mar 8, 2025

saeub commented Mar 8, 2025

dkrako commented Mar 12, 2025

saeub commented Mar 5, 2025 •

edited

Loading

dkrako commented Mar 6, 2025 •

edited

Loading