switch to interval_[start/end] + allow forecast non-zero first step #64

dfulu · 2024-10-07T17:29:37Z

In this PR we:

- Update the config and function parameters to use interval_[start/end]
- Allow the NWP to have a first forecast step which is not valid at the init-time
- Slight tidying

Closes #22

ocf_data_sampler/select/select_time_slice.py

AUdaltsova · 2024-10-08T11:23:24Z

ocf_data_sampler/torch_datasets/pvnet_uk_regional.py

+                interval_start=minutes(nwp_config.interval_start_minutes),
+                interval_end=minutes(nwp_config.interval_end_minutes),
                dropout_timedeltas=minutes(nwp_config.dropout_timedeltas_minutes),
                dropout_frac=nwp_config.dropout_fraction,


Looking at the way sat delay is handled now, do we ever actually use the dropout fraction system for NWPs (or at all)? Because if we always just set it to one delay that is always used, then we can just do it through interval_end as well.

Though to be honest I am a bit on the fence about factoring the delay into interval start/end; I think it can get messy and is very susceptible to human error. I like the way the current NWP dropout system lets you say "I want x amount backward and y amount forward, latest you can get me" and it factors in the delay for you. When setting up sat config it used to be "I want an hour of history, but the delay is 30 minutes, so actually I need to request 90 minutes of history" etc, which is bad, and moving to "I want an hour of history with delay 30 min, so I need everything between -90 min and -30 min" is better, but still a long way away from "I want this much, you'll probably need to wait that much for it to come in, go figure it out".

What am I missing?

dfulu · 2024-11-01

> Looking at the way sat delay is handled now, do we ever actually use the dropout fraction system for NWPs (or at all)? Because if we always just set it to one delay that is always used, then we can just do it through interval_end as well.

You're right, we always set the NWP dropout to a constant value. I'm not sure how we would use interval_end for that, though. Currently interval_start/end define the slice along the step dimension, while t0 and dropout_timedeltas_minutes define the slice along the init-time dimension.
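
As a rough illustration of the two slicing dimensions described here, the sketch below first picks an init-time and then the steps that cover the requested interval. The helper name `select_nwp_slice`, its positive `delay` argument, and the example times are assumptions made for illustration; this is not the ocf_data_sampler implementation.

```python
from datetime import datetime, timedelta


def select_nwp_slice(t0, init_times, steps, interval_start, interval_end, delay):
    """Hypothetical helper: choose one init-time, then the steps covering the interval."""
    # Init-time dimension: take the latest forecast issued at or before t0 - delay
    init_time = max(t for t in init_times if t <= t0 - delay)
    # Step dimension: keep steps whose valid time falls inside
    # [t0 + interval_start, t0 + interval_end]
    start, end = t0 + interval_start, t0 + interval_end
    selected_steps = [s for s in steps if start <= init_time + s <= end]
    return init_time, selected_steps


t0 = datetime(2024, 1, 1, 12, 0)
init_times = [datetime(2024, 1, 1, h) for h in (0, 3, 6, 9, 12)]
# The first step is +1h, so no step is valid at the init-time itself
steps = [timedelta(hours=h) for h in range(1, 13)]

init_time, selected_steps = select_nwp_slice(
    t0,
    init_times,
    steps,
    interval_start=timedelta(hours=-1),
    interval_end=timedelta(hours=2),
    delay=timedelta(hours=1),
)
```

With these assumed inputs the 09:00 init-time is chosen, and the selected steps are the ones whose valid times run from 11:00 to 14:00.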

> I like the way the current NWP dropout system lets you say "I want x amount backward and y amount forward, latest you can get me" and it factors in the delay for you.

That hasn't really changed in this PR. We are basically just renaming the history and forecast parameters. So after this PR it is "I want between [t0+x, t0+y], the latest init-time you can get me". For NWP it does still factor in the delay for you.

I think this PR gives a cleaner way of expressing the time slice. For the Manchester Prize satellite prediction input I want to slice between t0+15 minutes and t0+3 hours. Under our old system I'd need to say I wanted negative 15 minutes of history, which seems clumsy.
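
A minimal sketch of how an interval relative to t0 might expand into timestamps, covering both the t0+15 minutes to t0+3 hours case and a delayed history slice. The helper `interval_timestamps` and the 15-minute resolution are assumptions for illustration only.

```python
from datetime import datetime, timedelta


def interval_timestamps(t0, interval_start, interval_end, freq):
    """Hypothetical helper: timestamps from t0 + interval_start to t0 + interval_end, inclusive."""
    times = []
    t = t0 + interval_start
    while t <= t0 + interval_end:
        times.append(t)
        t += freq
    return times


t0 = datetime(2024, 1, 1, 12, 0)
freq = timedelta(minutes=15)

# A slice that starts after t0: no "negative history" is needed
future = interval_timestamps(t0, timedelta(minutes=15), timedelta(hours=3), freq)

# A delayed history slice expressed directly relative to t0
history = interval_timestamps(t0, timedelta(minutes=-90), timedelta(minutes=-30), freq)
```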

> When setting up sat config it used to be "I want an hour of history, but the delay is 30 minutes, so actually I need to request 90 minutes of history" etc, which is bad, and moving to "I want an hour of history with delay 30 min, so I need everything between -90 min and -30 min" is better, but still a long way away from "I want this much, you'll probably need to wait that much for it to come in, go figure it out".

Personally I like the explicitness of setting everything relative to t0. To me it better highlights the data we expect to be available in production, i.e. we can't delete production data until it is more than 90 minutes stale. But I'm open to suggestions for how else we might parameterise this.

AUdaltsova · 2024-11-04

> You're right, we always set the NWP dropout to a constant value. I'm not sure how we would use interval_end for that though. Currently interval_start/end define the slice along the step dimension. t0 and dropout_timedeltas_minutes define the slice along the init-time dimension.

Yeah good point, my bad!

> I think this PR gives a cleaner way of expressing the time slice. For the Manchester Prize satellite prediction input I want to slice between t0+15 minutes and t0+3 hours. Under our old system I'd need to say I wanted negative 15 minutes of history, which seems clumsy.

I think this is the use case I was missing, it makes a lot of sense now, thanks!

Not a hill I will die on, but for what it's worth: I think this intervaling is convenient, especially if you want an "unconventional" sat slice (it might even enable 1 image of sat history, which I previously had to hack around with forecast/history to get, which is great!). But also, because for sat data init_time and step are the same thing, it kind of mixes the two together, if that makes sense? Which is why it doesn't work for NWP delay setting, and why I initially thought it would.

In that sense, I would maybe prefer for the delay to still be set separately, functioning the way NWP delay does and not how it did previously, so as to be able to set interval (-60, -15) and delay -30 instead of interval (-90, -45), or how it was previously, history 90 delay 45 under the assumption that the tail will get cut off. (By the way, hot take, but delay already implies moving back, so maybe it shouldn't be set as a negative? Seems kind of counter-intuitive to me. But there's probably a good reason and also, I digress.)

Main reason: I know at least I am extremely prone to human error in config setting and would like as much automation as possible, and also prefer more single-function things and less multi-function things.

I know we've talked a lot about eradicating live sat delay, and maybe this is a bit excessive parametrising for what it does, but to me it seems a bit more straightforward on user end. Also, feel free to tell me what I missed! Fully expect to be wrong on this one, I have way less experience with sat than you do.

> Personally I like the explicitness of setting everything relative to t0. To me it better highlights the data we expect to be available in production -> i.e. we can't delete production data until it is more than 90 minutes stale. But I'm open to suggestions for how else we might parameterise this

Here you'd 100% know better than I do! I get that this is a bit of a have your cake and eat it thing and if your prod cake is more useful than my user cake I'm happy for intervaling to cover delay as well.

By the way, thanks a lot for doing this! I really like the way it's turning out.

AUdaltsova · 2024-11-05

I also realised that the sat config inherits from DropoutMixin (and always has), so maybe that's the solution? I know we were going to change that to simplify into just delay, so unless there's something I'm missing I think sat and nwp can use the same mechanism here?

Though dropout is also inherited by GSP and Site configs, and I'm not sure if there are use cases for those that will not be covered by delay.

dfulu · 2024-11-11

Switching to (-60, -15) and delay -30 instead of the current (-90, -45)

Personally I see that as slightly more complicated than it needs to be, and I prefer having one less parameter. In the case you suggest, (-60, -15) and delay -30 is an identical slice to (-45, 0) and delay -45. I think it is messy that we could get the same time slice with different parameter settings. That feels like something which could be prone to human error too.
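
The equivalence mentioned here can be checked with a few lines of arithmetic. The sketch assumes that a separate delay parameter would simply shift the interval along the time axis, as proposed in the thread; `effective_interval` is a hypothetical helper, and all values are minutes relative to t0.

```python
def effective_interval(interval, delay):
    """Hypothetical helper: shift a (start, end) interval in minutes by a delay in minutes."""
    start, end = interval
    return (start + delay, end + delay)


a = effective_interval((-60, -15), delay=-30)
b = effective_interval((-45, 0), delay=-45)
c = effective_interval((-90, -45), delay=0)

# All three parameter settings resolve to the same slice relative to t0
same_slice = a == b == c == (-90, -45)
```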

About the use of dropout and delay

I'm not sure if I'm answering your question here, but I think the dropout and delay should be different things. Delay controls the time slice we are choosing and therefore the shape of the input tensor. Dropout controls whether those requested datetimes are actually available or are infilled with NaNs in the array. Admittedly we abuse the "dropout" in the NWP data but I think we should change that to make it clearer.
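
To make the distinction concrete, here is a minimal sketch in which the interval fixes which timestamps are requested (and so the shape), while dropout only replaces the newest values with NaN. The helper `apply_dropout`, the 15-minute resolution, and the values are assumptions for illustration, not the library's API.

```python
import math
from datetime import datetime, timedelta


def apply_dropout(times, values, last_available):
    """Hypothetical helper: NaN-fill values later than the last available time."""
    return [v if t <= last_available else math.nan for t, v in zip(times, values)]


t0 = datetime(2024, 1, 1, 12, 0)

# The interval (-60, 0) minutes at an assumed 15-minute resolution fixes the shape: 5 timestamps
times = [t0 + timedelta(minutes=m) for m in range(-60, 1, 15)]
values = [1.0, 2.0, 3.0, 4.0, 5.0]

# Dropout of 30 minutes: same timestamps and length, but the newest values become NaN
dropped = apply_dropout(times, values, last_available=t0 - timedelta(minutes=30))
```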

I think we should get rid of "delay" as we have it currently, and just use the intervals relative to t0. We can then discuss how we name "dropout" across different data sources.

AUdaltsova · 2024-11-14

Hello again!

I see what you're saying! I guess I just dislike not having the duration of the slice explicitly stated anywhere. Also, this is beside the point probably, but I think it's fine to have different parameter settings to give you the same slice: if I'm reading someone else's config and it says (-90, -45) I'd assume it's delay 45, slice of 45, and if it's actually delay 30 and we don't want to use the newest 15 min for some reason I'd want to know (don't think this would ever happen but I hope you can see what I'm getting at anyway; sorry if it's not too clear!).

Anyway, honestly, I think this conversation doesn't mean we can't merge this, I think intervaling is a good update regardless and we can always redo delay later if we ever want to.

Yeah good point! I agree dropout and delay shouldn't be the same mechanism. I think we were going to redo delay for NWP anyway so that should resolve this maybe. One thing though, and that relates to the previous point: it seems strange to me that delay should control the shape of the tensor; I'd expect it to be decided by the time slice exclusively, and delay to just 'slide' it along the time axis, if you will? But maybe there's a reason for it to be like that. Anyway, I feel like I've unnecessarily turned your PR into a debate club, sorry! Gonna go approve it now.

dfulu · 2024-11-14

Thanks! We can come back to this again if we want to change it, and I'll add it to the agenda of our next data-sampler meeting.

codecov · 2024-11-01T10:19:24Z

Codecov Report

Attention: Patch coverage is 98.07692% with 1 line in your changes missing coverage. Please review.

Project coverage is 94.66%. Comparing base (ef7c83e) to head (c8baf68).
Report is 45 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ocf_data_sampler/config/model.py | 95.65% | 1 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #64      +/-   ##
==========================================
+ Coverage   93.05%   94.66%   +1.60%     
==========================================
  Files          22       27       +5     
  Lines         691      824     +133     
==========================================
+ Hits          643      780     +137     
+ Misses         48       44       -4


dfulu added 2 commits October 7, 2024 17:26

switch to interval_[start/end] + allow forecast to have non-zero first step time

c68d584

fix test

c7a33a7

dfulu requested a review from AUdaltsova October 7, 2024 17:35

AUdaltsova reviewed Oct 8, 2024

ocf_data_sampler/select/select_time_slice.py

AUdaltsova reviewed Oct 8, 2024

dfulu and others added 2 commits November 1, 2024 10:14

remove unneeded param check

4742ffc

Merge branch 'main' into interval

c8baf68

dfulu requested a review from AUdaltsova November 1, 2024 10:53

dfulu marked this pull request as ready for review November 11, 2024 11:54

AUdaltsova approved these changes Nov 14, 2024

Merge branch 'main' into interval

9794788

dfulu merged commit f9fd827 into main Nov 14, 2024
3 checks passed

dfulu deleted the interval branch November 14, 2024 18:56


dfulu commented Oct 7, 2024 • edited
AUdaltsova Oct 8, 2024 • edited
dfulu Nov 1, 2024 • edited
AUdaltsova Nov 4, 2024
AUdaltsova Nov 5, 2024
dfulu Nov 11, 2024 • edited
AUdaltsova Nov 14, 2024 • edited
dfulu Nov 14, 2024
codecov bot commented Nov 1, 2024 • edited
