Energy Score Functions for Model Evaluation #289

trobacker · 2025-01-24T02:51:14Z

R script containing functions to get energy scores from model output and produce basic summaries of scores.

src/model_scoring_functions.R

Double braket. Co-authored-by: Zhian N. Kamvar

src/model_scoring_functions.R

rogersbw

I believe the scoring procedure used in this PR differs slightly from the one described in the hub README:

https://github.com/reichlab/variant-nowcast-hub?tab=readme-ov-file#background

Happy to meet and discuss on Monday! Overall great coding though!

src/model_scoring_functions.R

… * 100 samples of multinomial counts, on each day/date.

trobacker · 2025-01-26T20:01:09Z

Hi @rogersbw, I modified the ES calculation to use 10000 samples on each day/date: 100 (posterior draws) * 100 (samples of multinomial counts) = 10000 total. Would you mind reviewing that piece (the last commit) and confirming that it's correct?
(I left a few debugging comments in case you want to print a few things locally. I'll remove them later.)

It certainly takes longer to run now, and I plan to optimize the code next, e.g. by replacing for loops with functional programming techniques. I want to be sure the ES algorithm is correct first. Thanks!
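For readers following along, the sampling scheme described above can be sketched in base R. This is only an illustration under stated assumptions, not the code in this PR: `energy_score()` is a hypothetical helper implementing the standard sample-based energy score, the posterior draws are simulated stand-ins for real model output, and the sample sizes are reduced from 100 x 100 so that the pairwise distance matrix stays small.

```r
set.seed(1)

# Hypothetical helper: sample-based energy score.
# y: observed vector of length d; samples: m x d matrix, one sample per row.
energy_score <- function(y, samples) {
  m <- nrow(samples)
  # Mean Euclidean distance between each sample and the observation
  dist_to_obs <- sqrt(rowSums(sweep(samples, 2, y)^2))
  # Sum of Euclidean distances over all ordered pairs of samples
  pairwise <- as.matrix(dist(samples))
  mean(dist_to_obs) - sum(pairwise) / (2 * m^2)
}

obs <- c(30, 15, 5)   # assumed observed clade counts for one location/date
N <- sum(obs)         # total observed count, fixed for this location/date
K <- length(obs)

n_draws <- 20         # the PR uses 100 posterior draws
n_multinom <- 20      # the PR uses 100 multinomial samples per draw

# Stand-in posterior draws of clade proportions (normalized gamma variates)
g <- matrix(rgamma(n_draws * K, shape = rep(c(6, 3, 1), each = n_draws)),
            nrow = n_draws)
props <- g / rowSums(g)

# For each posterior draw, sample n_multinom count vectors of total size N
counts <- do.call(cbind, lapply(seq_len(n_draws), function(i) {
  rmultinom(n_multinom, size = N, prob = props[i, ])
}))

# counts is K x (n_draws * n_multinom); transpose so each row is one sample
es <- energy_score(obs, t(counts))
```

Because `N` depends only on the location/date, it is computed once and reused for every posterior draw.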

rogersbw

I have one question about handling location/date combinations with 0 observed counts. See the comment within the code.

rogersbw · 2025-01-27T16:45:48Z

src/model_scoring_functions.R

+
+        ## Generate 100 multinomial observations for samp_props
+        # Need the N for each loc/day from the validation data
+        N <- sum(subset(targets,


This can be moved outside of the loop: for each combination of loc and day, N stays constant throughout this loop.

Also, how are day/loc combinations with 0 observations handled? As I'm running through this, it seems to keep the 0-observation days and give them an energy score of 0 (a perfect score). I believe we will want these to be NA values instead.

trobacker · 2025-01-27

Thanks for the comments, Ben! I modified the energy scores to be NA when the observed counts sum to 0. I also moved the N assignment. I'll focus on code optimization next.

rogersbw

Sorry to keep sending it back! I had not thought carefully about how to handle the unscored location/dates until I saw your message on Slack. I believe these corrections should do what we want?

rogersbw · 2025-01-28T16:45:59Z

src/model_scoring_functions.R

+        df_scores <- rbind(df_scores, df_temp)
+        next
+      }
+


Looks good. When we're doing code optimization, we could initialize an array of the correct size filled with NA, and then replace values only for positive counts.
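A minimal sketch of that preallocation idea follows, assuming one total observed count per location/date. The names `obs_totals` and `toy_score()` are hypothetical placeholders, and `toy_score()` merely stands in for the real energy score computation.

```r
# Hypothetical total observed counts, one per location/date combination
obs_totals <- c(20, 0, 7, 0)

# Placeholder for the real energy score computation (assumption)
toy_score <- function(total) sqrt(total)

# Preallocate the full result with NA instead of growing it with rbind()
scores <- rep(NA_real_, length(obs_totals))

# Only location/dates with positive observed counts get a score
positive <- which(obs_totals > 0)
for (i in positive) {
  scores[i] <- toy_score(obs_totals[i])
}
```

Location/dates with no observations keep their initial NA, so no separate branch with `next` is needed inside the loop.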

src/model_scoring_functions.R

trobacker · 2025-01-29T18:56:58Z

As discussed in our modeling meeting today, I'm going to make the energy scores available for all dates/locs and build another function that filters for the ones we've generally agreed to use for final model scoring according to the hub evaluation scheme.
The motivation is that some modelers have expressed interest in getting scores for all of the data used, even though we will exclude some locs/dates from final scoring on the hub.
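One way such a filter could look is sketched below in base R. Everything here is an assumption made for illustration: the function name `filter_scores()`, the argument `hub_scheme`, the column names, and the example data are hypothetical, and the key-matching step is a base R stand-in for an anti-join by location and date.

```r
# Hypothetical energy scores for every location/date
scores <- data.frame(
  location = c("MA", "MA", "CA", "CA"),
  date = as.Date(c("2025-01-20", "2025-01-21", "2025-01-20", "2025-01-21")),
  energy_score = c(1.8, 2.4, NA, 0.9)
)

# Hypothetical location/dates excluded under the hub evaluation scheme
excluded <- data.frame(location = "MA", date = as.Date("2025-01-21"))

# Return all scores, or only those kept under the hub scheme (default)
filter_scores <- function(scores, excluded, hub_scheme = TRUE) {
  if (!hub_scheme) {
    return(scores)
  }
  # Build a combined location/date key and drop rows that match an
  # excluded key, which is what an anti-join by location and date does
  key <- function(df) paste(df$location, df$date, sep = "|")
  scores[!(key(scores) %in% key(excluded)), , drop = FALSE]
}

hub_scores <- filter_scores(scores, excluded)
all_scores <- filter_scores(scores, excluded, hub_scheme = FALSE)
```

Keeping the full table and filtering afterwards means modelers can still inspect scores for every location/date, while the default call returns only what counts toward final scoring.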

Co-authored-by: Ben Rogers

Anti-join by date, location. Co-authored-by: Ben Rogers

trobacker · 2025-02-03T19:36:11Z

FWIW, I wrote a different version that I hoped would be computationally faster: it uses expand.grid to avoid nested for-loops, and an apply to sample the multinomials. It actually took 5 seconds longer than my latest commit.
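For context, the general pattern of swapping nested loops for `expand.grid` plus an apply-style call might look like the sketch below. It is not the version described above: the locations, dates, and the per-combination work are toy assumptions.

```r
set.seed(42)

locations <- c("MA", "CA")                # hypothetical locations
dates <- c("2025-01-20", "2025-01-21")    # hypothetical dates

# One row per location/date combination, replacing two nested for loops
grid <- expand.grid(location = locations, date = dates,
                    stringsAsFactors = FALSE)

# Toy per-combination work: draw one multinomial sample of size 10
# and record its total (always 10 by construction)
grid$total <- vapply(seq_len(nrow(grid)), function(i) {
  sum(rmultinom(1, size = 10, prob = c(0.5, 0.3, 0.2)))
}, numeric(1))
```

This form does the same amount of sampling as the nested loops, so a large speedup is not guaranteed, which is consistent with the timing reported above.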

rogersbw · 2025-02-03T20:48:00Z

Looks good, merge away!

Functions to get energy scores from model output and basic summaries.

dc2ca47

trobacker added the needed for eval label Jan 24, 2025

trobacker requested review from nickreich, elray1 and IsaacMacarthur January 24, 2025 02:51

trobacker changed the title ~~Energy Scores Functions for Model Evaluation~~ Energy Score Functions for Model Evaluation Jan 24, 2025

rogersbw self-assigned this Jan 24, 2025

zkamvar reviewed Jan 24, 2025

src/model_scoring_functions.R (outdated, resolved)

zkamvar reviewed Jan 24, 2025

src/model_scoring_functions.R (resolved)

Update src/model_scoring_functions.R

a261c0d

Double braket. Co-authored-by: Zhian N. Kamvar

zkamvar reviewed Jan 24, 2025

src/model_scoring_functions.R (outdated, resolved)

zkamvar mentioned this pull request Jan 24, 2025

Need instructions for adding new scripts in the src/ folder and adding packages to the renv lockfile. #291

Open

trobacker added 3 commits January 24, 2025 12:34

Update renv with scoringRules and dependency.

48e2977

Remove display clause.

e545545

Using here and amending paths.

e4567be

rogersbw requested changes Jan 24, 2025

src/model_scoring_functions.R (outdated, resolved)

src/model_scoring_functions.R (outdated, resolved)

Modify ES calculation to include 10000 samples, 100 (posterior draws)…

0346750

… * 100 samples of multinomial counts, on each day/date.

Remove obsolete variable assignments.

3129064

rogersbw reviewed Jan 27, 2025

Place NAs where observed counts sum to 0.

f54a1fe

rogersbw requested changes Jan 28, 2025

trobacker and others added 4 commits January 31, 2025 13:04

Update src/model_scoring_functions.R

89cde8c

Co-authored-by: Ben Rogers

Update src/model_scoring_functions.R

f7ccd5c

Co-authored-by: Ben Rogers

Update src/model_scoring_functions.R

44e98e1

Anti-join by date, location. Co-authored-by: Ben Rogers

Add argument to return all energy scores or hub scheme (default)

6d7e1f3

rogersbw approved these changes Feb 3, 2025

trobacker merged commit 825c125 into main Feb 3, 2025

trobacker deleted the model-scoring-functions branch February 3, 2025 22:58
