Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate ensemble evaluation methods #229

Open
mgdenno opened this issue Aug 20, 2024 · 3 comments · May be fixed by #361
Open

Investigate ensemble evaluation methods #229

mgdenno opened this issue Aug 20, 2024 · 3 comments · May be fixed by #361
Assignees
Milestone

Comments

@mgdenno
Copy link
Contributor

mgdenno commented Aug 20, 2024

What is needed?
Do we need to add "trace" or "member" to the timeseries table.

@samlamont
Copy link
Collaborator

samlamont commented Nov 25, 2024

Adding some notes on existing verification systems/packages for reference:

@samlamont
Copy link
Collaborator

samlamont commented Jan 2, 2025

Some additional notes on the NWM Ensemble Verification System (EVS) (Note: EVS is no longer available or maintained -- it has been/is being replaced by WRES)

reference: https://www.sciencedirect.com/science/article/pii/S1364815210000204?via%3Dihub

EVS is a GUI-based ensemble verification system built specifically for NWS written in java.

It contains three main components:

  • Verification Units (VU): Paired forecast/obs timeseries for a single variable at a single geographic location (sort of like TEEHR's configuration?). Metrics are calculated per VU.
  • Aggregation Units: Allows users to aggregate performance metrics across VU's (optional).
  • Output: Provides tabular and graphical results.

Included metrics:

  • Mean error
  • RMSE
  • Correlation coef
  • Brier Score (binary classification)
  • Brier Skill Score
  • Mean CRPS
  • Mean CRPS reliability
  • CRPSS
  • ROC Score (Relative Opearting Characteristic) - discriminate between binary events (flooding/non-flooding). ROC curves show prob of detection vs. prob of false detection
  • Skill scores are relative to some reference forecast. Can be user-defined. Typically climatology.

Visualizations:

  • Mean error in prob diagram (MEPD)
  • Mean capture rate diagram (MCRD)
  • Spread-bias diagram
  • Reliability diagram
  • ROC diagram
  • Modified box plots

@samlamont samlamont linked a pull request Jan 3, 2025 that will close this issue
@samlamont
Copy link
Collaborator

Also just making a note that an experimental HEFS API data service has recently been released by NWS.

Description: https://www.weather.gov/media/notification/pdf_2023_24/pns24-77_exp_hefs_api_data_services.pdf

Notebooks: https://github.com/NOAA-OWP/data-service-notebooks/blob/master/HEFS/2_ensemble_plotting_demo.ipynb

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants