-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
first implimentation of rsv #2121
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks good.
We should do a brief statistical review on the new RSV signals. At minimum, we need national and state signals plotted over time, and RSV for some of those geos compared to flu and covid signals.
Statistical review plots: rsv_analysis.zip Using one of those plots as an example, RSV is shown as recently (in the current season) increasing maybe 50x above past values. This trend is visible in all plots. Such a large increase is not believable and doesn't match other data sources, e.g., shows that hospitalizations this year are comparable to the last two years (CDC RSV-NET, go to "Patient Characteristics" -> "Case Counts"). Emergency department admissions per capita show the same trend. (CDC RSV-NET, select "All seasons" on the side bar). Percent of emergency department admissions attributable to RSV (not shown) also have the same trend. I'm assuming something went wrong with our handling of the data. It's also possible that the source started reporting new streams (e.g. previously only adults, now added in children), but honestly the increase is so large that this seems unlikely. Edit: possibly a plotting bug? We need to
|
not quite the same but mentioned below
from the download link (https://data.cdc.gov/Public-Health-Surveillance/Weekly-Rates-of-Laboratory-Confirmed-RSV-Hospitali/29hc-w46k/about_data) i'm not quite sure how to compare? do I use rate? Cumulative Rate? there is rate column
We don't do any transformation besides filtering for "US" or "region" value for the geo columns one of the main issues is that at least in NHSN data, people didn't report RSV until 2023/2024 that's the main reason for zero, and then peaking in the tail end many zeros in the csv there's also spearman through time Once I filter out the dates the comparison does seem more reasonable -- minus the plotting bug |
Ah, I see, it was because of the reporting mandate changes:
from the NHSN data description. We'll need to note this in the documentation, because the numbers prior to this season are not useful. The national numbers match RSV-NET well for the most recent season. (RSV-NET is a representative sample covering only 9% of the population, so I multiplied it by 10 to compare values.) This is good to go. |
Description
add RSV for NHSN
Associated Issue(s)