-
-
Notifications
You must be signed in to change notification settings - Fork 120
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect generator_ids in 2023 data #3987
Comments
Hi @grgmiller, TLDR I just looked into this bug, and it's happening during a step in the harvesting process that pulls information from columns that go by names other than the generic In the
I stuck a breakpoint in the If you look at the This is also true for:
The fact that the |
Thanks for looking into this @aesharpe! I understand the source of this issue from your description, but I didn't see mention of what the planned next step would be on this issue. Would these mappings remain in or be removed from Currently in OGE, I'm just filtering on whether the |
Hi @grgmiller, sorry for the delay, I was out last week. I can think of a couple solutions, but I'm going to start by emailing EIA to see if I can better understand these generators and why they aren't showing up anywhere other than the energy storage table. For now, there is no great way to filter other than what you're currently doing. However, it's not ideal that the Is your specific issue that these records have no |
@grgmiller EIA got back to me and said the following:
For the time being we can discuss other options. |
At minimum, we could follow what I did when I originally harvested these IDs (#3699) and pull any new "fake" plant/generator IDs without any data out of the |
Describe the bug
in our OGE pipeline, we're getting warnings about missing prime mover codes for certain generators, like plant 57991 generator PV2. However, looking through all of the raw EIA data, I can't seem to find any record of a "PV2" generator at this plant. It appears in
out_eia__yearly_generators
andcore_eia__scd_generators
starting in 2023, but it has mostly missing values, and a data_maturity of NA.It appears this is a pudl bug where some records are getting mixed up.
Another example: one of the other issues is a new generator "HB2PV" at plant 1 (apparently). However, searching through the raw EIA-860 data, it looks like this generator is actually associated with plant 65851 - not sure how it is getting associated with plant 1
It also looks like there is a generator "61552" associated with plant 61153 - this looks suspiciously like a plant code.
Here's a list of all of the generators without a data_maturity, and which appear to be not real generators:
I also tested this using the nightly build version of the pudl database, downloaded 12/4/2024, and also saw this issue.
Bug Severity
How badly is this bug affecting you?
To Reproduce
Using the most recent stable version of the pudl database (stable v2024.11.0)
Expected behavior
I would expect these mismatched generators to not be there
Software Environment?
Windows, accessed via OGE, using stable v2024.11.0
The text was updated successfully, but these errors were encountered: