forked from awslabs/open-data-registry
catalyst-cooperative-pudl.yaml
Name: Public Utility Data Liberation Project
Description: |
  The Public Utility Data Liberation Project (PUDL) provides analysis-ready energy system data to climate advocates,
  researchers, policymakers, and journalists.
  <br/>
  <br/>
  [PUDL](https://catalyst.coop/pudl/) is an [open source data processing pipeline](https://github.com/catalyst-cooperative/pudl)
  that makes US energy data easier to access and use programmatically. Hundreds of gigabytes of valuable data
  are published by US government agencies, but they are often difficult to work with.
  PUDL takes the original spreadsheets, CSV files, and databases and turns them into a unified resource. This allows users to
  spend more time on novel analysis and less time on data preparation.
  <br/>
  <br/>
  This information allows users to explore the operating costs of individual power plants, and see how fuel costs impact
  the viability of different types of generation. It can highlight the competitiveness of renewable electricity in the
  market today. It can show how the generation mix of different utilities has evolved over time, and how the usage of
  individual power plants has changed as fuel prices have changed and more renewable generation has been brought online.
  <br/>
  <br/>
  The data hosted on Amazon Web Services is intended to be accessed through the
  [PUDL Intake Catalog](https://github.com/catalyst-cooperative/pudl-catalog).
  The catalog allows users to access the data via a uniform API for each data type (Parquet, SQL),
  handles local caching, and provides rich metadata about the data.
Documentation: |
  To access the data via the PUDL Intake Catalog, follow the setup
  [instructions in the documentation](https://catalystcoop-pudl-catalog.readthedocs.io/en/latest/).
  You can learn more about the data in the [PUDL data dictionary documentation](https://catalystcoop-pudl.readthedocs.io/en/dev/data_dictionaries/index.html).
Contact: For general questions or feedback about the data, create a GitHub issue or discussion in the [PUDL repo](https://github.com/catalyst-cooperative/pudl). We also love talking to our users during [PUDL Office Hours](https://calend.ly/catalyst-cooperative/pudl-office-hours).
ManagedBy: "[Catalyst Cooperative](https://catalyst.coop/)"
UpdateFrequency: |
  The federal agencies that publish the raw data PUDL processes release new data monthly, quarterly, and yearly.
  PUDL continuously improves the data and aims to release new versions monthly.
Tags:
- climate
- climate model
- energy
- environmental
- government records
- infrastructure
- open source software
- electricity
- energy modeling
- utilities
License: The PUDL data and documentation are published under the [Creative Commons Attribution License v4.0](https://creativecommons.org/licenses/by/4.0/) (CC-BY-4.0).
Resources:
  - Description: All PUDL data outputs.
    ARN: arn:aws:s3:::intake.catalyst.coop
    Region: us-west-2
    Type: S3 Bucket
DataAtWork:
  Tutorials:
    - Title: PUDL Intake Catalog Setup
      URL: https://catalystcoop-pudl-catalog.readthedocs.io/en/latest/
      AuthorName: Catalyst Cooperative
      AuthorURL: https://catalyst.coop/
    - Title: PUDL Examples
      URL: https://github.com/catalyst-cooperative/pudl-examples
      AuthorName: Catalyst Cooperative
      AuthorURL: https://catalyst.coop/
      NotebookURL: https://github.com/catalyst-cooperative/pudl-examples/blob/main/notebooks/03-pudl-parquet.ipynb
ADXCategories:
- Environmental Data
- Resources Data