📊 data: undp 2024 #2504

Merged
merged 12 commits into master from data/undp
Apr 18, 2024

Conversation

lucasrodes
Member

@lucasrodes lucasrodes commented Apr 9, 2024

↑ Tracking issue

  • Add the new UNDP 2024 dataset to ETL.
  • Update all charts (32) in the staging server.

@lucasrodes
Member Author

lucasrodes commented Apr 9, 2024

Summary of changes

The indicators show no major changes between the 2022 and 2024 releases. There are some minor ones, which I list below.

The most noticeable change is in the income inequality estimates for Southern African countries (estimates are higher in the 2024 release for multiple countries).

Expected years of schooling

  • Lebanon
  • Marshall Islands
  • Moldova

Gender Inequality Index

  • Cyprus

New Data: Somalia
Data gone for: Central African Republic, South Sudan, Turkmenistan

Gross national income per capita

  • Argentina
  • Bangladesh
  • Afghanistan
  • Libya

Inequality in education

  • Decent increase in Iran

Inequality in income

Several changes in countries

  • Myanmar
  • Afghanistan
  • Ethiopia
  • Burkina Faso
  • Cambodia
  • Burundi
  • Pakistan
  • Niger
  • Madagascar
  • Nepal
  • Ghana
  • Liberia

Overall loss

  • Iran
  • Sao Tome and Principe
  • Sri Lanka
  • Montenegro
  • Myanmar
  • Zimbabwe
  • Congo

Mean years of schooling

  • Burkina Faso
  • Micronesia (country)
  • Cape Verde
  • Equatorial Guinea

Mean years of schooling, female

  • Burkina Faso
  • Equatorial Guinea
  • Mali

Mean years of schooling, female

  • Burkina Faso
  • Cape Verde
  • Mali

@lucasrodes
Member Author

lucasrodes commented Apr 9, 2024

🐛 Unresolved issue

When updating the following charts, only data for Somalia is shown for 2022.

If we change the tolerance of the Human (or Gender) Development Index from 0 to 1, it seems to work. However, this shouldn't be necessary, as there is data for most countries for 2022. Also, with tolerance set to 1, most of the countries shown display their 2021 data, even though we have data for 2022!
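
A minimal sketch of the behaviour expected from the tolerance setting described above, assuming that tolerance means "use the closest available year within ±N years of the target year". This is an illustration of the expectation only, not Grapher's actual implementation; `pick_year`, the tie-breaking rule, and the sample values are all hypothetical.

```python
from typing import Dict, Optional


def pick_year(available: Dict[int, float], target: int, tolerance: int) -> Optional[int]:
    """Return the available year closest to `target` within `tolerance`, or None.

    Assumption: on a tie (e.g. 2021 and 2023 for target 2022), the later year wins.
    """
    candidates = [year for year in available if abs(year - target) <= tolerance]
    if not candidates:
        return None
    # Sort key: smallest distance first, then the most recent year.
    return min(candidates, key=lambda year: (abs(year - target), -year))


# Hypothetical values, for illustration only.
series = {
    "Country A": {2022: 0.40},              # only has 2022
    "Country B": {2021: 0.90, 2022: 0.91},  # has both years
    "Country C": {2021: 0.70},              # 2022 is missing
}

for tolerance in (0, 1):
    shown = {name: pick_year(values, 2022, tolerance) for name, values in series.items()}
    print(f"tolerance={tolerance}: {shown}")
```

Under this reading, a country with a 2022 value should be shown with that value for both tolerance 0 and tolerance 1, and only countries without 2022 data should fall back to 2021. The behaviour reported above differs from that on both counts.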

@lucasrodes lucasrodes requested a review from paarriagadap April 9, 2024 20:24
@lucasrodes lucasrodes marked this pull request as ready for review April 9, 2024 20:24
Contributor

@paarriagadap paarriagadap left a comment


Looks good, thank you! I only request changes to integrate the metadata we included in the HDI article we worked on last year with Bastian. You can find that in grapher/un/2022-11-29/undp_hdr.meta.yml

How different are the estimates you listed? If it's not too much it's not a big deal.
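
One way to answer the question above is to compute relative changes between the two releases and to list the countries whose coverage changed. The following is a minimal sketch under stated assumptions: each release is represented as a plain mapping from `(country, year)` to a value, and `relative_changes` and `coverage_changes` are hypothetical helpers, not part of ETL. The sample numbers are made up.

```python
from typing import Dict, List, Tuple

Key = Tuple[str, int]  # (country, year)


def relative_changes(old: Dict[Key, float], new: Dict[Key, float]) -> List[Tuple[Key, float]]:
    """Relative change (new - old) / |old| for keys in both releases, largest first."""
    changes = []
    for key in old.keys() & new.keys():
        if old[key] == 0:
            continue  # relative change is undefined for a zero baseline
        changes.append((key, (new[key] - old[key]) / abs(old[key])))
    return sorted(changes, key=lambda item: abs(item[1]), reverse=True)


def coverage_changes(old: Dict[Key, float], new: Dict[Key, float]) -> Dict[str, List[str]]:
    """Countries that appear in only one of the two releases."""
    old_countries = {country for country, _ in old}
    new_countries = {country for country, _ in new}
    return {
        "added": sorted(new_countries - old_countries),
        "removed": sorted(old_countries - new_countries),
    }


# Made-up values, for illustration only.
release_old = {("A", 2020): 10.0, ("B", 2020): 20.0, ("C", 2020): 5.0}
release_new = {("A", 2020): 11.0, ("B", 2020): 30.0, ("D", 2020): 1.0}

for key, change in relative_changes(release_old, release_new):
    print(key, f"{change:+.1%}")
print(coverage_changes(release_old, release_new))
```

Sorting by absolute relative change puts the revisions most worth checking at the top, and the coverage summary mirrors the "New data" / "Data gone for" notes in the summary of changes.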

dag/main.yml (outdated, resolved)
dag/main.yml (resolved)
@lucasrodes
Member Author

@paarriagadap thanks for all your suggestions! I addressed all your comments. They all make sense to me.

Could you double-check the metadata? I've integrated the metadata from the old Grapher step. I've also added some more details to the female/male dimensions of the variable (and others).

@lucasrodes lucasrodes requested a review from paarriagadap April 17, 2024 21:34
@owidbot
Contributor

owidbot commented Apr 18, 2024

Staging server:
etl diff: ✅ No differences found
+ Dataset garden/un/2024-04-09/undp_hdr
+ + Table undp_hdr
+   + Column abr
+   + Column co2_prod
+   + Column coef_ineq
+   + Column diff_hdi_phdi
+   + Column eys
+   + Column eys_f
+   + Column eys_m
+   + Column gdi
+   + Column gdi_group
+   + Column gii
+   + Column gii_rank
+   + Column gni_pc_f
+   + Column gni_pc_m
+   + Column gnipc
+   + Column hdi
+   + Column hdi_f
+   + Column hdi_m
+   + Column hdi_rank
+   + Column ihdi
+   + Column ineq_edu
+   + Column ineq_inc
+   + Column ineq_le
+   + Column le
+   + Column le_f
+   + Column le_m
+   + Column lfpr_f
+   + Column lfpr_m
+   + Column loss
+   + Column mf
+   + Column mmr
+   + Column mys
+   + Column mys_f
+   + Column mys_m
+   + Column phdi
+   + Column pop_total
+   + Column pr_f
+   + Column pr_m
+   + Column rankdiff_hdi_phdi
+   + Column se_f
+   + Column se_m


Legend: +New  ~Modified  -Removed  =Identical  Details
Hint: Run this locally with etl diff REMOTE data/ --include yourdataset --verbose --snippet
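
To make the legend above concrete, here is a minimal sketch of how a column-level diff with those markers could be produced. It is not how `etl diff` is implemented; `diff_columns` is a hypothetical helper, and which columns count as modified is passed in by the caller rather than detected from the data.

```python
from typing import Iterable, List


def diff_columns(old: Iterable[str], new: Iterable[str], modified: Iterable[str] = ()) -> List[str]:
    """Label each column with + (new), - (removed), ~ (modified) or = (identical)."""
    old_set, new_set, modified_set = set(old), set(new), set(modified)
    lines = []
    for column in sorted(old_set | new_set):
        if column not in old_set:
            marker = "+"
        elif column not in new_set:
            marker = "-"
        elif column in modified_set:
            marker = "~"
        else:
            marker = "="
        lines.append(f"{marker} Column {column}")
    return lines


# A brand-new table has no previous columns, so every column is marked "+",
# as in the report above.
for line in diff_columns([], ["hdi", "gii", "mys"]):
    print(line)
```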

Automatically updated datasets matching weekly_wildfires|excess_mortality|covid|fluid|flunet|country_profile are not included

Edited: 2024-04-18 11:07:00 UTC
Execution time: 15.70 seconds

@lucasrodes
Member Author

I'll merge for now; we can update and modify the metadata later if needed.

@lucasrodes lucasrodes merged commit 0ac820e into master Apr 18, 2024
9 checks passed
@lucasrodes lucasrodes deleted the data/undp branch April 18, 2024 15:27