pep_sex_2024 changes made #1110

kurus21 · 2024-11-06T18:53:42Z

No description provided.

krishnaswamypradeep · 2024-11-13T13:12:09Z

@kurus21 Can you remove the input and output folders and confirm?

krishnaswamypradeep · 2024-11-13T13:38:23Z

scripts/us_census/pep/us_pep_sex/process copy.py

Remove this file

The file has been removed.

krishnaswamypradeep

Thanks Kuru. Looks good.

ajaits · 2024-11-15T13:46:33Z

scripts/us_census/pep/us_pep_sex/process.py

+                                 skiprows=7,
+                                 skipfooter=102,
+                                 header=None)
+            df.columns = [


pls use df.rename() instead of assuming column order.

As discussed over chat, df.rename() gives no advantage here: the file is read with header=None, so that approach also has to assume the row and column positions.
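To make the trade-off in this thread concrete, here is a minimal sketch, assuming a small three-column file read with header=None. The sample data and the `names` list are illustrative only and are not taken from process.py. It shows that with header=None the columns are labelled 0..n-1, so a rename mapping is still keyed by position.

```python
import io
import pandas as pd

# Hypothetical three-column extract standing in for the Census file.
raw = io.StringIO("1990,100,110\n1991,120,130\n")
names = ['Year', 'Count_Person_Male', 'Count_Person_Female']

# Option A: assign the full list of names; relies on the order of all columns.
df_a = pd.read_csv(raw, header=None)
df_a.columns = names

# Option B: rename by label. With header=None the labels are the integer
# positions 0..n-1, so the mapping keys are still positions.
raw.seek(0)
df_b = pd.read_csv(raw, header=None).rename(columns=dict(enumerate(names)))

print(list(df_a.columns) == list(df_b.columns))
```

One practical difference: rename() silently ignores keys that do not exist, while assigning df.columns raises if the number of names does not match the number of columns.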

ajaits · 2024-11-15T13:47:51Z

scripts/us_census/pep/us_pep_sex/process.py

                             skipfooter=102,
                             header=None)
+            df.columns = [


pls use df.rename()

As discussed over chat, df.rename() gives no advantage here: the file is read with header=None, so that approach also has to assume the row and column positions.

ajaits · 2024-11-15T13:49:47Z

scripts/us_census/pep/us_pep_sex/process.py

+                'White Total', 'White Male', 'White Female', 'NonWhite Total',
+                'NonWhite Male', 'NonWhite Female'
+            ]
+            df = df.drop(columns=[


more readable to list columns of interest to be retained:
df.drop(columns=df.columns.difference(['Count_Person_Male', 'Count_Person_Female']), inplace=True)

Then it can be moved outside the if/else block

It has been modified accordingly.
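The suggestion above can be sketched as follows. This is a minimal example under assumptions: the helper name `keep_columns`, the `KEEP` list and the sample frame are hypothetical and only illustrate the `columns.difference` idiom.

```python
import pandas as pd

# Columns of interest; everything else is dropped.
KEEP = ['Year', 'Count_Person_Male', 'Count_Person_Female']


def keep_columns(df: pd.DataFrame, keep: list[str]) -> pd.DataFrame:
    """Return df without any column that is not listed in `keep`."""
    return df.drop(columns=df.columns.difference(keep))


# Hypothetical frame with extra columns similar to those in the diff.
df = pd.DataFrame({
    'Year': [2000],
    'Total': [30],
    'Count_Person_Male': [14],
    'Count_Person_Female': [16],
    'White Total': [20],
})
df = keep_columns(df, KEEP)
print(list(df.columns))
```

Because the retained columns are named in one place, the same call works for either branch of the if/else and can sit after it.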

ajaits · 2024-11-15T13:51:38Z

scripts/us_census/pep/us_pep_sex/process.py

+            # adding geoid, year and measurement method
+            df['Year'] = year
+            df.insert(0, 'geo_ID', 'country/USA', True)
+            df['Measurement_Method'] = 'dcAggregate/CensusPEPSurvey_PartialAggregate'


This seems common to both if and else and can be moved out.

It has been modified accordingly.
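A minimal sketch of hoisting the shared steps out of the branches. The function names, the `legacy_layout` flag and the input column names 'Male' and 'POPEST_MALE' are assumptions for illustration; only the three added columns come from the diff above.

```python
import pandas as pd


def add_common_columns(df: pd.DataFrame, year: int) -> pd.DataFrame:
    """Add the geo ID, year and measurement method shared by both branches."""
    df = df.copy()
    df['Year'] = year
    df.insert(0, 'geo_ID', 'country/USA')
    df['Measurement_Method'] = 'dcAggregate/CensusPEPSurvey_PartialAggregate'
    return df


def process(df: pd.DataFrame, year: int, legacy_layout: bool) -> pd.DataFrame:
    # Only the layout-specific work stays inside the if/else.
    if legacy_layout:
        df = df.rename(columns={'Male': 'Count_Person_Male'})  # hypothetical
    else:
        df = df.rename(columns={'POPEST_MALE': 'Count_Person_Male'})  # hypothetical
    # The common steps run once, after the branches.
    return add_common_columns(df, year)


out = process(pd.DataFrame({'Male': [1]}), 1990, legacy_layout=True)
print(list(out.columns))
```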

ajaits · 2024-11-15T13:53:25Z

scripts/us_census/pep/us_pep_sex/process.py

+        for col in float_col.columns.values:
+            df[col] = df[col].astype('int64')
+            df[col] = df[col].astype("str").str.replace("-1", "")
+        df.rename(columns={'SEX': 'Year'}, inplace=True)


Why is the column 'SEX' being renamed to 'Year' here and in the functions below?

It has been renamed accordingly to match the data frame after modification.

ajaits · 2024-11-15T13:59:49Z

scripts/us_census/pep/us_pep_sex/process.py

+                'POPEST_FEM': 'Count_Person_Female',
+                'YEAR': 'Year'
+            })
+        df = df.drop(columns=[


It may be easier to do df.drop(columns=df.columns.difference([])..)

ajaits · 2024-11-15T14:00:21Z

scripts/us_census/pep/us_pep_sex/process.py

+            'Count_Person_Male', 'Count_Person_Female'
+        ]
+        df = pd.read_excel(file_path, skiprows=5, skipfooter=7, header=None)
+        df.columns = column_name


pls use df.rename()

Same as above.

ajaits · 2024-11-15T14:02:43Z

scripts/us_census/pep/us_pep_sex/process.py

+            'July2022Female',
+            'July2023Male',
+            'July2023Female',
+            '2023Total',


Can we generalize this to 2024 and future years?

It has been generalized for future years.
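One way the year-specific names could be generated rather than listed by hand is sketched below. The helper name and the exact 'July{year}Total/Male/Female' pattern are assumptions; the real file layout in process.py may order or name the columns differently.

```python
def yearly_sex_columns(first_year: int, last_year: int) -> list[str]:
    """Build per-year column names for an inclusive range of years.

    Assumes each year contributes Total, Male and Female columns in that
    order; adjust to the actual layout of the source file.
    """
    cols: list[str] = []
    for year in range(first_year, last_year + 1):
        cols += [f'July{year}Total', f'July{year}Male', f'July{year}Female']
    return cols


print(yearly_sex_columns(2023, 2024))
```

With the last year passed in (for example, derived from the file name), a 2024 or later release needs no edit to the list.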

ajaits · 2024-11-15T14:04:44Z

scripts/us_census/pep/us_pep_sex/process.py

+                "sc-est2023-syasex-": _state_2023,
+                "sc-est2023-agesex-": _state_2023,
+                "cc-est2023-agesex-": _county_2023,
+                "cc-est2023-agesex-a": _county_2023


can we also extend to handle future years assuming the same format?

Yes, modified accordingly.

krishnaswamypradeep · 2024-11-25T06:53:15Z

scripts/us_census/pep/us_pep_sex/process.py

+        return df
+    except Exception as e:
+        logging.fatal(f"Error processing the file {file_path}: {e}")
+    except Exception as e:


There are multiple except blocks here. Remove the duplicates.
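For context: Python accepts two consecutive `except Exception` clauses, but the second can never run because the first already catches everything it would. A minimal sketch of the intended shape, using the standard-library logging module and a hypothetical `load_rows` helper (the script itself calls logging.fatal):

```python
import logging


def load_rows(file_path: str):
    """Read a file into lines, logging any failure exactly once."""
    try:
        with open(file_path, encoding='utf-8') as f:
            return f.read().splitlines()
    except FileNotFoundError as e:
        # Specific handlers go first.
        logging.error("Input file not found %s: %s", file_path, e)
    except Exception as e:
        # A single catch-all; a second `except Exception` would be dead code.
        logging.error("Error processing the file %s: %s", file_path, e)
    return None


print(load_rows('/nonexistent/input.csv'))
```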

krishnaswamypradeep · 2024-11-25T06:57:12Z

scripts/us_census/pep/us_pep_sex/process.py

-    return df
+    try:
+        df = pd.read_csv(file_path, thousands=',', skiprows=4, header=None)
+        df.columns = [


Consider implementing a more dynamic approach to identify the required columns instead of hardcoding their order.

We have to assume the positions of the rows and columns anyway, so the column names are hard-coded here as in other places.
Please note that this is wrapped in a try/except block.

Kuru, could you add a comment to the script explaining the reason for fixing the column order? This will help future developers understand the rationale behind the change.
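A sketch of how the fixed order could be documented and guarded at the same time. The `EXPECTED` names and `assign_fixed_columns` are hypothetical; the idea is only that a layout change fails with a clear message instead of mislabelling data.

```python
import pandas as pd

# Hypothetical names for a four-column layout.
EXPECTED = ['Year', 'Total', 'Count_Person_Male', 'Count_Person_Female']


def assign_fixed_columns(df: pd.DataFrame, names: list[str]) -> pd.DataFrame:
    """Name columns by position, checking the column count first."""
    # The file is read with header=None, so there is no header row to match
    # against and names must be assigned by position. If the source adds or
    # removes a column, stop here rather than shift every label.
    if df.shape[1] != len(names):
        raise ValueError(
            f"Expected {len(names)} columns, found {df.shape[1]}; "
            "the source layout may have changed.")
    df = df.copy()
    df.columns = names
    return df


ok = assign_fixed_columns(pd.DataFrame([[2000, 30, 14, 16]]), EXPECTED)
print(list(ok.columns))
```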

krishnaswamypradeep

Hi Kuru, Work on the comments provided.

kurus21 · 2024-11-25T08:23:42Z

Hi Kuru, Work on the comments provided.
Please be informed that the comments have been addressed.

kurus21 · 2024-11-25T08:24:15Z

Please be informed that the comments have been updated.

krishnaswamypradeep

Thanks Kuru. Looks good.

pep_sex_2024 changes made

3bbd500

blunderbuss-gcf bot assigned spiekos Nov 6, 2024

kurus21 added 3 commits November 6, 2024 18:58

pep_sex_2024 changes made

234267e

PEP_SEX Changes are done

fe1bafd

pep_sex changes made

569eec0

pep_sex changes made

4982046

krishnaswamypradeep reviewed Nov 13, 2024

pep_sex changes made

34fee6a

krishnaswamypradeep approved these changes Nov 13, 2024

kurus21 and others added 3 commits November 15, 2024 07:42

SCHEDULES=scripts/us_census/pep/us_pep_sex:USCensusPEP_Sex

9daf98d

USCensusPEP_Sex 20241115 changes

c42c0f5

Merge branch 'master' into pep_sex_2024

78e30af

ajaits reviewed Nov 15, 2024

kurus21 added 3 commits November 21, 2024 11:39

pep sex 20241121 changes

a70fb8a

Merge branch 'pep_sex_2024' of github.com:kurus21/data into pep_sex_2024

58f58ee

pep sex latest 20241121 changes

18873d7

krishnaswamypradeep reviewed Nov 25, 2024

kurus21 added 2 commits November 25, 2024 08:26

pep sex 20241125 changes

4b1dc52

pep sex latest 20241121 changes

6ebbfa0

krishnaswamypradeep approved these changes Nov 25, 2024
