Fix chunking issues in sum_AMEL and reduce_damages #83

JMGilbert · 2023-05-11T15:25:45Z

No description provided.

…Lab/dscim into dscim-v0.4.0_fixes

codecov · 2023-05-11T20:23:48Z

Codecov Report

Merging #83 (43b7843) into dscim-v0.4.0 (152ae4f) will increase coverage by 0.21%.
The diff coverage is 91.30%.

@@               Coverage Diff                @@
##           dscim-v0.4.0      #83      +/-   ##
================================================
+ Coverage         67.99%   68.21%   +0.21%     
================================================
  Files                17       17              
  Lines              1859     1878      +19     
================================================
+ Hits               1264     1281      +17     
- Misses              595      597       +2

Impacted Files	Coverage Δ
src/dscim/preprocessing/preprocessing.py	`71.81% <80.00%> (-0.30%)`	⬇️
src/dscim/preprocessing/input_damages.py	`88.72% <94.44%> (+0.35%)`	⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

kemccusker · 2023-06-28T17:41:37Z

src/dscim/preprocessing/input_damages.py

+    save_path str
+        Path to save concatenated file in .zarr format
+    """
+    paths = glob.glob(f"{damage_dir}/{basename}*")


I usually prefer to explicitly create a list of filenames to open, in case there's extra data files or anything like that. Maybe that's handled in a data check later?

kemccusker · 2023-06-28T17:43:31Z

src/dscim/preprocessing/input_damages.py

+
+    for v in list(data.coords.keys()):
+        if data.coords[v].dtype == object:
+            data.coords[v] = data.coords[v].astype("unicode")


might as well handle this in a unit test to add the coverage and avoid the warning

kemccusker · 2023-06-28T17:43:47Z

src/dscim/preprocessing/input_damages.py

+            data.coords[v] = data.coords[v].astype("unicode")
+    for v in list(data.variables.keys()):
+        if data[v].dtype == object:
+            data[v] = data[v].astype("unicode")


same as above comment

src/dscim/preprocessing/input_damages.py

kemccusker · 2023-06-28T17:48:02Z

src/dscim/preprocessing/preprocessing.py

+                    "ssp": 1,
+                }
+            else:
+                chunkies = {


please add to unit tests

kemccusker · 2023-06-28T17:50:04Z

src/dscim/preprocessing/preprocessing.py

+                .rename(var)
+                .chunk(
+                    {
+                        "batch": 15,


seeing this dictionary of chunks repeated many times confirms that we should generalize at least a little bit - perhaps define a global chunkies and eventually put into a config. This can be done in a later PR.

kemccusker · 2023-07-06T17:52:43Z

We decided to add the test coverage and generalizing of chunk sizes to later PRs.

JMGilbert and others added 5 commits May 11, 2023 08:25

Fix chunking issues in sum_AMEL and reduce_damages

c75f747

Remove unused variable

d0cc037

Sort batches in the right order

031e01c

Merge branch 'dscim-v0.4.0_fixes' of https://github.com/ClimateImpact…

f3129b9

…Lab/dscim into dscim-v0.4.0_fixes

Update test_parse_projection_filesys()

a1e6d95

JMGilbert and others added 8 commits May 16, 2023 14:04

Add region to damages chunk sizes

9e66ab9

Add a function for concatenating labor/energy damage output

fab2286

Chunk coastal

c6d1349

Add unit test for concatenate_damage_output

e187c1d

Import function

7302ed4

chunk concatenated energy/labor and save mortality to float32

e7915c4

fix a small issue

66bd270

Update test_input_damages.py because mortality has saved in float32

58682cc

JMGilbert marked this pull request as ready for review June 12, 2023 15:50

JMGilbert requested review from kemccusker and davidrzhdu June 12, 2023 15:50

Update CHANGELOG.md

970c623

kemccusker reviewed Jun 28, 2023

View reviewed changes

src/dscim/preprocessing/input_damages.py Show resolved Hide resolved

kemccusker reviewed Jun 28, 2023

View reviewed changes

src/dscim/preprocessing/preprocessing.py

"ssp": 1,

}

else:

chunkies = {

Copy link

Member

kemccusker Jun 28, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please add to unit tests

kemccusker reviewed Jun 28, 2023

View reviewed changes

JMGilbert and others added 6 commits June 29, 2023 13:49

Update test_input_damages.py

7fc0f03

create a list of filenames to open in 'concatenate_damage_output'

edc191e

update test_concatenate_damage_output

7574b6e

Ensure that dtype = object is tested

c730d26

Change object coordinate

f0e6ede

update test_concatenate_damage_output

ae69955

Change dtype of batch

43b7843

kemccusker mentioned this pull request Jul 6, 2023

increase test coverage in input_damages.py #87

Open

kemccusker merged commit d9bdae3 into dscim-v0.4.0 Jul 6, 2023

kemccusker deleted the dscim-v0.4.0_fixes branch July 6, 2023 17:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix chunking issues in sum_AMEL and reduce_damages #83

Fix chunking issues in sum_AMEL and reduce_damages #83

JMGilbert commented May 11, 2023

codecov bot commented May 11, 2023 •

edited

Loading

kemccusker Jun 28, 2023

kemccusker Jun 28, 2023

kemccusker Jun 28, 2023

kemccusker Jun 28, 2023

kemccusker Jun 28, 2023

kemccusker commented Jul 6, 2023

Fix chunking issues in sum_AMEL and reduce_damages #83

Fix chunking issues in sum_AMEL and reduce_damages #83

Conversation

JMGilbert commented May 11, 2023

codecov bot commented May 11, 2023 • edited Loading

Codecov Report

kemccusker Jun 28, 2023

Choose a reason for hiding this comment

kemccusker Jun 28, 2023

Choose a reason for hiding this comment

kemccusker Jun 28, 2023

Choose a reason for hiding this comment

kemccusker Jun 28, 2023

Choose a reason for hiding this comment

kemccusker Jun 28, 2023

Choose a reason for hiding this comment

kemccusker commented Jul 6, 2023

codecov bot commented May 11, 2023 •

edited

Loading