Pyudunits2 #1118

ocefpaf · 2024-11-05T14:31:48Z

Possible alternative for #1094

TODO:

add tests for optional cf-units
remove all the workarounds in compliance_checker/cfutil.py (_CCUnits and _units) when pyudunits2 is released.

compliance_checker/cf/util.py

ocefpaf · 2024-11-07T08:17:59Z

@pelson Windows failures apart due to the micromamba bug, and the hacks I made to get the dates to work, this is great! Pyudunits2 can replace cf-units with just a few minor tweaks IMO.

ocefpaf · 2024-11-07T12:19:03Z

This is a bit worrying. The test run time increased a lot with pyudunits2, that may be some of my hacks though.

Develop branch:

python -m pytest -s -rxs -v -k "not integration" compliance_checker  13.12s user 0.30s system 73% cpu 18.220 total

This branch:

python -m pytest -s -rxs -v -k "not integration" compliance_checker  529.36s user 0.62s system 99% cpu 8:54.87 total

codecov · 2024-11-12T20:14:57Z

Codecov Report

Attention: Patch coverage is 35.00000% with 39 lines in your changes missing coverage. Please review.

Project coverage is 71.25%. Comparing base (066a826) to head (5165974).
Report is 1 commits behind head on develop.

Files with missing lines	Patch %	Lines
compliance_checker/cfutil.py	31.91%	32 Missing ⚠️
compliance_checker/cf/util.py	42.85%	4 Missing ⚠️
compliance_checker/ioos.py	0.00%	2 Missing ⚠️
compliance_checker/cf/cf_1_6.py	75.00%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##           develop    #1118       +/-   ##
============================================
- Coverage    81.91%   71.25%   -10.66%     
============================================
  Files           25       25               
  Lines         5224     5263       +39     
  Branches      1163     1169        +6     
============================================
- Hits          4279     3750      -529     
- Misses         644     1166      +522     
- Partials       301      347       +46

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

compliance_checker/cf/cf_1_6.py

compliance_checker/cfutil.py

pelson

Think things moved forward a bit in pyudunits2 since you made this MR. Would love to know how the performance compares with these changes (keeping in mind that pyudunits2 has not yet been optimised, and there is probably a lot of room for improvement)

pelson · 2025-02-12T04:05:17Z

compliance_checker/cf/cf_1_6.py

@@ -2848,7 +2848,9 @@ def _cell_measures_core(self, ds, var, external_set, variable_template):
                        valid = False
                        reasoning.append(conversion_failure_msg)
                    else:
-                        if not cell_measure_units.is_convertible(Unit(f"m{exponent}")):
+                        if not cell_measure_units.is_convertible(


Would be interested to measure the performance of this vs for example:

>>> cm_dimensionality = cell_measure_units.dimensionality() >>> cm_symbolic_dim = {basis_unit._names.symbols[0]: order for basis_unit, order in cm_dimensionality.items()} >>> cm_symbolic_dim == {'m': 2} True

(note the private API usage for now - would like to expose symbol in a friendly form ASAP)

Your suggestion is quite faster (again, this is poor's man benchmark):

%timeit cm_dimensionality = cell_measure_units.dimensionality(); cm_symbolic_dim = {basis_unit._names.symbols[0]: order for basis_unit, order in cm_dimensionality.items()}; cm_symbolic_dim == {'m': 2} 5.3 μs ± 148 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each) %timeit cell_measure_units.is_convertible_to(ut_system.unit("m2")) 1.64 ms ± 24.6 μs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

Thanks for measuring! I totally forgot that I actually made this easier:

>>> cell_measure_units.dimensionality() == {'meter': 2} True

Performance shouldn't be any different.

compliance_checker/cfunits.py

pelson · 2025-02-12T04:12:08Z

compliance_checker/cfunits.py

+            self.units = self.ut_system.unit(units)
+        except (SyntaxError, UnresolvableUnitException) as err:
+            raise ValueError from err
+        self.definition = self.units.expanded()


Would avoid doing this until you have to. On the pyudunits2 side we would cache this within the unit anyway (on first request).

It looks like this isn't actually used presently anyway?

My memory is failing me here now but I believe I added this to create a correspondence with cf_units. However, if the changes above solve the slowdowns I created here with my messy code, I'll probably just ditch cf_units and use only pyudunits2.

It is used in https://github.com/ioos/compliance-checker/pull/1118/files#diff-69711c6343a7337979fd1164476dafdc426e76e16f13d658775d33481fa1b0ecR1381 BTW.

Some very crude benchmarks points to this as the source of slowdowns.

%timeit ut_system.unit("meters").expanded() 90.6 μs ± 2.45 μs per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

vs

%timeit cf_units.Unit("meter").definition 3.4 μs ± 34.3 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)

I believe all the pieces to return a faster definition, like cf-units does, is already in pyudunits2. I think that all we need is to expose the singular name from ._identifier_references of a unit.

Thanks for pointing out where it is used!

That check intentionally doesn't support cm as a unit (even though meters is fine), but it does support inch, and any aliases for that (including in). Am I reading that right?

I guess we need to think about what equality of units means - I think m == meters, but m != 100 cm is probably the way to go. We may want some equivalence test (not the same as equality) where m is equivalent to 100 cm, to go alongside the convertible test that we already have.

With such a definition, I would be able to add hashability to the Unit, such that you can make a set of units {unit1, unit2}, and check that the unit you are testing against is in the set (which would handle the possible variations on spellings etc. out of the box)...

valid_units = {unit('meters'), unit('fathoms')} assert unit('m') in valid_units # Note the different spelling assert unit('cm') not in valid_units

I think all of this needs to go into some documentation in pyudunits2 about common patterns.

All of the above doesn't solve the performance difference. The numbers are really useful, so thank you!

compliance_checker/cfunits.py

ocefpaf mentioned this pull request Nov 5, 2024

Feedback on the conversion and dimensionality functionality pelson/pyudunits2#3

Open

pelson reviewed Nov 6, 2024

View reviewed changes

compliance_checker/cf/util.py Outdated Show resolved Hide resolved

pelson reviewed Nov 6, 2024

View reviewed changes

compliance_checker/cf/util.py Outdated Show resolved Hide resolved

pelson reviewed Nov 6, 2024

View reviewed changes

compliance_checker/cf/util.py Outdated Show resolved Hide resolved

ocefpaf force-pushed the pyudunits2 branch from 66bcddb to 4b8ab7e Compare November 6, 2024 19:13

ocefpaf mentioned this pull request Nov 7, 2024

is this covered by the tests? #1119

Merged

ocefpaf force-pushed the pyudunits2 branch 3 times, most recently from c4a1619 to 85723cb Compare November 12, 2024 20:09

ocefpaf mentioned this pull request Nov 12, 2024

Use pint instead of cf_units #1094

Closed

ocefpaf force-pushed the pyudunits2 branch from 85723cb to 17b248f Compare November 12, 2024 20:13

ocefpaf force-pushed the pyudunits2 branch from 17b248f to f1eb65a Compare November 12, 2024 20:21

ocefpaf commented Nov 12, 2024

View reviewed changes

compliance_checker/cf/cf_1_6.py Outdated Show resolved Hide resolved

ocefpaf commented Nov 12, 2024

View reviewed changes

compliance_checker/cf/cf_1_6.py Outdated Show resolved Hide resolved

This was referenced Nov 13, 2024

Should the unit have a unit system reference? pelson/pyudunits2#7

Open

Performance with cf-units pelson/pyudunits2#2

Open

ocefpaf commented Nov 13, 2024

View reviewed changes

compliance_checker/cfutil.py Outdated Show resolved Hide resolved

ocefpaf force-pushed the pyudunits2 branch from 35c13a6 to 880dac9 Compare November 13, 2024 09:35

ocefpaf commented Nov 13, 2024

View reviewed changes

compliance_checker/cfutil.py Outdated Show resolved Hide resolved

ocefpaf commented Nov 13, 2024

View reviewed changes

compliance_checker/cfutil.py Outdated Show resolved Hide resolved

ocefpaf force-pushed the pyudunits2 branch 6 times, most recently from a5f04cc to 54fbfc8 Compare November 15, 2024 08:07

ocefpaf force-pushed the pyudunits2 branch from 54fbfc8 to ca1c28c Compare November 21, 2024 17:50

ocefpaf force-pushed the pyudunits2 branch from ca1c28c to 815aca6 Compare November 21, 2024 17:52

ocefpaf added 5 commits February 7, 2025 08:40

pyudunits2

fb6301a

bump pyudunts2

145ced5

add cf-units test

3b11d09

revert b/c we are no longer using cftime

fd04d25

refactor

0e7ef12

ocefpaf force-pushed the pyudunits2 branch from 815aca6 to 0e7ef12 Compare February 7, 2025 07:40

pelson reviewed Feb 12, 2025

View reviewed changes

ocefpaf added 4 commits February 12, 2025 18:36

use released pyudunits2

4fdaf8f

catch NotImplemented as non-convertible units

648affc

review suggestions

f503710

review suggestions

c765a73

This was referenced Feb 13, 2025

Add docs, and a section on common usecases and approaches pelson/pyudunits2#19

Open

Focus on performance of unit.definition pelson/pyudunits2#20

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pyudunits2 #1118

Pyudunits2 #1118

ocefpaf commented Nov 5, 2024 •

edited

Loading

ocefpaf commented Nov 7, 2024

ocefpaf commented Nov 7, 2024

codecov bot commented Nov 12, 2024 •

edited

Loading

pelson left a comment •

edited

Loading

pelson Feb 12, 2025

ocefpaf Feb 12, 2025 •

edited

Loading

pelson Feb 13, 2025

pelson Feb 12, 2025

ocefpaf Feb 12, 2025

ocefpaf Feb 12, 2025

ocefpaf Feb 12, 2025 •

edited

Loading

pelson Feb 13, 2025

Pyudunits2 #1118

Are you sure you want to change the base?

Pyudunits2 #1118

Conversation

ocefpaf commented Nov 5, 2024 • edited Loading

ocefpaf commented Nov 7, 2024

ocefpaf commented Nov 7, 2024

codecov bot commented Nov 12, 2024 • edited Loading

Codecov Report

pelson left a comment • edited Loading

Choose a reason for hiding this comment

pelson Feb 12, 2025

Choose a reason for hiding this comment

ocefpaf Feb 12, 2025 • edited Loading

Choose a reason for hiding this comment

pelson Feb 13, 2025

Choose a reason for hiding this comment

pelson Feb 12, 2025

Choose a reason for hiding this comment

ocefpaf Feb 12, 2025

Choose a reason for hiding this comment

ocefpaf Feb 12, 2025

Choose a reason for hiding this comment

ocefpaf Feb 12, 2025 • edited Loading

Choose a reason for hiding this comment

pelson Feb 13, 2025

Choose a reason for hiding this comment

ocefpaf commented Nov 5, 2024 •

edited

Loading

codecov bot commented Nov 12, 2024 •

edited

Loading

pelson left a comment •

edited

Loading

ocefpaf Feb 12, 2025 •

edited

Loading

ocefpaf Feb 12, 2025 •

edited

Loading