Avoid using full equality (`==`) to compare float, avoid `assert_array_equal` compare float array #4159

DanielYang59 · 2024-11-09T03:04:29Z

Summary

Avoid using full equality == to compare float, to fix tests use equality to compare floating point numbers #4158
Avoid assert_array_equal on int array:

Assert fails with numerical imprecision with floats:
Use assert_allclose or one of the nulp (number of floating point values) functions for these cases instead:
Tweak _proj implementation, ~3x speedup
(Partially) Replace sequence of float comparison with == (list/tuple/dict ...):

pymatgen/tests/core/test_bonds.py

Line 56 in bd9fba9

assert obtain_all_bond_lengths("C", "C") == {1.0: 1.54, 2.0: 1.34, 3.0: 1.2}
Other type/comment tweaks

DanielYang59 · 2024-11-09T03:33:42Z

src/pymatgen/transformations/advanced_transformations.py

    """Get vector projection (np.ndarray) of vector b (np.ndarray)
    onto vector a (np.ndarray).
    """
-    return (b.T @ (a / np.linalg.norm(a))) * (a / np.linalg.norm(a))
+    return (np.dot(b, a) / np.dot(a, a)) * a


This new implementation is slightly more readable (personal taste) and gives ~4x speedup, reference (the following is a project to b):

Original Implementation Time: 420.86 ms New Implementation Time: 101.28 ms

Test script (by GPT):

import numpy as np from numpy.typing import NDArray from time import perf_counter_ns def _proj_original(b: NDArray, a: NDArray) -> NDArray: return (b.T @ (a / np.linalg.norm(a))) * (a / np.linalg.norm(a)) def _proj_new(b: NDArray, a: NDArray) -> NDArray: return (np.dot(b, a) / np.dot(a, a)) * a def verify_projection(): a = np.random.rand(3) b = np.random.rand(3) proj1 = _proj_original(b, a) proj2 = _proj_new(b, a) assert np.allclose(proj1, proj2) def benchmark_projections(n_iter=100000): a = np.random.rand(3) b = np.random.rand(3) # Measure original implementation start_time = perf_counter_ns() for _ in range(n_iter): _proj_original(b, a) time_original = perf_counter_ns() - start_time # Measure new implementation start_time = perf_counter_ns() for _ in range(n_iter): _proj_new(b, a) time_new = perf_counter_ns() - start_time print(f"Original Implementation Time: {time_original / 1e6:.2f} ms") print(f"New Implementation Time: {time_new / 1e6:.2f} ms") verify_projection() print("Benchmarking both implementations...") benchmark_projections()

…ymatgen into 4158-fix-eq-check

DanielYang59 · 2024-11-10T04:09:57Z

tests/analysis/xas/test_spectrum.py

@@ -38,22 +38,22 @@ def setUp(self):
        self.site2_xanes = XAS.from_dict(site2_xanes_dict)

    def test_e0(self):
-        assert approx(self.k_xanes.e0) == 7728.565


I believe a == pytest.approx(ref_val) is the recommended usage plus it's slightly more readable. However do note approx is asymmetric:

a == pytest.approx(b, rel=1e-6, abs=1e-12): True if the relative tolerance is met w.r.t. b or if the absolute tolerance is met. Because the relative tolerance is only calculated w.r.t. b, this test is asymmetric and you can think of b as the reference value. In the special case that you explicitly specify an absolute tolerance but not a relative tolerance, only the absolute tolerance is considered.

tests/analysis/chemenv/coordination_environments/test_coordination_geometries.py

DanielYang59 added 9 commits November 9, 2024 11:03

replace some float equality check

588ceb8

explicit encoding

0b97cb0

charge is also float

82f3431

enhance types

389c59b

access gcd via math namespace as math is already imported

1d22fee

put dunder method to top

84e3b70

fix typo

ea6089e

tweak _proj implementation

e264890

Merge branch 'master' into 4158-fix-eq-check

95a6192

DanielYang59 commented Nov 9, 2024

View reviewed changes

DanielYang59 added 3 commits November 9, 2024 11:40

support array like

e431882

Merge branch '4158-fix-eq-check' of https://github.com/DanielYang59/p…

8f30f13

…ymatgen into 4158-fix-eq-check

add arg and return type

e6ea809

DanielYang59 changed the title ~~Avoid using full equality to check float in unit test~~ Avoid using full equality to check float in unit test of advanced_transformations Nov 9, 2024

DanielYang59 marked this pull request as ready for review November 9, 2024 05:39

DanielYang59 requested review from shyuep and mkhorton as code owners November 9, 2024 05:39

tweak type

bf0ff16

DanielYang59 marked this pull request as draft November 10, 2024 03:11

avoid more == for float comparison

5c9992e

DanielYang59 changed the title ~~Avoid using full equality to check float in unit test of advanced_transformations~~ Avoid using full equality to compare float Nov 10, 2024

DanielYang59 changed the title ~~Avoid using full equality to compare float~~ Avoid using full equality (==) to compare float Nov 10, 2024

replace some == in test, more left to do

4920eb7

DanielYang59 commented Nov 10, 2024

View reviewed changes

DanielYang59 added 6 commits November 10, 2024 12:29

replace more in core test

f343503

replace more in test

808c495

replace even more

c0692dd

replace last batch

48e0ead

clean up assert approx

cdff78d

replace pytest.approx with approx

7eb7caa

DanielYang59 added 3 commits November 10, 2024 15:29

fix approx in condition block

d12a07b

replace sci notation

4552881

suppress buggy ruff sim300

30e0f66

DanielYang59 commented Nov 10, 2024

View reviewed changes

tests/analysis/chemenv/coordination_environments/test_coordination_geometries.py Show resolved Hide resolved

DanielYang59 added 18 commits November 10, 2024 17:22

number_of_permutations to int

4f0ff82

revert change for formula_double_format, in favor of another PR

24e81d2

c_indices seems to be int

8ef27dc

use sci notation for crazily large int

5ac947d

simplify numpy.testing usage

2bde949

set tol as pos arg

16fa94d

avoid array equal for list of str

300dc30

assert_array_equal should not be used on float array

d4309b7

fix module level var name

8cbfcfc

more assert_array_equal on complex number

dbe8659

simplify approx on dict value

5ff4248

avoid module level var when it's used only 3 times

1f01241

pytext.approx to approx

32929d4

fix approx on nested dict

16dbec3

avoid unnecessary convert to np.array

3df99ab

array_equal to all close for float array

fd573cd

assert all close for float array

e46dbf9

capital class attrib is treated as constant

857581b

DanielYang59 changed the title ~~Avoid using full equality (==) to compare float~~ Avoid using full equality (==) to compare float, avoid assert_array_equal compare float array Nov 11, 2024

DanielYang59 marked this pull request as ready for review November 11, 2024 03:43

DanielYang59 added 6 commits November 13, 2024 10:18

Merge remote-tracking branch 'upstream/master' into 4158-fix-eq-check

7787a25

Merge remote-tracking branch 'upstream/master' into 4158-fix-eq-check

5700f3b

Merge branch 'master' into 4158-fix-eq-check

79d3ffc

Merge remote-tracking branch 'upstream/master' into 4158-fix-eq-check

1724f9e

Merge branch 'master' into 4158-fix-eq-check

e7e5209

Merge branch 'master' into 4158-fix-eq-check

626c3fb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid using full equality (`==`) to compare float, avoid `assert_array_equal` compare float array #4159

Avoid using full equality (`==`) to compare float, avoid `assert_array_equal` compare float array #4159

DanielYang59 commented Nov 9, 2024 •

edited

Loading

DanielYang59 Nov 9, 2024 •

edited

Loading

DanielYang59 Nov 10, 2024 •

edited

Loading

Avoid using full equality (==) to compare float, avoid assert_array_equal compare float array #4159

Are you sure you want to change the base?

Avoid using full equality (==) to compare float, avoid assert_array_equal compare float array #4159

Conversation

DanielYang59 commented Nov 9, 2024 • edited Loading

Summary

DanielYang59 Nov 9, 2024 • edited Loading

Choose a reason for hiding this comment

DanielYang59 Nov 10, 2024 • edited Loading

Choose a reason for hiding this comment

Avoid using full equality (`==`) to compare float, avoid `assert_array_equal` compare float array #4159

Avoid using full equality (`==`) to compare float, avoid `assert_array_equal` compare float array #4159

DanielYang59 commented Nov 9, 2024 •

edited

Loading

DanielYang59 Nov 9, 2024 •

edited

Loading

DanielYang59 Nov 10, 2024 •

edited

Loading