docs: clarify that `sqrt` must be correctly rounded in accordance with IEEE 754 #882

kgryte · 2025-01-09T11:06:29Z

This PR:

- closes #826 (Specify correct rounding for sqrt) and closes #830 (Minor clarification on allowed rounding mode).
- clarifies that `sqrt` should follow IEEE 754 and always return a correctly rounded result. This was implied (i.e., conforming implementations should be IEEE 754 compliant), but never made explicit.
- clarifies that accuracy requirements apply to real-valued floating-point operands and not to complex-valued floating-point operands.
- adds missing functions to the list of functions not covered by accuracy requirements.
- specifies that subnormals may or may not be supported.
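To make the "correctly rounded" requirement concrete, here is a minimal sketch (not part of the PR) of how a single binary64 `sqrt` result could be checked exactly under round-to-nearest. The helper name `is_correctly_rounded_sqrt` is hypothetical, and the sketch assumes Python 3.9+ for `math.nextafter`.

```python
import math
from fractions import Fraction


def is_correctly_rounded_sqrt(x: float, y: float) -> bool:
    """Return True if y is the float nearest to the exact square root of x.

    Assumes round-to-nearest and a finite x >= 0. All comparisons use exact
    rational arithmetic, so the check itself introduces no rounding error.
    """
    if x == 0.0:
        return y == 0.0
    if not (math.isfinite(y) and y > 0.0):
        return False
    below = math.nextafter(y, 0.0)
    above = math.nextafter(y, math.inf)
    if math.isinf(above):
        # y is the largest finite float, which is far larger than the
        # square root of any finite float.
        return False
    # Midpoints between y and its two neighboring floats.
    lo = (Fraction(below) + Fraction(y)) / 2
    hi = (Fraction(y) + Fraction(above)) / 2
    # y is nearest to sqrt(x) iff lo <= sqrt(x) <= hi, i.e. lo^2 <= x <= hi^2.
    return lo * lo <= Fraction(x) <= hi * hi
```

For example, `is_correctly_rounded_sqrt(4.0, 2.0)` holds, while the float just above `2.0` fails the check. The same idea could be applied to results copied back from an accelerator, one value at a time.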

rgommers · 2025-01-09T11:14:33Z

Given #826 (comment) says that the default for single-precision sqrt in CUDA isn't to round with this precision, I'm not sure how useful/achievable this will be. @leofang any comments on that?

hpkfft · 2025-01-09T19:27:02Z

The default for single-precision divide in CUDA isn't to round with this precision either.
Nevertheless, this spec did the right thing by requiring CUDA libraries to be compiled with the flag that enables correct rounding.
It should be equally useful/achievable for sqrt.

spec/draft/design_topics/accuracy.rst

hpkfft · 2025-01-09T19:54:28Z

I want to comment that correctly rounded is defined by IEEE 754-2019 as follows:

> correct rounding: This standard’s method of converting an infinitely precise result to a floating-point number, as determined by the applicable rounding direction. A floating-point number so obtained is said to be correctly rounded.
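As a rough illustration of "as determined by the applicable rounding direction", the sketch below rounds an exact rational to binary64 in three directions. The function `round_to_float` and its direction labels are made up for this example; they are not identifiers from IEEE 754 or from any library. It relies on CPython's `int / int` true division being correctly rounded to nearest, and on `math.nextafter` (Python 3.9+).

```python
import math
from fractions import Fraction


def round_to_float(exact: Fraction, direction: str) -> float:
    """Round an exact rational (within the finite float range) to binary64."""
    # float(Fraction) performs an int/int true division, which CPython
    # rounds to nearest, ties to even.
    nearest = float(exact)
    if direction == "nearest-even":
        return nearest
    if direction == "toward-negative":
        # Step down only if the nearest float overshoots the exact value.
        if Fraction(nearest) <= exact:
            return nearest
        return math.nextafter(nearest, -math.inf)
    if direction == "toward-positive":
        # Step up only if the nearest float undershoots the exact value.
        if Fraction(nearest) >= exact:
            return nearest
        return math.nextafter(nearest, math.inf)
    raise ValueError(f"unknown direction: {direction!r}")


third = Fraction(1, 3)
down = round_to_float(third, "toward-negative")
up = round_to_float(third, "toward-positive")
nearest = round_to_float(third, "nearest-even")
print(down, nearest, up)
```

Each of the three outputs is "correctly rounded" in the IEEE 754 sense for its own direction. The two directed results bracket the exact value and are adjacent floats, and the nearest result coincides with one of them.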

leofang · 2025-01-09T21:24:39Z

> Given #826 (comment) says that the default for single-precision sqrt in CUDA isn't to round with this precision, I'm not sure how useful/achievable this will be. @leofang any comments on that?

The default is to do correct rounding, as @hpkfft noted. However, it requires not setting `-ftz=true`, and mandating that is something we've discussed avoiding, e.g.:

array-api/src/array_api_stubs/_draft/elementwise_functions.py

Lines 1478 to 1481 in 532db5b

    
    .. note::

       IEEE 754-2019 requires support for subnormal (a.k.a., denormal) numbers, which are useful for supporting gradual underflow. However, hardware support for subnormal numbers is not universal, and many platforms (e.g., accelerators) and compilers support toggling denormals-are-zero (DAZ) and/or flush-to-zero (FTZ) behavior to increase performance and to guard against timing attacks.

       Accordingly, conforming implementations may vary in their support for subnormal numbers.

Therefore, I do not think the "correctly rounded" requirement is achievable. Certainly it is not CuPy's default.

Ref: https://docs.nvidia.com/cuda/floating-point/index.html#compiler-flags

leofang

(see above)

hpkfft · 2025-01-10T00:21:31Z

Denormal result support (as opposed to flush-to-zero (FTZ)) is orthogonal. It's not specific to square root, but rather applies to every operation: addition, subtraction, and so on.
Yes, flushing to zero gives zero, which is neither the correctly rounded result nor the nearest representable value.
I think the note makes it clear that this is a global exception to requiring IEEE 754-2019 conformance. (Of course, suggestions to improve/clarify the wording are welcome if you feel that would be helpful.)
Otherwise, when the result is normal, I think it would be strange for this spec to deviate from the IEEE 754-2019 requirement of correct rounding for add, sub, mul, div, and sqrt. As @kgryte pointed out, conformance was already implied by this spec, and this PR merely makes it explicit.

hpkfft · 2025-01-10T01:09:38Z

Oh, I think the note you referenced is not in the spec itself?
I agree that the spec should explicitly mention that denormal support is not a requirement, i.e., ftz and/or daz are acceptable. [The former applies to denormal results and the latter applies to denormal inputs.]
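The FTZ/DAZ distinction can be sketched on the host with a small simulation. This is only an illustration in binary64: the helpers `flush_subnormal`, `mul_ftz`, and `sqrt_daz` are hypothetical names, and the sketch assumes the Python interpreter itself runs with subnormal support enabled (the usual CPU default).

```python
import math
import sys

MIN_NORMAL = sys.float_info.min  # smallest positive normal binary64 value


def flush_subnormal(value: float) -> float:
    """Replace a subnormal value with a signed zero; pass others through."""
    if value != 0.0 and abs(value) < MIN_NORMAL:
        return math.copysign(0.0, value)
    return value


def mul_ftz(a: float, b: float) -> float:
    """Multiply, then flush a subnormal *result* (FTZ-style)."""
    return flush_subnormal(a * b)


def sqrt_daz(x: float) -> float:
    """Flush a subnormal *input* (DAZ-style), then take the square root."""
    return math.sqrt(flush_subnormal(x))


subnormal = MIN_NORMAL / 2  # exactly representable, but subnormal
print(MIN_NORMAL * 0.5, mul_ftz(MIN_NORMAL, 0.5))
print(math.sqrt(subnormal), sqrt_daz(subnormal))
```

Within a single format, the square root of a subnormal input is a normal number (for binary64, the square root of the smallest subnormal, 2^-1074, is 2^-537), so for `sqrt` it is the flushing of inputs rather than of results that changes the answer.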

But, I would suggest that your observation ought to be considered a separate bug/PR and not block this PR.
It might be the case that denormal support requires discussion.

leofang · 2025-01-21T15:02:29Z

Hi @hpkfft @kgryte, sorry for my late reply. I reached out to our math team (at NVIDIA) asking for clarification, and I believe what @hpkfft stated above is generally correct. Denormals can be flushed while still keeping the rounding behavior correct. At least for sqrt, this is documented for the PTX instruction:
https://docs.nvidia.com/cuda/parallel-thread-execution/#floating-point-instructions-sqrt
(both `sqrt.rnd.f32` and `sqrt.rnd.ftz.f32` are considered IEEE compliant). With this understanding, @kgryte could you please apply @hpkfft's suggestions, and then we can approve/merge?
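For spot-checking single-precision results on the host, one possible approach is sketched below. It builds a binary32 reference by computing the square root in binary64 and rounding once more (binary64 carries more than twice the precision of binary32, which is the standard condition under which this double rounding is harmless for `sqrt`), and then verifies the candidate exactly. All helper names are hypothetical, and nothing here calls CUDA or PTX.

```python
import math
import struct
from fractions import Fraction


def to_f32(value: float) -> float:
    """Round a Python float (binary64) to the nearest binary32 value."""
    return struct.unpack("<f", struct.pack("<f", value))[0]


def f32_bits(value: float) -> int:
    """Return the binary32 bit pattern of a binary32-representable value."""
    return struct.unpack("<I", struct.pack("<f", value))[0]


def f32_from_bits(bits: int) -> float:
    """Return the binary32 value with the given bit pattern."""
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def sqrt_f32_reference(x: float) -> float:
    """Reference binary32 sqrt: compute in binary64, then round to binary32."""
    return to_f32(math.sqrt(x))


def is_nearest_f32_sqrt(x: float, y: float) -> bool:
    """Exact check that binary32 y is nearest to sqrt(x).

    Assumes x > 0 and that y is positive, finite, and below the largest
    binary32 value, so that both neighbors of y exist.
    """
    bits = f32_bits(y)
    below = Fraction(f32_from_bits(bits - 1))
    above = Fraction(f32_from_bits(bits + 1))
    lo = (below + Fraction(y)) / 2
    hi = (Fraction(y) + above) / 2
    return lo * lo <= Fraction(x) <= hi * hi


x = to_f32(0.1)
print(is_nearest_f32_sqrt(x, sqrt_f32_reference(x)))
```

A value returned by an accelerator could be passed as `y` in place of the reference to check whether it is the nearest binary32 value.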

kgryte · 2025-02-06T05:06:26Z

@leofang and @hpkfft, I've updated the PR accordingly. If you can give it a once-over, that would be appreciated!

hpkfft

Looks great! Thank you.

kgryte · 2025-02-06T05:47:04Z

Thanks for the review @hpkfft! As the changes implement the suggestions discussed above, I'll go ahead and merge. Thanks all!

leofang · 2025-02-06T15:05:41Z

Thanks to you both 🙏

docs: require that sqrt be correctly rounded in accordance with IEEE 754

Closes: data-apis#826

2296a03

kgryte mentioned this pull request Jan 9, 2025

Specify correct rounding for sqrt #826

Closed

kgryte added this to the v2024 milestone Jan 9, 2025

hpkfft reviewed Jan 9, 2025

hpkfft reviewed Jan 9, 2025

leofang requested changes Jan 9, 2025

kgryte added 4 commits February 5, 2025 20:48

Merge branch 'main' of https://github.com/data-apis/array-api into feat/sqrt-accuracy

a2a0e63

docs: update copy to use "correctly rounded"

ced123a

docs: include blurb regarding subnormal support

10c6d26

docs: specify that results must be "correctly rounded"

c85425e

kgryte mentioned this pull request Feb 6, 2025

Minor clarification on allowed rounding mode #830

Closed

kgryte requested a review from leofang February 6, 2025 05:05

hpkfft approved these changes Feb 6, 2025

kgryte removed the request for review from leofang February 6, 2025 05:46

kgryte merged commit 4dccde5 into data-apis:main Feb 6, 2025
3 checks passed

kgryte deleted the feat/sqrt-accuracy branch February 6, 2025 05:48
