Losses all take a `reduction` argument, which can be either `sum` or `sum_over_batch_size`. `sum` is fairly straightforward, and there are no surprises with the implementation: a summation, or a weighted summation if `sample_weight` or `mask` is present. `sum_over_batch_size` is ambiguously named and inconsistent with the implementation.
Ambiguity: I originally thought it meant "sum over the batch dimension". Looking at previous implementations, it seems to mean "sum divided by the batch size". The Keras 3.0 implementation looks like it just computes a weighted mean.
If it's meant to be the mean, why not call it "mean"? If it's not meant to be the mean, then consider this a bug report, because computing the mean is exactly what the current implementation does.
Note that I'm not just being pedantic - I want to submit a PR that fixes masking (multiplication by zero is not masking when infs and nans are around), but I need to know exactly what the implementation is supposed to be in order to fix this.
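To make the ambiguity concrete, here is a small NumPy sketch (the values and variable names are mine, not taken from the Keras source) showing how the readings of `sum_over_batch_size` diverge as soon as `sample_weight` is non-uniform:

```python
import numpy as np

# Illustrative per-sample loss values and sample weights for a batch of 4.
losses = np.array([1.0, 2.0, 3.0, 4.0])
sample_weight = np.array([1.0, 1.0, 0.0, 0.0])  # last two samples weighted out

weighted = losses * sample_weight

# "sum": a plain weighted summation.
reduction_sum = weighted.sum()                           # 3.0

# Reading 1 of "sum_over_batch_size": weighted sum / batch size.
sum_over_batch_size = weighted.sum() / losses.shape[0]   # 3.0 / 4 = 0.75

# Reading 2: a weighted mean (divide by the sum of the weights instead).
weighted_mean = weighted.sum() / sample_weight.sum()     # 3.0 / 2 = 1.5

print(reduction_sum, sum_over_batch_size, weighted_mean)
```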
`sum_over_batch_size` and `mean` are the same thing. It should more naturally be called `mean`, but we kept the Keras 2 terminology for backwards compatibility.
For anyone coming back here in the future: I actually prefer `sum_over_batch_size` to `mean` now, because the weighted interpretations are different. While a `sum` reduction with `sample_weight` is interpreted as a weighted sum, a `sum_over_batch_size` is interpreted as the weighted sum divided by the number of unmasked entries (the "batch size"), not the weighted mean.
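A quick numerical sketch of that distinction (illustrative values only, not taken from the Keras code):

```python
import numpy as np

losses = np.array([1.0, 2.0, 3.0, 4.0])
sample_weight = np.array([2.0, 2.0, 2.0, 2.0])
mask = np.array([1.0, 1.0, 1.0, 0.0])  # last entry masked out

weighted = losses * sample_weight * mask          # [2, 4, 6, 0], sum = 12

# Weighted sum divided by the number of unmasked entries (the "batch size").
sum_over_batch_size = weighted.sum() / mask.sum()                 # 12 / 3 = 4.0

# Weighted mean: divide by the total effective weight instead.
weighted_mean = weighted.sum() / (sample_weight * mask).sum()     # 12 / 6 = 2.0

print(sum_over_batch_size, weighted_mean)
```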