Skip to content

Commit

Permalink
Update fft_cuda.h
Browse files Browse the repository at this point in the history
  • Loading branch information
cliffburdick authored Feb 5, 2025
1 parent 152d22b commit 2a18445
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion include/matx/transforms/fft/fft_cuda.h
Original file line number Diff line number Diff line change
Expand Up @@ -216,7 +216,7 @@ template <typename OutTensorType, typename InTensorType> class matxCUDAFFTPlan_t
if (i.IsContiguous() && o.IsContiguous()) {
// Previously we used cudaMemGetInfo to get free memory to determine batch size. This can be very slow,
// and for small FFTs this call can create extra latency. For now we'll just assume the user knows what
// they're doing and not try to batch FFTs that are too small
// they're doing and not try to batch FFTs that are too large
const auto shape = i.Shape();
params.batch = std::accumulate(std::begin(shape), std::end(shape) - 1, 1, std::multiplies<index_t>());
params.batch_dims = i.Rank() - 1;
Expand Down

0 comments on commit 2a18445

Please sign in to comment.