Fix some possible thread-id overflow calculations #17473

davidwendt · 2024-12-02T16:51:29Z

Description

Fixes some possible thread-id calculations or usages that may possibly overflow int32 type or size_type.
Reference #10368

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

copy-pr-bot · 2024-12-02T16:51:33Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

davidwendt · 2024-12-02T16:51:57Z

/ok to test

davidwendt · 2024-12-02T16:52:53Z

cpp/src/transform/row_bit_count.cu

@@ -413,7 +413,7 @@ CUDF_KERNEL void compute_segment_sizes(device_span<column_device_view const> col
                                       size_type max_branch_depth)
 {
  extern __shared__ row_span thread_branch_stacks[];
-  int const tid = threadIdx.x + blockIdx.x * blockDim.x;
+  auto const tid = static_cast<size_type>(cudf::detail::grid_1d::global_thread_id());


Merely for clarity. Prefer the type be size_type rather than int.

davidwendt · 2024-12-02T16:53:43Z

cpp/src/transform/jit/kernel.cu

-  thread_index_type const stride = blockDim.x * gridDim.x;
+  auto const block_size          = static_cast<thread_index_type>(blockDim.x);
+  thread_index_type const start  = threadIdx.x + blockIdx.x * block_size;
+  thread_index_type const stride = block_size * gridDim.x;


This is explicit up-casting for multiplication.

davidwendt · 2024-12-02T16:55:33Z

cpp/src/partitioning/partitioning.cu

@@ -138,7 +138,7 @@ CUDF_KERNEL void compute_row_partition_numbers(row_hasher_t the_hasher,
  auto const stride = cudf::detail::grid_1d::grid_stride();

  // Initialize local histogram
-  size_type partition_number = threadIdx.x;
+  thread_index_type partition_number = threadIdx.x;
  while (partition_number < num_partitions) {
    shared_partition_sizes[partition_number] = 0;
    partition_number += blockDim.x;


If num_partitions is close to max<size_type> then partition_number += blockDim.x could overlfow size_type.

davidwendt · 2024-12-02T20:23:02Z

/ok to test

davidwendt · 2024-12-04T18:54:43Z

/ok to test

davidwendt · 2024-12-04T21:35:08Z

/ok to test

davidwendt · 2024-12-05T18:53:34Z

/ok to test

davidwendt · 2024-12-06T13:36:04Z

/ok to test

shrshi

Looks good to me!

…erflow

davidwendt · 2024-12-11T22:32:24Z

/merge

Fix some possible thread-id overflow calculations

6ba8176

davidwendt added bug Something isn't working 2 - In Progress Currently a work in progress libcudf Affects libcudf (C++/CUDA) code. non-breaking Non-breaking change labels Dec 2, 2024

davidwendt self-assigned this Dec 2, 2024

davidwendt commented Dec 2, 2024

View reviewed changes

davidwendt mentioned this pull request Dec 2, 2024

Prevent grid stride loop overflow in libcudf kernels #10368

Open

add copy-if-else

d5f028a

Merge branch 'branch-25.02' into tid-overflow

a86ae23

Merge branch 'branch-25.02' into tid-overflow

5a48d96

davidwendt added 2 commits December 5, 2024 13:47

Merge branch 'branch-25.02' into tid-overflow

29246b3

add tdigest change

5ad53ff

davidwendt added 2 commits December 6, 2024 08:32

Merge branch 'branch-25.02' into tid-overflow

356eb3b

Merge branch 'branch-25.02' into tid-overflow

0eeb9a1

davidwendt added 3 - Ready for Review Ready for review by team and removed 2 - In Progress Currently a work in progress labels Dec 6, 2024

Merge branch 'branch-25.02' into tid-overflow

5d58e2f

davidwendt marked this pull request as ready for review December 9, 2024 18:59

davidwendt requested a review from a team as a code owner December 9, 2024 18:59

davidwendt requested a review from shrshi December 9, 2024 18:59

davidwendt requested a review from vuule December 9, 2024 18:59

vuule approved these changes Dec 9, 2024

View reviewed changes

shrshi approved these changes Dec 10, 2024

View reviewed changes

davidwendt and others added 4 commits December 10, 2024 12:37

Merge branch 'branch-25.02' into tid-overflow

3547734

Merge branch 'branch-25.02' into tid-overflow

fc284b0

Merge branch 'tid-overflow' of github.com:davidwendt/cudf into tid-ov…

3c6357d

…erflow

Merge branch 'branch-25.02' into tid-overflow

d2c302e

rapids-bot bot merged commit 63c5a38 into rapidsai:branch-25.02 Dec 11, 2024
105 checks passed

davidwendt deleted the tid-overflow branch December 11, 2024 22:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix some possible thread-id overflow calculations #17473

Fix some possible thread-id overflow calculations #17473

davidwendt commented Dec 2, 2024

copy-pr-bot bot commented Dec 2, 2024

davidwendt commented Dec 2, 2024

davidwendt Dec 2, 2024

davidwendt Dec 2, 2024

davidwendt Dec 2, 2024

davidwendt commented Dec 2, 2024

davidwendt commented Dec 4, 2024

davidwendt commented Dec 4, 2024

davidwendt commented Dec 5, 2024

davidwendt commented Dec 6, 2024

shrshi left a comment

davidwendt commented Dec 11, 2024

Fix some possible thread-id overflow calculations #17473

Fix some possible thread-id overflow calculations #17473

Conversation

davidwendt commented Dec 2, 2024

Description

Checklist

copy-pr-bot bot commented Dec 2, 2024

davidwendt commented Dec 2, 2024

davidwendt Dec 2, 2024

Choose a reason for hiding this comment

davidwendt Dec 2, 2024

Choose a reason for hiding this comment

davidwendt Dec 2, 2024

Choose a reason for hiding this comment

davidwendt commented Dec 2, 2024

davidwendt commented Dec 4, 2024

davidwendt commented Dec 4, 2024

davidwendt commented Dec 5, 2024

davidwendt commented Dec 6, 2024

shrshi left a comment

Choose a reason for hiding this comment

davidwendt commented Dec 11, 2024