Replace CUB_DETAIL_COUNT by _CCCL_PP_COUNT #2970

Open: 1 commit to be merged into base: main

bernhardmgruber

CUB_DETAIL_COUNT was removed at some point, but not replaced everywhere.
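
For readers unfamiliar with this kind of macro: a preprocessor "count" macro typically expands to the number of arguments it was given, which lets other macros dispatch on arity. The sketch below illustrates the general technique only. All `EXAMPLE_*` names are hypothetical, and this is not the actual definition of `CUB_DETAIL_COUNT` or `_CCCL_PP_COUNT`, whose real implementations may differ (for example in the maximum supported argument count).

```cpp
// Hypothetical illustration of a variadic argument-counting macro.
// These EXAMPLE_* names are assumptions made for this sketch and are
// NOT the CCCL/CUB definitions.

// Extra expansion step; helps preprocessors that would otherwise pass
// __VA_ARGS__ through as a single token.
#define EXAMPLE_PP_EXPAND(x) x

// Selects the 9th argument. The caller pads the argument list with a
// descending sequence so that the 9th slot holds the argument count.
#define EXAMPLE_PP_ARG_N(_1, _2, _3, _4, _5, _6, _7, _8, N, ...) N

// Supports 1 to 8 arguments. An empty argument list also yields 1,
// because the preprocessor sees one (empty) argument.
#define EXAMPLE_PP_COUNT(...) \
  EXAMPLE_PP_EXPAND(EXAMPLE_PP_ARG_N(__VA_ARGS__, 8, 7, 6, 5, 4, 3, 2, 1, 0))

static_assert(EXAMPLE_PP_COUNT(a) == 1, "one argument");
static_assert(EXAMPLE_PP_COUNT(a, b, c) == 3, "three arguments");

// Typical use: dispatch to an arity-specific macro by pasting the count
// onto a common prefix.
#define EXAMPLE_PP_CAT_(a, b) a##b
#define EXAMPLE_PP_CAT(a, b) EXAMPLE_PP_CAT_(a, b)

#define EXAMPLE_SUM_1(a) (a)
#define EXAMPLE_SUM_2(a, b) ((a) + (b))
#define EXAMPLE_SUM(...) \
  EXAMPLE_PP_CAT(EXAMPLE_SUM_, EXAMPLE_PP_COUNT(__VA_ARGS__))(__VA_ARGS__)

static_assert(EXAMPLE_SUM(1, 2) == 3, "dispatches to EXAMPLE_SUM_2");
```

Because such a macro is referenced purely by name from other macros, a leftover use of a removed name only fails when that code path is actually preprocessed, which is one plausible way stale references can go unnoticed.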

@bernhardmgruber requested review from a team as code owners on November 27, 2024 00:07
@bernhardmgruber added the cub (For all items related to CUB) and benchmark (Feature related to benchmarking our libraries) labels on Nov 27, 2024

🟩 CI finished in 2h 08m: Pass: 100%/224 | Total: 1d 01h | Avg: 6m 54s | Max: 45m 01s | Hits: 99%/12288
  • 🟩 thrust: Pass: 100%/111 | Total: 12h 44m | Avg: 6m 53s | Max: 45m 01s | Hits: 99%/9260

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 36s | Avg:  9m 18s | Max: 12m 59s
    🟩 cpu
      🟩 amd64              Pass: 100%/103 | Total: 12h 05m | Avg:  7m 02s | Max: 45m 01s | Hits:  99%/9260  
      🟩 arm64              Pass: 100%/8   | Total: 38m 56s | Avg:  4m 52s | Max:  5m 28s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 20m | Avg:  5m 22s | Max: 19m 04s | Hits:  99%/1852  
      🟩 11.8               Pass: 100%/3   | Total: 16m 09s | Avg:  5m 23s | Max:  6m 02s
      🟩 12.5               Pass: 100%/4   | Total:  1h 02m | Avg: 15m 31s | Max: 16m 37s
      🟩 12.6               Pass: 100%/89  | Total: 10h 05m | Avg:  6m 48s | Max: 45m 01s | Hits:  99%/7408  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 20m 08s | Avg:  5m 02s | Max:  5m 21s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 20m | Avg:  5m 22s | Max: 19m 04s | Hits:  99%/1852  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 09s | Avg:  5m 23s | Max:  6m 02s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 02m | Avg: 15m 31s | Max: 16m 37s
      🟩 nvcc12.6           Pass: 100%/85  | Total:  9h 45m | Avg:  6m 52s | Max: 45m 01s | Hits:  99%/7408  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 20m 08s | Avg:  5m 02s | Max:  5m 21s
      🟩 nvcc               Pass: 100%/107 | Total: 12h 24m | Avg:  6m 57s | Max: 45m 01s | Hits:  99%/9260  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 58s | Avg:  5m 29s | Max:  7m 32s
      🟩 Clang10            Pass: 100%/3   | Total: 18m 11s | Avg:  6m 03s | Max:  6m 21s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 12s | Avg:  5m 03s | Max:  5m 32s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 01s | Avg:  5m 00s | Max:  5m 21s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 32s | Avg:  5m 08s | Max:  5m 41s
      🟩 Clang14            Pass: 100%/4   | Total: 20m 18s | Avg:  5m 04s | Max:  5m 31s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  5m 47s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 21s | Avg:  5m 20s | Max:  5m 29s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 49s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 06m | Avg:  6m 05s | Max: 15m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 41s | Avg:  4m 20s | Max:  4m 42s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 08s | Avg:  4m 31s | Max:  5m 21s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 43s | Avg:  4m 47s | Max:  5m 17s
      🟩 GCC9               Pass: 100%/6   | Total: 30m 03s | Avg:  5m 00s | Max:  6m 04s
      🟩 GCC10              Pass: 100%/4   | Total: 21m 47s | Avg:  5m 26s | Max:  5m 54s
      🟩 GCC11              Pass: 100%/7   | Total: 38m 53s | Avg:  5m 33s | Max:  6m 36s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 30s | Avg:  5m 37s | Max:  6m 05s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 27m | Avg:  9m 12s | Max: 45m 01s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 20m 54s | Avg:  6m 58s | Max:  7m 32s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 19m 04s | Avg: 19m 04s | Max: 19m 04s | Hits:  99%/1852  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 33m 39s | Avg: 16m 49s | Max: 17m 56s | Hits:  99%/3704  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 40m 26s | Avg: 20m 13s | Max: 22m 27s | Hits:  99%/3704  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 02m | Avg: 15m 31s | Max: 16m 37s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 22m | Avg:  5m 28s | Max: 15m 34s
      🟩 GCC                Pass: 100%/51  | Total:  5h 25m | Avg:  6m 22s | Max: 45m 01s
      🟩 Intel              Pass: 100%/3   | Total: 20m 54s | Avg:  6m 58s | Max:  7m 32s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 33m | Avg: 18m 37s | Max: 22m 27s | Hits:  99%/9260  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 02m | Avg: 15m 31s | Max: 16m 37s
    🟩 gpu
      🟩 v100               Pass: 100%/111 | Total: 12h 44m | Avg:  6m 53s | Max: 45m 01s | Hits:  99%/9260  
    🟩 jobs
      🟩 Build              Pass: 100%/103 | Total: 11h 05m | Avg:  6m 27s | Max: 45m 01s | Hits:  99%/7408  
      🟩 TestCPU            Pass: 100%/4   | Total: 45m 32s | Avg: 11m 23s | Max: 22m 27s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/4   | Total: 53m 21s | Avg: 13m 20s | Max: 15m 34s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 09s | Avg:  5m 23s | Max:  6m 02s
      🟩 90a                Pass: 100%/4   | Total: 18m 03s | Avg:  4m 30s | Max:  4m 49s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 39m | Avg:  5m 18s | Max: 13m 46s
      🟩 14                 Pass: 100%/29  | Total:  3h 06m | Avg:  6m 25s | Max: 19m 04s | Hits:  99%/3704  
      🟩 17                 Pass: 100%/27  | Total:  2h 51m | Avg:  6m 22s | Max: 17m 56s | Hits:  99%/1852  
      🟩 20                 Pass: 100%/23  | Total:  3h 47m | Avg:  9m 54s | Max: 45m 01s | Hits:  99%/3704  
    
  • 🟩 cub: Pass: 100%/110 | Total: 12h 38m | Avg: 6m 53s | Max: 30m 32s | Hits: 99%/3028

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 11h 59m | Avg:  7m 03s | Max: 30m 32s | Hits:  99%/3028  
      🟩 arm64              Pass: 100%/8   | Total: 39m 20s | Avg:  4m 55s | Max:  5m 39s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 15m | Avg:  5m 00s | Max: 14m 28s | Hits:  99%/757   
      🟩 11.8               Pass: 100%/3   | Total: 16m 36s | Avg:  5m 32s | Max:  5m 55s
      🟩 12.5               Pass: 100%/4   | Total: 38m 54s | Avg:  9m 43s | Max: 10m 35s
      🟩 12.6               Pass: 100%/88  | Total: 10h 27m | Avg:  7m 08s | Max: 30m 32s | Hits:  99%/2271  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 16m 43s | Avg:  4m 10s | Max:  4m 20s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 15m | Avg:  5m 00s | Max: 14m 28s | Hits:  99%/757   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 36s | Avg:  5m 32s | Max:  5m 55s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 38m 54s | Avg:  9m 43s | Max: 10m 35s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 11m | Avg:  7m 16s | Max: 30m 32s | Hits:  99%/2271  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 16m 43s | Avg:  4m 10s | Max:  4m 20s
      🟩 nvcc               Pass: 100%/106 | Total: 12h 21m | Avg:  6m 59s | Max: 30m 32s | Hits:  99%/3028  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 31m 43s | Avg:  5m 17s | Max:  6m 30s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 11s | Avg:  6m 23s | Max:  6m 47s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 26s | Avg:  5m 06s | Max:  5m 24s
      🟩 Clang12            Pass: 100%/4   | Total: 21m 06s | Avg:  5m 16s | Max:  5m 29s
      🟩 Clang13            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 35s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 11s | Avg:  5m 17s | Max:  5m 38s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 27s | Avg:  5m 21s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 46s | Avg:  5m 26s | Max:  5m 34s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 27s | Avg:  5m 21s | Max:  5m 34s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 32m | Avg:  8m 23s | Max: 30m 32s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 28s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 19s | Avg:  4m 43s | Max:  5m 27s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 19s | Avg:  4m 43s | Max:  5m 15s
      🟩 GCC9               Pass: 100%/6   | Total: 31m 04s | Avg:  5m 10s | Max:  6m 31s
      🟩 GCC10              Pass: 100%/4   | Total: 22m 18s | Avg:  5m 34s | Max:  5m 46s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 06s | Avg:  5m 35s | Max:  6m 05s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 46s | Avg:  5m 41s | Max:  5m 56s
      🟩 GCC13              Pass: 100%/16  | Total:  2h 54m | Avg: 10m 53s | Max: 28m 42s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 32s | Avg:  6m 30s | Max:  6m 43s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 28s | Avg: 14m 28s | Max: 14m 28s | Hits:  99%/757   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 25m 03s | Avg: 12m 31s | Max: 12m 57s | Hits:  99%/1514  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 45s | Avg: 13m 45s | Max: 13m 45s | Hits:  99%/757   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 38m 54s | Avg:  9m 43s | Max: 10m 35s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 51m | Avg:  6m 04s | Max: 30m 32s
      🟩 GCC                Pass: 100%/51  | Total:  5h 54m | Avg:  6m 57s | Max: 28m 42s
      🟩 Intel              Pass: 100%/3   | Total: 19m 32s | Avg:  6m 30s | Max:  6m 43s
      🟩 MSVC               Pass: 100%/4   | Total: 53m 16s | Avg: 13m 19s | Max: 14m 28s | Hits:  99%/3028  
      🟩 NVHPC              Pass: 100%/4   | Total: 38m 54s | Avg:  9m 43s | Max: 10m 35s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 12h 38m | Avg:  6m 53s | Max: 30m 32s | Hits:  99%/3028  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  9h 42m | Avg:  5m 42s | Max: 14m 28s | Hits:  99%/3028  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 15s | Avg: 20m 15s | Max: 20m 15s
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 30s | Avg: 18m 50s | Max: 20m 53s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 20m | Avg: 26m 55s | Max: 30m 32s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 36s | Avg:  5m 32s | Max:  5m 55s
      🟩 90a                Pass: 100%/4   | Total: 17m 18s | Avg:  4m 19s | Max:  4m 41s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 02m | Avg:  6m 04s | Max: 21m 32s
      🟩 14                 Pass: 100%/29  | Total:  2h 51m | Avg:  5m 54s | Max: 14m 28s | Hits:  99%/1514  
      🟩 17                 Pass: 100%/27  | Total:  2h 36m | Avg:  5m 47s | Max: 12m 06s | Hits:  99%/757   
      🟩 20                 Pass: 100%/24  | Total:  4h 08m | Avg: 10m 21s | Max: 30m 32s | Hits:  99%/757   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 14s | Avg: 5m 07s | Max: 8m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 14s | Avg:  5m 07s | Max:  8m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 13s | Avg:  2m 13s | Max:  2m 13s
      🟩 Test               Pass: 100%/1   | Total:  8m 01s | Avg:  8m 01s | Max:  8m 01s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 44s | Avg: 15m 44s | Max: 15m 44s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 224)

# Runner
185 linux-amd64-cpu16
16 linux-arm64-cpu16
14 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16

Labels
benchmark (Feature related to benchmarking our libraries), cub (For all items related to CUB)
Projects
Status: In Review