-
Notifications
You must be signed in to change notification settings - Fork 300
Pull requests: NVIDIA/cccl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Remove
[[nodiscard]] from barrier's .arrive(...) method
backport branch/3.1.x
backport branch/3.2.x
#6947
opened Dec 11, 2025 by
davebayer
Loading…
[Backport 3.0] Properly specialize cub functions for
__nv_bfloat16 (#6931)
#6946
opened Dec 11, 2025 by
miscco
Loading…
2 tasks
[Backport 3.1] Properly specialize cub functions for
__nv_bfloat16 (#6931)
#6945
opened Dec 11, 2025 by
miscco
Loading…
2 tasks
[Backport 3.0] Add
_CCCL_DECLSPEC_EMPTY_BASES to mdspan features (#6444)
#6944
opened Dec 11, 2025 by
miscco
Loading…
[DRAFT] Optimize offset based DeviceSegmentedReduce for small and medium segment sizes
#6942
opened Dec 11, 2025 by
srinivasyadav18
Loading…
4 tasks
[cuda.compute] Refactor code for creating void* wrappers to avoid ODR violations
#6941
opened Dec 10, 2025 by
shwina
Loading…
2 tasks
[Backport branch/3.2.x] Properly specialize cub functions for
__nv_bfloat16
#6940
opened Dec 10, 2025 by
github-actions
bot
Loading…
[Backport branch/3.2.x] [libcu++] Rename device_transform back to launch_transform
#6937
opened Dec 10, 2025 by
github-actions
bot
Loading…
[Backport branch/3.2.x] [libcu++] Static assert that resource is copyable in buffer constructors
#6936
opened Dec 10, 2025 by
github-actions
bot
Loading…
[Backport branch/3.2.x] Make sure we actually use overflow builtins
#6934
opened Dec 10, 2025 by
github-actions
bot
Loading…
[Backport branch/3.2.x] Add missing nvrtc nv target archs
#6933
opened Dec 10, 2025 by
github-actions
bot
Loading…
[BACKPORT 3.0] Update
cuda/ptx instructions to support all new SM architectures in CTK 13 (#5600)
#6930
opened Dec 10, 2025 by
miscco
Loading…
Expose not guaranteed determinism to reduce in cuda.compute
#6926
opened Dec 9, 2025 by
NaderAlAwar
Loading…
1 of 2 tasks
Add internal Targeted for 3.2.0 release
cuda::__is_device_memory
3.2.0
#6918
opened Dec 8, 2025 by
fbusato
Loading…
Remove need for hardcoded
LevelT for histogram in c.parallel and cuda.compute
#6915
opened Dec 8, 2025 by
NaderAlAwar
Loading…
2 tasks
Implement the new tuning API for DeviceTransform
#6914
opened Dec 8, 2025 by
bernhardmgruber
Loading…
4 of 7 tasks
Add NVTX annotations to cuda.compute user-facing APIs
#6906
opened Dec 8, 2025 by
Copilot
AI
Loading…
1 of 2 tasks
[CUB] Reduce BlockAdjacentDifference shared memory usage by 50%
#6901
opened Dec 7, 2025 by
Aminsed
Loading…
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.