Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Replace CUDA driver API with runtime API
#2928 opened Jan 4, 2026 by depaulmillz Loading…
remove useless (void) mark
#2926 opened Jan 4, 2026 by veritas-Qiu Loading…
[doc] N / scale N granularity typo?
#2924 opened Jan 3, 2026 by aidando73 Loading…
Fix incorrect tensor layout strides in Blackwell MMA tutorial comments
#2921 opened Jan 3, 2026 by Johnsonms Loading…
6 of 8 tasks
New RMS Norm example with unit tests
#2917 opened Jan 1, 2026 by bkryu Loading…
feat(examples/test_run): use runtime sm arch
#2916 opened Dec 31, 2025 by tpoisonooo Loading…
Fix idx2crd docstring
#2914 opened Dec 30, 2025 by Edenzzzz Loading…
[Doc]Fix Mode Name and Stride in 0t_mma_atom.md
#2910 opened Dec 27, 2025 by HydraQYH Loading…
Fix CUDA version checking in examples
#2894 opened Dec 21, 2025 by aychun Loading…
docs: note when DSL dumps are populated
#2891 opened Dec 20, 2025 by ColinPeppler Loading…
Fix finding cuDNN
#2890 opened Dec 19, 2025 by TLescoatTFX Loading…
fix typo
#2884 opened Dec 17, 2025 by kf-zhang Loading…
Remove redundant "from" from comment
#2853 opened Dec 8, 2025 by crcrpar Loading…
Add spin_lock_atom_cas_acquire_wait function
#2846 opened Dec 5, 2025 by aleozlx Loading…
Remove deprecated newshape argument. inactive-30d
#2844 opened Dec 4, 2025 by Artem-B Loading…
[CuTeDSL] Feature/fp8e4m3 to fp16 conversion
#2822 opened Nov 28, 2025 by arseniivanov Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.