Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[CuTeDSL] [demo] workaround of MLIR codegen
#3161 opened Apr 11, 2026 by yinghai Loading…
[CuTeDSL][fix]: 1d bias epilogue fix
#3157 opened Apr 9, 2026 by leevan Loading…
Add absf and floor to cute.math
#3156 opened Apr 8, 2026 by nandor Loading…
Add support for empty dataclass arguments
#3152 opened Apr 7, 2026 by nandor Loading…
Fix incorrect example paths in CuTeDSL docstrings
#3151 opened Apr 6, 2026 by Weili-0234 Loading…
[Hopper CuTeDSL] Add FP8 GEMM with 2xAcc
#3149 opened Apr 5, 2026 by Johnsonms Contributor Loading…
[CuTeDSL] Fix incorrect package-data key in pyproject.toml
#3145 opened Apr 3, 2026 by Johnsonms Contributor Loading…
Fix Hopper FMHA performance regression on CUDA < 13.1
#3137 opened Mar 31, 2026 by arvin-chou Loading…
5 of 6 tasks
feat(CuTeDSL): print benchmark time from Blackwell dense_gemm CLI
#3136 opened Mar 30, 2026 by aidando73 Contributor Loading…
Fix elementwise_apply.py
#3129 opened Mar 25, 2026 by HydraQYH Contributor Loading…
[CuTeDSL] Add SM103 grouped block-scaled GEMM kernel and tests
#3124 opened Mar 23, 2026 by Johnsonms Contributor Loading…
Enable strict C++ compiler warnings with -Werror
#3123 opened Mar 22, 2026 by maxwbuckley Loading…
3 of 4 tasks
[bugfix] use acquire to prevent reordering.
#3118 opened Mar 20, 2026 by shubaoyu2 Contributor Loading…
Fix typo in elementwise_add.py
#3116 opened Mar 20, 2026 by HydraQYH Contributor Loading…
Add FlashMoE Publication
#3115 opened Mar 20, 2026 by osayamenja Loading…
[docs] Fix same typo inactive-30d
#3098 opened Mar 9, 2026 by lhtin Loading…
ProTip! Adding no:label will show everything without a label.