Skip to content

Add GPU-side Gumbel-max sampling for CUDA graph compatibility #12822

Add GPU-side Gumbel-max sampling for CUDA graph compatibility

Add GPU-side Gumbel-max sampling for CUDA graph compatibility #12822

Annotations

1 warning

test-model-cuda-e2e (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized)  /  linux-job

succeeded Apr 24, 2026 in 14m 29s