Add GPU-side Gumbel-max sampling for CUDA graph compatibility #5897

Annotations

1 warning

export-model-cuda-windows-artifact (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile...  /  linux-job

succeeded Apr 24, 2026 in 27m 30s