Skip to content

Add GPU-side Gumbel-max sampling for CUDA graph compatibility #12822

Add GPU-side Gumbel-max sampling for CUDA graph compatibility

Add GPU-side Gumbel-max sampling for CUDA graph compatibility #12822

Annotations

1 warning

export-model-cuda-artifact (SocialLocalMobile, Qwen3.5-35B-A3B-HQQ-INT4, quantized-int4-tile-packed)  /  linux-job

succeeded Apr 24, 2026 in 58m 12s