Skip to content

add top-p and top-k arg #12823

add top-p and top-k arg

add top-p and top-k arg #12823

Annotations

1 warning

test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed)  /  linux-job

succeeded Apr 24, 2026 in 29m 38s