Quantization fixes #32

Merged
spectralflight merged 4 commits into nvidia-cosmos:main from milesial:patch-1 on Jan 15, 2026

Conversation

@milesial
Contributor

  • enables FP8 KV-cache quantization (k and v scales; no q or attention-probability scales)
  • fixes the default saved max model length (256k)
  • uses static FP8 activation scales instead of dynamic ones (tensor-wise, min-max) for better performance
  • copies the tokenizer and preprocessing configs directly, to avoid llmcompressor introducing silent truncation
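The static-vs-dynamic distinction in the third bullet can be sketched as follows. This is an illustrative, simplified example, not the PR's actual `scripts/quantize.py`: it uses a uniform grid rather than a true FP8 encoding, and the helper names are hypothetical. A static scheme computes one tensor-wise scale from calibration data via min-max and reuses it at inference, whereas a dynamic scheme would recompute the scale on every forward pass.

```python
E4M3_MAX = 448.0  # largest representable magnitude in FP8 E4M3

def static_fp8_scale(calibration_tensors):
    """Tensor-wise min-max scale: one scalar, fixed after calibration."""
    amax = max(max(abs(v) for v in t) for t in calibration_tensors)
    return amax / E4M3_MAX

def quantize_fp8(values, scale):
    """Quantize with a fixed scale (simplified uniform grid, not real FP8)."""
    return [max(-E4M3_MAX, min(E4M3_MAX, round(v / scale))) * scale
            for v in values]

# Calibration over sample activations fixes the scale once...
calib = [[0.1, -2.24, 0.5], [1.12, -0.3, 0.05]]
scale = static_fp8_scale(calib)  # 2.24 / 448 = 0.005
# ...so inference needs no per-batch max reduction; values beyond the
# calibrated range simply saturate at +/- E4M3_MAX * scale.
print(scale)
```

The performance win comes from skipping the per-batch reduction over activations, at the cost of clipping values that exceed the calibrated range.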

@spectralflight spectralflight self-requested a review January 5, 2026 17:37
@spectralflight (Contributor) left a comment


Thanks! A few minor comments, but otherwise LGTM. Please run `just lint` per https://github.com/nvidia-cosmos/cosmos-reason2/blob/main/CONTRIBUTING.md#test

3 comment threads on scripts/quantize.py
@spectralflight spectralflight merged commit 4caf947 into nvidia-cosmos:main Jan 15, 2026
1 check passed

2 participants