Skip to content

Commit 8bc7212

Browse files
Merge pull request #19 from nvidia-cosmos/spectralflight/update-outputs
Update output assets
2 parents ae7b74a + 8ad94b2 commit 8bc7212

4 files changed

Lines changed: 93 additions & 72 deletions

File tree

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -132,15 +132,15 @@ Start the server:
132132
```shell
133133
uv run vllm serve nvidia/Cosmos-Reason2-2B \
134134
--allowed-local-media-path "$(pwd)" \
135-
--max-model-len 8192 \
135+
--max-model-len 16384 \
136136
--media-io-kwargs '{"video": {"num_frames": -1}}' \
137137
--reasoning-parser qwen3 \
138138
--port 8000
139139
```
140140

141141
Arguments:
142142

143-
* `--max-model-len 8192`: Maximum model length to avoid OOM.
143+
* `--max-model-len 16384`: Maximum model length to avoid OOM. Recommended range: 8192 - 16384.
144144
* `--media-io-kwargs '{"video": {"num_frames": -1}}'`: Allow overriding FPS per sample.
145145
* `--reasoning-parser qwen3`: Parse reasoning trace.
146146
* `--port 8000`: Server port. Change if you encounter `Address already in use` errors.
@@ -150,7 +150,7 @@ Wait a few minutes for the server to startup. Once complete, it will print `Appl
150150
Caption a video ([sample output](assets/outputs/caption.log)):
151151

152152
```shell
153-
uv run cosmos-reason2-inference online --port 8000 -i prompts/caption.yaml --videos assets/sample.mp4 --fps 4
153+
uv run cosmos-reason2-inference online --port 8000 -i prompts/caption.yaml --reasoning --videos assets/sample.mp4 --fps 4
154154
```
155155

156156
Embodied reasoning with verbose output ([sample output](assets/outputs/embodied_reasoning.log)):
@@ -174,7 +174,7 @@ Arguments:
174174
Temporally caption a video and save the input frames to `outputs/temporal_localization` for debugging ([sample output](assets/outputs/temporal_localization.log)):
175175

176176
```shell
177-
uv run cosmos-reason2-inference offline -v --max-model-len 8192 -i prompts/temporal_localization.yaml --videos assets/sample.mp4 --fps 4 -o outputs/temporal_localization
177+
uv run cosmos-reason2-inference offline -v --max-model-len 16384 -i prompts/temporal_localization.yaml --videos assets/sample.mp4 --fps 4 -o outputs/temporal_localization
178178
```
179179

180180
To list available parameters:

assets/outputs/caption.log

Lines changed: 26 additions & 9 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)