fix(vllm_rollout): Filter prompts exceeding max_model_len instead of … by Pritiks23 · Pull Request #690 · nvidia-cosmos/cosmos-rl

Pritiks23 · 2026-05-21T22:30:31Z

Fixes issue #403: vLLM raises an error when the decoder prompt length exceeds max_model_len, which is most likely with large images/videos in VLM tasks. This fix filters out oversized prompts before they are passed to vLLM, which prevents the error, and logs warnings about the skipped samples.

Changes:

- Added prompt length validation in rollout_generation_single_turn()
- Added prompt length validation in async rollout_generation()
- Supports both LLM (string/dict prompts) and VLM (image/video inputs)
- Logs warnings when samples are skipped due to exceeding max_model_len
- Gracefully handles edge cases where token count cannot be determined
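
For readers who want a feel for the filtering step, here is a minimal sketch of the idea, not the code in this PR. The names `count_prompt_tokens` and `filter_oversized_prompts` and the `tokenize` callable are hypothetical. The dict keys `prompt` and `prompt_token_ids` are assumed to follow vLLM's text/token prompt inputs. Keeping a sample whose token count cannot be determined is also an assumption about what "gracefully handles" means here. Note that a text-only count can undercount VLM prompts, because image/video placeholders expand into many tokens after multimodal processing.

```python
import logging
from typing import Any, Callable, List, Optional, Sequence, Tuple

logger = logging.getLogger(__name__)


def count_prompt_tokens(
    prompt: Any, tokenize: Callable[[str], Sequence[int]]
) -> Optional[int]:
    """Return the prompt's token count, or None if it cannot be determined."""
    try:
        if isinstance(prompt, str):
            return len(tokenize(prompt))
        if isinstance(prompt, dict):
            # Pre-tokenized prompt: the length is known directly.
            if prompt.get("prompt_token_ids") is not None:
                return len(prompt["prompt_token_ids"])
            # Text prompt, possibly alongside multimodal data.
            if isinstance(prompt.get("prompt"), str):
                return len(tokenize(prompt["prompt"]))
    except Exception as exc:  # tokenizer failure: treat the count as unknown
        logger.warning("Could not count tokens for prompt: %s", exc)
    return None


def filter_oversized_prompts(
    prompts: Sequence[Any],
    max_model_len: int,
    tokenize: Callable[[str], Sequence[int]],
) -> Tuple[List[Any], List[int]]:
    """Drop prompts longer than max_model_len; return kept prompts and their indices."""
    kept: List[Any] = []
    kept_indices: List[int] = []
    for idx, prompt in enumerate(prompts):
        num_tokens = count_prompt_tokens(prompt, tokenize)
        if num_tokens is not None and num_tokens > max_model_len:
            logger.warning(
                "Skipping sample %d: %d tokens exceeds max_model_len=%d",
                idx, num_tokens, max_model_len,
            )
            continue
        # Unknown counts are kept (assumption), so the engine makes the final call.
        kept.append(prompt)
        kept_indices.append(idx)
    return kept, kept_indices
```

Returning the kept indices alongside the kept prompts lets the caller map generation results back to the original samples, which matters when skipped samples must still be accounted for downstream.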

…raising error

Pritiks23 mentioned this pull request May 21, 2026

Vllm raise error when token_num exceed max_model_len #403

Open

Pritiks23 added 2 commits May 22, 2026 18:40

style(vllm_rollout): apply ruff-format for pre-commit

4c2abb1

Pritiks23 force-pushed the fix/vllm-max-model-len-filtering branch from 5876831 to 4c2abb1 Compare May 22, 2026 18:42

Pritiks23 wants to merge 2 commits into nvidia-cosmos:main from Pritiks23:fix/vllm-max-model-len-filtering
