fix distributed evaluation on empty task shards #1233
Merged
Luodian merged 2 commits into EvolvingLMMs-Lab:main on Mar 7, 2026
Conversation
Inject a padding request when a rank receives zero docs, and align request/filter synchronization across ranks, so that tensor-parallel + data-parallel (TP+DP) jobs with `limit <= world_size` no longer crash or hang.
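The core idea is easiest to see in isolation. Below is a minimal sketch of the padding approach, not the actual patch: `Request`, `PAD_DOC`, `build_requests`, and `drop_padding` are hypothetical names chosen for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Request:
    doc_id: int
    is_padding: bool = False  # padded requests are dropped before scoring


# Hypothetical sentinel: a throwaway request carried only for synchronization.
PAD_DOC = Request(doc_id=-1, is_padding=True)


def build_requests(shard_docs: list[int]) -> list[Request]:
    """Build per-rank requests; inject one padding request on an empty shard.

    Every rank must submit at least one request so that the collective
    gather/broadcast calls later in evaluation see the same number of
    participants and no rank blocks forever.
    """
    requests = [Request(doc_id=d) for d in shard_docs]
    if not requests:  # this rank got zero docs, e.g. when limit <= world_size
        requests.append(PAD_DOC)
    return requests


def drop_padding(responses: list[tuple[Request, str]]) -> list[tuple[Request, str]]:
    """Strip padded entries after inference, before any metric is computed."""
    return [(req, out) for req, out in responses if not req.is_padding]
```

The design point is that padding keeps every rank's request list non-empty, so the distributed collectives stay aligned, while the sentinel is filtered out before results are scored.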
Summary
In scope
- `lmms_eval/api/task.py` to preserve distributed synchronization for empty shards.
- `lmms_eval/evaluator.py` to keep request/filter coordination consistent across ranks (see the sketch after this list).

Out of scope
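To make the in-scope request/filter coordination concrete, here is a hedged sketch assuming an initialized `torch.distributed` process group (for example, launched via `torchrun`); `gather_responses` is a hypothetical name, not lmms_eval's actual API.

```python
from __future__ import annotations

import torch.distributed as dist


def gather_responses(local_outputs: list[str]) -> list[str] | None:
    """Every rank calls this unconditionally, even with an empty shard.

    all_gather_object is a collective: if a zero-doc rank skipped it, the
    remaining ranks would block forever waiting for the missing participant.
    The padding request guarantees local_outputs is never empty, so every
    rank contributes exactly one object to the gather.
    """
    world_size = dist.get_world_size()
    gathered = [None] * world_size  # one slot per rank, filled by the collective
    dist.all_gather_object(gathered, local_outputs)
    if dist.get_rank() == 0:
        # Flatten in rank order; padded outputs are assumed to have been
        # stripped upstream, as in the padding sketch above.
        return [out for per_rank in gathered for out in per_rank]
    return None
```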
Validation
- `uv run --with pytest python -m pytest test/eval/test_construct_requests.py -q` | sample size: N=23 tests | key metrics: 23 passed | result: pass
- Import smoke check (heredoc below) | sample size: N=1 smoke check | key metrics: evaluate=True | result: pass

  ```bash
  uv run python - <<'PY'
  import lmms_eval.evaluator as evaluator
  print(hasattr(evaluator, 'evaluate'))
  PY
  ```

- `uv run pre-commit run --all-files` | sample size: N=all tracked files | key metrics: black, isort passed | result: pass

Risk / Compatibility
Type of Change