fix(charxiv): lazy-init OpenAI client and make model version configur… by MaxwellJryao · Pull Request #1252 · EvolvingLMMs-Lab/lmms-eval

MaxwellJryao · 2026-03-13T12:44:53Z

Summary

Lazy-initialize OpenAI client to avoid SSLContext pickling errors in datasets.map() multiprocessing
Make model version configurable via MODEL_VERSION env var for both descriptive and reasoning grading

In scope

lmms_eval/tasks/charxiv/utils.py: replace module-level client with _get_client() lazy initializer; remove num_proc=1 from dataset.map()
lmms_eval/tasks/charxiv/reasoning_utils.py: add model parameter to get_reasoning_result_gpt()

Out of scope

No changes to grading logic, prompts, or evaluation metrics
No changes to other tasks or model integrations

Validation

Risk / Compatibility

Low risk: lazy init is functionally equivalent; default model value preserves existing behavior
MODEL_VERSION env var now takes effect for reasoning grading (previously ignored)

Type of Change

…able - Replace module-level OpenAI client with lazy _get_client() to avoid SSLContext pickling errors in datasets.map() multiprocessing - Remove num_proc=1 from dataset.map() (no longer needed) - Add model parameter to get_reasoning_result_gpt() so MODEL_VERSION env var is respected for both descriptive and reasoning grading

Luodian merged commit a130a0c into EvolvingLMMs-Lab:main Mar 15, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(charxiv): lazy-init OpenAI client and make model version configur…#1252

fix(charxiv): lazy-init OpenAI client and make model version configur…#1252
Luodian merged 1 commit intoEvolvingLMMs-Lab:mainfrom
MaxwellJryao:fix/charxiv-lazy-client-configurable-model

MaxwellJryao commented Mar 13, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

MaxwellJryao commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

In scope

Out of scope

Validation

Risk / Compatibility

Type of Change

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MaxwellJryao commented Mar 13, 2026 •

edited

Loading