fix: Load metrics from centralised folder #15

ruivieira · 2025-01-08T08:45:21Z

In order to perform evaluations in offline mode, the HF evaluate metrics must be present.
Since metrics are loaded from the current working directory, this could cause conflicts whenever a dataset had the same name as the metric (e.g. glue dataset and glue metric). LMEval would try to load the metric as the dataset.
This issue is more prevalent with unitxt, where there is no way to modify the default loading path from LMEval.
In order to avoid conflicts and still use unitxt in offline mode:

The metrics were moved to a central location (metrics, avoiding conflicts)
If using online mode, these local metrics are not used
If using offline mode, the evaluate load method is patched to use the local metrics (or provided folder)

It is possible to provide a custom metrics folder by supplying the LMEVAL_METRICS_PREFIX environment variable. If not specified the bundled metrics folder is used.

Description

Move the metrics to a centralised folder
Modify the evaluate metrics loader to load from supplied path (only in offline mode)

How Has This Been Tested?

Exploratory testing using scenarios in https://github.com/trustyai-explainability/reference/blob/main/lm-eval/LM-EVAL-NEXT.md

Merge criteria:

The commits are squashed in a cohesive manner and have meaningful messages.
Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
The developer has manually tested the changes and verified that the changes work

lm_eval/__main__.py

danielezonca

lgtm

fix: Load metrics from centralised folder

10c7b20

ruivieira added the kind/enhancement New feature or request label Jan 8, 2025

ruivieira requested review from danielezonca and RobGeada January 8, 2025 08:45

ruivieira self-assigned this Jan 8, 2025

ruivieira mentioned this pull request Jan 8, 2025

move metric modules to metrics dir #14

Closed

3 tasks

danielezonca reviewed Jan 8, 2025

View reviewed changes

lm_eval/__main__.py Outdated Show resolved Hide resolved

ruivieira added 2 commits January 8, 2025 15:50

fix: Add missing jiwer dependency

2c25e99

Add logging to metric loading

6f364a4

danielezonca approved these changes Jan 8, 2025

View reviewed changes

RobGeada approved these changes Jan 8, 2025

View reviewed changes

ruivieira merged commit bc4f819 into opendatahub-io:release-0.4.5 Jan 8, 2025
1 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Load metrics from centralised folder #15

fix: Load metrics from centralised folder #15

ruivieira commented Jan 8, 2025

danielezonca left a comment

fix: Load metrics from centralised folder #15

fix: Load metrics from centralised folder #15

Conversation

ruivieira commented Jan 8, 2025

Description

How Has This Been Tested?

Merge criteria:

danielezonca left a comment

Choose a reason for hiding this comment