More improvements to NEL evaluation #30

wricketts · 2023-08-25T19:54:49Z

Because

Latest metrics in the 20230824 evaluation are pretty low. This could be due to several reasons:

comparison of Span is more strict, since it is based on character offsets rather than the strings themselves. System annotations must have the exact same character offsets as gold annotations in order to be considered a match. Substrings are no longer considered a match.
Annotations are additionally compared on their type property (i.e. the category of the System entity must match the gold annotation). Even if a system annotation has the correct Span and KBID, it would still be a miss if the type does not match the gold.

It could be insightful to add a more fine-grained evaluation for each annotation property. Specifically, by computing precision, recall, and F1 for (some options)--

Span alone ?
Span + KBID ?
Span + type ?

If metrics are particularly low for one of these compared to others, it might show where the app could be improved.

Done when

More fine-grained evaluation is implemented (or we decide it's not necessary).

Additional context

No response

The text was updated successfully, but these errors were encountered:

keighrim · 2023-12-08T07:12:28Z

TIL about this; https://www.semantic-web-journal.net/system/files/swj1671.pdf Maybe we need to take a closer look at the library and consider using standardized metrics included.

clams-bot added this to infra Aug 25, 2023

github-project-automation bot moved this to Todo in infra Aug 25, 2023

keighrim added the ▶️F Migrate to next phase label Jan 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More improvements to NEL evaluation #30

More improvements to NEL evaluation #30

wricketts commented Aug 25, 2023

keighrim commented Dec 8, 2023

More improvements to NEL evaluation #30

More improvements to NEL evaluation #30

Comments

wricketts commented Aug 25, 2023

Because

Done when

Additional context

keighrim commented Dec 8, 2023