Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

More improvements to NEL evaluation #30

Open
wricketts opened this issue Aug 25, 2023 · 1 comment
Open

More improvements to NEL evaluation #30

wricketts opened this issue Aug 25, 2023 · 1 comment
Labels
▶️F Migrate to next phase

Comments

@wricketts
Copy link
Contributor

Because

Latest metrics in the 20230824 evaluation are pretty low. This could be due to several reasons:

  • comparison of Span is more strict, since it is based on character offsets rather than the strings themselves. System annotations must have the exact same character offsets as gold annotations in order to be considered a match. Substrings are no longer considered a match.
  • Annotations are additionally compared on their type property (i.e. the category of the System entity must match the gold annotation). Even if a system annotation has the correct Span and KBID, it would still be a miss if the type does not match the gold.

It could be insightful to add a more fine-grained evaluation for each annotation property. Specifically, by computing precision, recall, and F1 for (some options)--

  • Span alone ?
  • Span + KBID ?
  • Span + type ?

If metrics are particularly low for one of these compared to others, it might show where the app could be improved.

Done when

More fine-grained evaluation is implemented (or we decide it's not necessary).

Additional context

No response

@clams-bot clams-bot added this to infra Aug 25, 2023
@github-project-automation github-project-automation bot moved this to Todo in infra Aug 25, 2023
@keighrim
Copy link
Member

keighrim commented Dec 8, 2023

TIL about this; https://www.semantic-web-journal.net/system/files/swj1671.pdf Maybe we need to take a closer look at the library and consider using standardized metrics included.

@keighrim keighrim added the ▶️F Migrate to next phase label Jan 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
▶️F Migrate to next phase
Projects
Status: Todo
Development

No branches or pull requests

2 participants