RFB evaluation #66

bohJiang12 · 2024-07-26T15:45:41Z

This PR aims to target issue #51

Implementations

evaluate_alt.py: the alternative version of evaluation script, which borrows ideas of evaluate.py for calculating IOU. Its main functions include:
- Load both prediction results run by RFB and gold-standard csv data, and return them as the same format -- {GUID: frame_num: (role, filler)} so that it's convenient for calculating IOU
- The Intersection Over Union (IOU) evaluation metric, but missed the Dice-Sorensen Coefficient (DSC) for now.
- The utility of writing out results.txt

All edge cases suffered now are from csv string (i.e. annotations part), and they're

NaN value: this missing value occurs in either role or filler fields. -> Solution: replace this missing value as the string 'nan' while loading prediction results, but drop such row having it in loading gold results
Address, Name, Company's name including comma: this confused how to successfully split role and filler fields from a csv string based on the rule of exporting gold data csv in process.py -> Solution: use str.split(',', maxsplit=2) to force the string is split into three fields: ('', role, filler)

In particular, I implemented a function get_aligned_ann_of for supporting find aligned annotation based on a "source" annotation cross views. Appreciate any feedback.
The possible improvements can be
- adding DSC metric
- cleaner code for loading data

bohJiang12 · 2024-08-16T21:39:38Z

@keighrim I just added the report, please take a look at that. Let me know if there's anything that I can improve.

MrSqually and others added 2 commits June 24, 2024 11:49

gold-loading and metric infra

1eafaee

First implemented evaluation script

83a8e76

bohJiang12 requested a review from keighrim July 26, 2024 15:47

keighrim mentioned this pull request Jul 29, 2024

#89 --- RFB gold processing clamsproject/aapb-annotations#91

Merged

bohJiang12 added 7 commits July 29, 2024 16:39

Added log feature enabling export debug messages

df74628

added predictions of rfb running on batch90

d026a17

Reimplemented calculating IOU and enabled parallel processing

94a1acd

added goldretriver to download gold data

1d3681c

Tested evaluation script

770958c

Completed first round of evaluation

c759cd7

Added report to evaluation of batch 44

2add716