You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As part of moving towards an automated process, it is generally easier if the evaluation process in each task is as similar as possible.
Therefore, it should be considered to standardize:
All python evaluation scripts to be evaluate.py
These scripts should use the same argument structure and shortforms, e.g. -g, -m/-p, -r.
Done when
No response
Additional context
No response
The text was updated successfully, but these errors were encountered:
Because
As part of moving towards an automated process, it is generally easier if the evaluation process in each task is as similar as possible.
Therefore, it should be considered to standardize:
evaluate.py
-g
,-m
/-p
,-r
.Done when
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: