Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add more "seqtag" annotation for rfb training #88

Open
keighrim opened this issue Jun 18, 2024 · 0 comments
Open

add more "seqtag" annotation for rfb training #88

keighrim opened this issue Jun 18, 2024 · 0 comments

Comments

@keighrim
Copy link
Member

keighrim commented Jun 18, 2024

This issue is to track effort to add more sequential tagging annotation data to improve RFB model performance.

A few notes:

  1. For next rounds, I highly recommend setting up traceable data prep pipeline to link the images (or OCR results) back to its originating videos/timestamp.
  2. Since everyone agrees that erroneous OCR results is the major cause of poor performance of the model, we need to also think about how to improve OCR while adding more of this (silver) seqtag data. As we are adding a new OCR engine (paddleOCR), first thing we need to know is whether paddle can outperform docTR and thus can replace docTR in the pipeline. (wait for improve OCR evaluation script aapb-evaluations#52)
@clams-bot clams-bot added this to infra Jun 18, 2024
@github-project-automation github-project-automation bot moved this to Todo in infra Jun 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

1 participant