Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modal OCR beta #91

Merged
merged 3 commits into from
Dec 1, 2024
Merged

Modal OCR beta #91

merged 3 commits into from
Dec 1, 2024

Conversation

zakhar-kogan
Copy link
Contributor

@zakhar-kogan zakhar-kogan commented Nov 30, 2024

Modal OCR Implementation

Adds OCR functionality using Modal endpoint integration:

  • Added ocr_modal() function for making OCR requests
  • Added ocr_content() function to handle OCR response parsing
  • Configurable via MODAL_ENDPOINT environment variable
  • Returns standardized OcrResult type

The OCR service accepts image bytes and returns extracted text

@zakhar-kogan zakhar-kogan marked this pull request as draft November 30, 2024 16:14
@zakhar-kogan zakhar-kogan marked this pull request as ready for review November 30, 2024 16:15
@ohld ohld merged commit 2a2b3a6 into ffmemes:production Dec 1, 2024
1 check failed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants