Triggering OCR for images #1214

goheretech · 2025-04-25T20:36:55Z

I have a question about using this to extract text from images.

I have tested this on every file type I can think of, including YouTube links, and I am very impressed with the results. However, when I submit a .png or .jpg of a document or other text, the output is empty, which leads me to believe the OCR isn't triggering.

Is anyone else seeing similar results?

I have also converted PDFs that contain only images, and those worked fine.

txhno · 2026-04-13T12:55:31Z

Yes, that matches the current behavior.

MarkItDown does not seem to run built-in OCR on plain .jpg / .png files. The basic image converter mostly returns metadata plus an optional LLM image description, so empty output can happen.

If you want actual OCR on images, use Azure Document Intelligence. The OCR plugin is for PDF / DOCX / PPTX / XLSX, not standalone image files.
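For reference, a minimal sketch of the Document Intelligence route. It assumes the `markitdown[az-doc-intel]` extra is installed and that Azure credentials are available in the environment. `is_plain_image` and `convert_image` are hypothetical helper names for this example, and the endpoint string is a placeholder you would replace with your own.

```python
from pathlib import Path

# Extensions treated as standalone images in this sketch (an assumption).
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def is_plain_image(path: str) -> bool:
    """Return True when the file extension marks a standalone image."""
    return Path(path).suffix.lower() in IMAGE_SUFFIXES


def convert_image(path: str, docintel_endpoint: str) -> str:
    """Convert one file through Azure Document Intelligence and return the Markdown."""
    # Imported lazily so is_plain_image works even without markitdown installed.
    from markitdown import MarkItDown

    md = MarkItDown(docintel_endpoint=docintel_endpoint)
    return md.convert(path).text_content


# Example call (needs network access and a real endpoint):
# print(convert_image("scan.png", "<document_intelligence_endpoint>"))
```

Routing only image files through this path keeps the default converters for everything else, which already worked in the original report.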

VANDRANKI · 2026-04-13T19:30:34Z

markitdown does not run OCR on its own. For plain images (PNG, JPG), it relies on an LLM to describe the image. You can pass an llm_client and llm_model when creating the MarkItDown instance and it will call the model to generate a text description.
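A rough sketch of that setup, assuming the `openai` package is installed and `OPENAI_API_KEY` is set. The model name `gpt-4o` is only an example, and `describe_image` and `has_text` are hypothetical helpers, not part of markitdown.

```python
from typing import Optional


def describe_image(path: str, model: str = "gpt-4o") -> str:
    """Ask an LLM to describe an image through MarkItDown and return the text."""
    # Imported lazily so has_text can be used without these packages installed.
    from markitdown import MarkItDown
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    md = MarkItDown(llm_client=client, llm_model=model)
    return md.convert(path).text_content


def has_text(text_content: Optional[str]) -> bool:
    """Return True when a conversion produced something other than whitespace."""
    return bool(text_content and text_content.strip())


# Example call (needs network access and an API key):
# text = describe_image("scan.png")
# print(text if has_text(text) else "conversion returned no text")
```

Note that the result is the model's description of the image, which may or may not reproduce the text in it verbatim.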

If you need actual OCR (extracting the text that appears in the image), the markitdown-ocr plugin handles that. You can install it separately and it hooks in automatically.

So the short answer: built-in support is LLM description, OCR is via plugin.
