Text extraction (Special math elements) #4550
Unanswered
Prasaderp
asked this question in
Looking for help
Replies: 2 comments 1 reply
-
Mathematical formulas usually are a conglomerate of text (written in a mixture of standard and specialized math fonts) plus vector graphics. |
Beta Was this translation helpful? Give feedback.
1 reply
-
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I was working around extracting text from pdf's. PyMuPDF is very good at extracting complex elements related to math. While I was working around it, I found out that it is not able to extract elements like fractions from the source PDF.
I have attached 2 screenshots of the input PDF and extracted text in a PPT using python PPTX.
If you could let me know, is it the limitation of the PyMupDF or python-pptx for this rendering issue
Beta Was this translation helpful? Give feedback.
All reactions