Replies: 1 comment
-
|
I would use a vision language model for this one - you can turn each xhtml page into a png and then ingest as images. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hey team, does anybody know any good utility for saving XHTML to PDF?
I tested many libraries (pandoc, wkhtmltopdf, weasyprint and others), but none gave a good conversion, the only good conversion from XHTML to PDF is by print saving to PDF in Chromium, which is a very slow operation
example XHTML doc
example HTML doc
The ultimate goal is XHTML -> Markdown. However, none of the major conversion libraries supports XHTML
cc @emrgnt-cmplxty for visibility
Beta Was this translation helpful? Give feedback.
All reactions