How can I extract data from files? #5405
Unanswered
bolodecenouracomcafe
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Unfortunately, at the moment there is no such feature for Amazon OpenSearch Ingestion. I'd suggest both filing a feature request and perhaps checking out places like https://www.aryn.ai/ that offer specialized solutions for this kind of thing. You could potentially use Aryn to generate the json which would be the "Extract and Transform" part of the ETL pipeline and then have Data Prepper grab the final json, but the actual parsing of the unstructured data in your PDF's can't be handled by Data Prepper by itself. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello, community,
I have been using the Ingest Attachment plugin to ingest my PDF (and other) files, and I would like to migrate to Data Prepper (Amazon OpenSearch Ingestion) for the ingestion process.
How can I extract data from files with Data Prepper? Is there a processor to extract data from files?
Beta Was this translation helpful? Give feedback.
All reactions