Skip to content

Commit

Permalink
Update strawberryrunners.md
Browse files Browse the repository at this point in the history
Add line break to start list formatting
  • Loading branch information
alliomeria authored Jan 17, 2025
1 parent f26ddc7 commit d039fc0
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions docs/strawberryrunners.md
Original file line number Diff line number Diff line change
Expand Up @@ -13,6 +13,7 @@ tags:
Archipelago's [Strawberry Runners (SBR)](https://github.com/esmero/strawberry_runners) module provides provides a set of post-processing capabilities for the JSON based metadata, files and entities that comprise your Archipelago Digital Objects (ADOs). These post-processing actions are based on dispatched events, direct http calls, and invoked webhooks from partner services (such as Min.io, AWS S3 or self-invoked).

The default Archipelago SBR post-processor configurations include operations that:

- perform page-based HOCR/OCR for image and pdf-based ADOs, send the output to the Search API, and use Natural Language Processing to extract entities from the output
- extract text from pages within a Webarchives File and send the output to the Search API
- convert WARC format Webarchives Files into WACZ format and attach the new WACZ file to the original source ADO to complement the WARC original
Expand Down

0 comments on commit d039fc0

Please sign in to comment.