Releases: OCR-D/core

v2.57.0

18 Oct 17:31
@kba kba

Fixed:

  • running a processor as a worker no longer dumps ocrd-tool.json or messes up logging, #1116

Changed:

  • logging: With ocrd_logging.conf (e.g. in Docker), log all messages DEBUG and up and log to ocrd.log, #1117

v2.56.0

13 Oct 08:33
@kba kba

Changed:

  • A separate logging dir tree structure for the modules (processing servers, processing workers, processor servers, mets servers, processing jobs). Configurable with env v
    • Processing job-level logging - each job is logged into a separate file with format {job_id}.log
    • Processing job-level logging file paths are added to the Job models and preserved in the database.
    • The ocrd_network logging is based on the format provided in ocrd_utils
  • Support env variable OCRD_NETWORK_SOCKETS_ROOT_DIR for setting the root directory for METS server sockets, #1111
  • An endpoint /job/{id}/log for getting the log file of a processing job of a processor, #1111
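
The job-level logging described above can be pictured with a minimal sketch that uses only Python's standard logging module. This is not the ocrd_network implementation: the helper name job_logger, the logger name, and the log format are assumptions made for illustration. Only the {job_id}.log file name is taken from the notes above.

```python
import logging
from pathlib import Path


def job_logger(log_dir: str, job_id: str) -> logging.Logger:
    """Return a logger writing to <log_dir>/<job_id>.log (hypothetical helper)."""
    Path(log_dir).mkdir(parents=True, exist_ok=True)
    log_file = Path(log_dir) / f"{job_id}.log"

    logger = logging.getLogger(f"job.{job_id}")
    logger.setLevel(logging.DEBUG)

    # Attach the file handler only once per job to avoid duplicated lines
    if not logger.handlers:
        handler = logging.FileHandler(log_file, encoding="utf-8")
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s - %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

Storing the resulting path alongside the job record is what would let an endpoint such as /job/{id}/log return the file later.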

v2.55.2

13 Oct 08:33
@kba kba

Fixed:

v2.55.1

12 Oct 10:14
@kba kba

Changed:

  • ocrd workspace bulk-add distinguishes between url and local_filename, supporting both, #1086, #1079, #1113

v2.55.0

11 Oct 11:55
@kba kba

Added:

  • /workflow endpoint that can handle ocrd process workflows and distribute jobs page-wise across workers, #1083, #1105, #1108, #1109

Changed:

  • METS Server: Make sockets world-readable and -writable, #1098, #1099
  • METS Server: Implement find_files support for local_filename and url, #1100
  • Logging: consistent logger names derived from ocrd., #1101
  • Logging: consistent logging across the packages, including ocrd_network, #1101
  • .. page range operator: allow single-page ranges, #1106, #1107
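
To illustrate what a single-page range means for the .. operator, here is a minimal sketch of range expansion over an ordered list of page IDs. The function expand_page_range and the page IDs used with it are hypothetical. The actual parsing in core may differ.

```python
from typing import List


def expand_page_range(spec: str, page_ids: List[str]) -> List[str]:
    """Expand 'A..B' into all page IDs from A to B inclusive, in document order.

    Hypothetical helper; raises ValueError if an ID is not in page_ids.
    """
    if ".." not in spec:
        return [spec]
    start, end = spec.split("..", 1)
    i, j = page_ids.index(start), page_ids.index(end)
    if i > j:
        raise ValueError(f"start of range {spec!r} comes after its end")
    # i == j is a valid single-page range and yields exactly one page
    return page_ids[i:j + 1]
```

With this sketch, a range whose start and end are the same page ID simply returns that one page instead of being rejected.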

v2.54.0

12 Sep 15:30
@kba kba

Added:

  • METS Server: providing concurrent additional access to the METS file for parallel processing, #966
  • Web API: Cache jobs in the processing server with an optional callback once processed, #1069
  • Web API: Lock pages output file groups of a workspace to prevent simultaneous non-additive access to workspaces, #1069
  • Web API: Support job dependency for caching complete fully-deterministic workflows, #1069
  • Web API: Processing server will start all ready requests, not just the first one, #1069
  • Web API: Workers will create their queue on demand, retry attempts configurable via OCRD_NETWORK_WORKER_QUEUE_CONNECT_ATTEMPTS, #1093
  • ocrd_utils.config to collect all configuration based on environment variables in one place, #1081
  • Processor CLI: Warn if a given page ID cannot be found in METS, #1088, #1089

Changed:

  • Processors now have worker and server subcommands, with separate --help, for starting processing worker/processor server, #1087
  • Move tf_disable_interactive_logs (to silence keras/tensorflow print statements) to ocrd_utils.logging and do not call on module-level, #1090, #1091
  • 🔥 We now properly distinguish between original and local-file FLocat; the original URL is no longer changed for downloads, #323, #1079
  • 🔥 logging has been streamlined to make core more usable as a library, #1080

v2.53.0

12 Sep 15:29
@kba kba

Fixed:

  • WorkspaceValidator: make the check for consistency of pc:Page[@pcGtsId] and mets:file[@ID] optional with the mets_fileid_page_pcgtsid skip flag, #1066
  • ocrd resmgr download: use basedir as an arg and not a kwarg, #1078

Changed:

  • WorkspaceValidator: Download files temporarily/on-demand, #1066
  • ocrd-* --version now prints only the version of the processor without noise or core version, #1068

Added:

  • Environment variables to control optional retries and timeouts for downloading files:
    • OCRD_DOWNLOAD_RETRIES: Number of times to retry failed attempts for downloads of workspace files. #1073
    • OCRD_DOWNLOAD_TIMEOUT: Timeout in seconds for connecting or reading (comma-separated) when downloading. #1073
  • Environment variables used throughout core are now documented in README and ocrd --help, #1073
  • Web API: --create-queue option to create the RabbitMQ queue on demand for processing workers, #1075
  • Web API: --queue-connect-attempts to retry connection to RabbitMQ in case server is not yet running, #1075
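
As an illustration of the download variables listed above, the following sketch reads OCRD_DOWNLOAD_RETRIES and OCRD_DOWNLOAD_TIMEOUT from the environment. The parsing rules are assumptions, not core's documented behavior: a single timeout value is taken to apply to both connecting and reading, and two comma-separated values are taken as (connect, read), which matches the convention of the requests library's timeout argument. The function names are hypothetical.

```python
import os
from typing import Optional, Tuple, Union

Timeout = Union[float, Tuple[float, float], None]


def parse_download_timeout(value: Optional[str]) -> Timeout:
    """Parse '10' or '10,30' into a float or a (connect, read) tuple (assumed format)."""
    if not value:
        return None  # unset or empty: no timeout configured
    parts = [float(p) for p in value.split(",")]
    if len(parts) == 1:
        return parts[0]
    if len(parts) == 2:
        return (parts[0], parts[1])
    raise ValueError(f"expected one or two comma-separated numbers, got {value!r}")


def parse_download_retries(value: Optional[str]) -> int:
    """Parse the retry count; unset or empty means no retries (assumed default)."""
    return int(value) if value else 0


if __name__ == "__main__":
    timeout = parse_download_timeout(os.environ.get("OCRD_DOWNLOAD_TIMEOUT"))
    retries = parse_download_retries(os.environ.get("OCRD_DOWNLOAD_RETRIES"))
    print(f"timeout={timeout!r} retries={retries}")
```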

v2.52.0

26 Jun 17:08
@kba kba

Added:

  • make deps-cuda: Makefile target to set up a working CUDA installation, both for native and Dockerfile.cuda, #1055
  • Implementation of the Standalone Processor Server module, #1030
  • ocrd_utils.guess_media_type to consistently try to determine media type from a file name, #1045
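
The idea of determining a media type from a file name can be sketched with Python's standard mimetypes module. This is an illustrative stand-in, not the code of ocrd_utils.guess_media_type: the function name guess_media_type_sketch and its fallback parameter are assumptions.

```python
import mimetypes


def guess_media_type_sketch(filename: str, fallback: str = "application/octet-stream") -> str:
    """Guess a media type from the file name alone (illustrative stand-in)."""
    media_type, _encoding = mimetypes.guess_type(filename)
    # guess_type returns (None, None) when the extension is unknown
    return media_type if media_type is not None else fallback
```

Routing every lookup through one such function keeps the fallback behavior in a single place instead of scattering extension checks across callers.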

Changed:

  • Refactoring the Network CLI, all network module CLI are in ocrd_network now, #1030
  • The Processing Server uses ocrd-all-tool.json file, removing local processor install dependencies, #1030
  • Overall improvement and refactoring of the ocrd_network package, #1030
  • Optionally skip deployment of mongodb and rabbitmq to make external usage/management possible, #1048
  • page_from_file now also accepts a (str) file path in addition to OcrdFile, #1045
  • packaging: install/uninstall in correct build order, use python -m build instead of python setup.py sdist bdist_wheel, #1051

Removed:

  • Obsolete travis CI configuration removed, #1056
  • Support for end-of-life Python versions 3.5 and 3.6, #1057

Fixed:

  • Makefile FIND_VERSION macro: use grep -E instead of grep -P for macOS compatibility, #1060
  • ocrd resmgr: detect HTTP errors as such and don't try to continue if HTTP >= 400, #1062
  • PageValidator: Ensure TextLine has coordinates when checking for Baseline containment, #1049

v2.51.0

07 Jun 11:31
@kba kba

Changed:

  • core-cuda Docker: CUDA base image working again, based on ocrd/core not nvidia/cuda in a separate Dockerfile.cuda, #1041
  • core-cuda Docker: adopt #1008 (venv under /usr/local, as in ocrd_all, instead of dist-packages), #1041
  • core-cuda Docker: use conda (micromamba) for CUDA toolkit, and nvidia-pyindex for CUDA libs – instead of nvidia/cuda base image, #1041
  • more robust workaround for shapely#1598, #1041

Removed:

  • Revert #882 (fastentrypoints) as it enforces deps versions at runtime
  • Drop ocrd_utils.package_resources and use pkg_resources.* directly, #1041
  • ocrd resmgr: Drop redundant (processor-provided) entries in the central resource_list.yml.

v2.50.0

24 Apr 11:41
@kba kba
Compare
Choose a tag to compare

Added:

  • 🔥 ocrd_network: Components related to OCR-D Web API, #974

Changed:

  • bashlib: support file input from multiple file groups, #1027, #1031

Fixed:

  • Don't output default docstrings for bashlib processors, #1026