Skip to content

Commit 8cd475f

Browse files
committed
Remove pandas dependency in minimal requirements
1 parent 99e019f commit 8cd475f

24 files changed

+486
-213
lines changed

MANIFEST.in

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
include *.txt
2+
recursive-include extras *.txt
3+
recursive-include textractor *

NOTICE

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
11
Amazon Textract Textractor
2-
Copyright 2019 Amazon.com, Inc. or its affiliates. All Rights Reserved.
2+
Copyright 2022 Amazon.com, Inc. or its affiliates. All Rights Reserved.

README.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -14,13 +14,14 @@ If you are looking for the other amazon-textract-* packages, you can find them u
1414

1515
## Installation
1616

17-
Textractor is available on PyPI and can be installed with `pip install amazon-textract-textractor`. By default this will install the minimal version of textractor. The following extras can be used to add features:
17+
Textractor is available on PyPI and can be installed with `pip install amazon-textract-textractor`. By default this will install the minimal version of Textractor which is suitable for lambda execution. The following extras can be used to add features:
1818

19-
- `pdf` (`pip install amazon-textract-textractor[pdf]`) includes `pdf2image` and enables PDF rasterization in Textractor. Note that this is **not** necessary to call Textract with a PDF file.
20-
- `torch` (`pip install amazon-textract-textractor[torch]`) includes `sentence_transformers` for better word search and matching. This will work on CPU but be noticeably slower than non-machine learning based approaches.
21-
- `dev` (`pip install amazon-textract-textractor[dev]`) includes all the dependencies above and everything else needed to test the code.
19+
- `pandas` (`pip install "amazon-textract-textractor[pandas]"`) installs pandas which is used to enable DataFrame and CSV exports.
20+
- `pdf` (`pip install "amazon-textract-textractor[pdf]"`) includes `pdf2image` and enables PDF rasterization in Textractor. Note that this is **not** necessary to call Textract with a PDF file.
21+
- `torch` (`pip install "amazon-textract-textractor[torch]"`) includes `sentence_transformers` for better word search and matching. This will work on CPU but be noticeably slower than non-machine learning based approaches.
22+
- `dev` (`pip install "amazon-textract-textractor[dev]"`) includes all the dependencies above and everything else needed to test the code.
2223

23-
You can pick several extras by separating the labels with commas like this `pip install amazon-textract-textractor[pdf,torch]`.
24+
You can pick several extras by separating the labels with commas like this `pip install "amazon-textract-textractor[pdf,torch]"`.
2425

2526
## Documentation
2627

401 KB
Loading

docs/source/notebooks/table_data_to_various_formats.ipynb

Lines changed: 73 additions & 11 deletions
Large diffs are not rendered by default.

extras/pandas.txt

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
numpy==1.21.*
2+
pandas

requirements.txt

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,10 @@
1-
numpy==1.21.*
21
awscli
32
amazon-textract-response-parser==0.1.33
43
amazon-textract-caller==0.0.24
54
boto3==1.24.*
65
botocore==1.27.90
76
jsonschema
87
Pillow
9-
pandas
108
tabulate==0.8.10
119
XlsxWriter==3.0.3
1210
pyxDamerauLevenshtein==1.7.1

setup.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ def read_requirements(path):
1616
setup(
1717
# include data files
1818
name="amazon-textract-textractor",
19-
version="1.0.5",
19+
version="1.0.6",
2020
description="A package to use AWS Textract services.",
2121
long_description=long_description,
2222
long_description_content_type="text/markdown",

textractor/cli/cli.py

Lines changed: 212 additions & 64 deletions
Large diffs are not rendered by default.

textractor/data/constants.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -209,6 +209,7 @@ class AnalyzeIDFields(Enum):
209209
# Only available in passports
210210
PLACE_OF_BIRTH = "PLACE_OF_BIRTH"
211211

212+
212213
class CLIPrint(Enum):
213214
ALL = 0
214215
TEXT = 1
@@ -218,6 +219,7 @@ class CLIPrint(Enum):
218219
EXPENSES = 5
219220
IDS = 6
220221

222+
221223
class CLIOverlay(Enum):
222224
ALL = 0
223225
WORDS = 1

0 commit comments

Comments
 (0)