Commit 315d19b

Initial commit for public release.
0 parents  commit 315d19b

95 files changed (+5150, -0 lines)

.github/contexts-example.png (132 KB)

.github/gh-teaser.png (100 KB)

.github/system.png (146 KB)

.gitignore

Lines changed: 147 additions & 0 deletions
@@ -0,0 +1,147 @@
# Output directories
outputs/
multirun/
ray_results/

# Byte-compiled / optimized / DLL files
__pycache__/
*.py[cod]
*$py.class

# C extensions
*.so

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
share/python-wheels/
*.egg-info/
.installed.cfg
*.egg
MANIFEST

# PyInstaller
# Usually these files are written by a python script from a template
# before PyInstaller builds the exe, so as to inject date/other infos into it.
*.manifest
*.spec

# Installer logs
pip-log.txt
pip-delete-this-directory.txt

# Unit test / coverage reports
htmlcov/
.tox/
.nox/
.coverage
.coverage.*
.cache
nosetests.xml
coverage.xml
*.cover
*.py,cover
.hypothesis/
.pytest_cache/
cover/

# Translations
*.mo
*.pot

# Django stuff:
*.log
local_settings.py
db.sqlite3
db.sqlite3-journal

# Flask stuff:
instance/
.webassets-cache

# Scrapy stuff:
.scrapy

# Sphinx documentation
docs/_build/

# PyBuilder
.pybuilder/
target/

# Jupyter Notebook
.ipynb_checkpoints

# IPython
profile_default/
ipython_config.py

# pyenv
# For a library or package, you might want to ignore these files since the code is
# intended to run in multiple environments; otherwise, check them in:
# .python-version

# pipenv
# According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
# However, in case of collaboration, if having platform-specific dependencies or dependencies
# having no cross-platform support, pipenv may install dependencies that don't work, or not
# install all needed dependencies.
#Pipfile.lock

# PEP 582; used by e.g. github.com/David-OConnor/pyflow
__pypackages__/

# Celery stuff
celerybeat-schedule
celerybeat.pid

# SageMath parsed files
*.sage.py

# Environments
.env
.venv
env/
venv/
ENV/
env.bak/
venv.bak/
.python-version

# Spyder project settings
.spyderproject
.spyproject

# Rope project settings
.ropeproject

# mkdocs documentation
/site

# mypy
.mypy_cache/
.dmypy.json
dmypy.json

# Pyre type checker
.pyre/

# pytype static type analyzer
.pytype/

# Cython debug symbols
cython_debug/

# IDE
.idea/

Datasets.md

Lines changed: 92 additions & 0 deletions
@@ -0,0 +1,92 @@
We use various synthetic and real datasets. More info is in Appendix F of the supplementary material. Some preprocessing scripts are included in [`tools/`](tools).

| Dataset | Type | Remarks |
|:-------:|:-----:|:--------|
| [MJSynth](https://www.robots.ox.ac.uk/~vgg/data/text/) | synthetic | Case-sensitive annotations were extracted from the image filenames |
| [SynthText](https://www.robots.ox.ac.uk/~vgg/data/scenetext/) | synthetic | Processed with [`crop_by_word_bb_syn90k.py`](https://github.com/FangShancheng/ABINet/blob/main/tools/crop_by_word_bb_syn90k.py) |
| [IC13](https://rrc.cvc.uab.es/?ch=2) | real | Three archives: 857, 1015, 1095 (full) |
| [IC15](https://rrc.cvc.uab.es/?ch=4) | real | Two archives: 1811, 2077 (full) |
| [CUTE80](http://cs-chan.com/downloads_cute80_dataset.html) | real | \[1\] |
| [IIIT5k](https://cvit.iiit.ac.in/research/projects/cvit-projects/the-iiit-5k-word-dataset) | real | \[1\] |
| [SVT](http://vision.ucsd.edu/~kai/svt/) | real | \[1\] |
| [SVTP](https://openaccess.thecvf.com/content_iccv_2013/html/Phan_Recognizing_Text_with_2013_ICCV_paper.html) | real | \[1\] |
| [ArT](https://rrc.cvc.uab.es/?ch=14) | real | \[2\] |
| [COCO-Text v1.4](https://rrc.cvc.uab.es/?ch=5) | real | \[2\] |
| [LSVT](https://rrc.cvc.uab.es/?ch=16) | real | \[2\] |
| [MLT19](https://rrc.cvc.uab.es/?ch=15) | real | \[2\] |
| [RCTW17](https://rctw.vlrlab.net/dataset.html) | real | \[2\] |
| [ReCTS](https://rrc.cvc.uab.es/?ch=12) | real | \[2\] |
| [Uber-Text](https://s3-us-west-2.amazonaws.com/uber-common-public/ubertext/index.html) | real | \[2\] |
| [COCO-Text v2.0](https://bgshih.github.io/cocotext/) | real | Processed with [`coco_2_converter.py`](tools/coco_2_converter.py) |
| [OpenVINO](https://proceedings.mlr.press/v157/krylov21a.html) | real | [Annotations](https://storage.openvinotoolkit.org/repositories/openvino_training_extensions/datasets/open_images_v5_text/) for a subset of [Open Images](https://github.com/cvdfoundation/open-images-dataset). Processed with [`openvino_converter.py`](tools/openvino_converter.py). |
| [TextOCR](https://textvqa.org/textocr/) | real | Annotations for a subset of Open Images. Processed with [`textocr_converter.py`](tools/textocr_converter.py). A _horizontal_ version can be generated by passing `--rectify_pose`. |

\[1\] Case-sensitive annotations from [Long and Yao](https://github.com/Jyouhou/Case-Sensitive-Scene-Text-Recognition-Datasets) + [our corrections](https://github.com/baudm/Case-Sensitive-Scene-Text-Recognition-Datasets). Processed with [`case_sensitive_str_datasets_converter.py`](tools/case_sensitive_str_datasets_converter.py).<br/>
\[2\] Archives used as-is from [Baek et al.](https://github.com/ku21fan/STR-Fewer-Labels/blob/main/data.md) They are included in the dataset release for convenience. Please refer to their work for more info about the datasets.

The preprocessed archives are available here: [val + test + most of train](https://drive.google.com/drive/folders/1NYuoi7dfJVgo-zUJogh8UQZgIMpLviOE), [TextOCR + OpenVINO](https://drive.google.com/drive/folders/1D9z_YJVa6f-O0juni-yG5jcwnhvYw-qC)

The expected filesystem structure is as follows:
```
data
├── test
│   ├── ArT
│   ├── COCOv1.4
│   ├── CUTE80
│   ├── IC13_1015
│   ├── IC13_1095  # Full IC13 test set. Typically not used for benchmarking but provided here for convenience.
│   ├── IC13_857
│   ├── IC15_1811
│   ├── IC15_2077
│   ├── IIIT5k
│   ├── SVT
│   ├── SVTP
│   └── Uber
├── train
│   ├── real
│   │   ├── ArT
│   │   │   ├── train
│   │   │   └── val
│   │   ├── COCOv2.0
│   │   │   ├── train
│   │   │   └── val
│   │   ├── LSVT
│   │   │   ├── test
│   │   │   ├── train
│   │   │   └── val
│   │   ├── MLT19
│   │   │   ├── test
│   │   │   ├── train
│   │   │   └── val
│   │   ├── OpenVINO
│   │   │   ├── train_1
│   │   │   ├── train_2
│   │   │   ├── train_5
│   │   │   ├── train_f
│   │   │   └── validation
│   │   ├── RCTW17
│   │   │   ├── test
│   │   │   ├── train
│   │   │   └── val
│   │   ├── ReCTS
│   │   │   ├── test
│   │   │   ├── train
│   │   │   └── val
│   │   ├── TextOCR
│   │   │   ├── train
│   │   │   └── val
│   │   └── Uber
│   │       ├── train
│   │       └── val
│   └── synth
│       ├── MJ
│       │   ├── test
│       │   ├── train
│       │   └── val
│       └── ST
└── val
    ├── IC13
    ├── IC15
    ├── IIIT5k
    └── SVT
```
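
After extracting the archives, it can help to confirm that the layout matches the tree above before starting a run. The following is a minimal sketch, not part of this repository: the helper name `find_missing_dirs`, the `EXPECTED_DIRS` list, and the `data` root path are illustrative assumptions, and the list covers only a subset of the directories shown above.

```python
from pathlib import Path

# Hypothetical subset of the directories from the tree above.
# Extend it to cover whichever datasets you actually use.
EXPECTED_DIRS = [
    "test/CUTE80",
    "test/IC13_857",
    "test/IC15_1811",
    "test/IIIT5k",
    "test/SVT",
    "test/SVTP",
    "train/real",
    "train/synth/MJ",
    "train/synth/ST",
    "val/IC13",
    "val/IC15",
    "val/IIIT5k",
    "val/SVT",
]


def find_missing_dirs(root, expected=EXPECTED_DIRS):
    """Return the expected subdirectories that do not exist under `root`."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_dir()]


if __name__ == "__main__":
    # "data" is assumed to be the dataset root, relative to the working directory.
    missing = find_missing_dirs("data")
    if missing:
        print("Missing directories:")
        for rel in missing:
            print(f"  data/{rel}")
    else:
        print("All expected directories are present.")
```

The check only verifies that the directories exist; it does not inspect their contents or validate the archives themselves.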
