|
| 1 | +We use various synthetic and real datasets. More info is in Appendix F of the supplementary material. Some preprocessing scripts are included in [`tools/`](tools). |
| 2 | + |
| 3 | +| Dataset | Type | Remarks | |
| 4 | +|:-------:|:-----:|:--------| |
| 5 | +| [MJSynth](https://www.robots.ox.ac.uk/~vgg/data/text/) | synthetic | Case-sensitive annotations were extracted from the image filenames | |
| 6 | +| [SynthText](https://www.robots.ox.ac.uk/~vgg/data/scenetext/) | synthetic | Processed with [`crop_by_word_bb_syn90k.py`](https://github.com/FangShancheng/ABINet/blob/main/tools/crop_by_word_bb_syn90k.py) | |
| 7 | +| [IC13](https://rrc.cvc.uab.es/?ch=2) | real | Three archives: 857, 1015, 1095 (full) | |
| 8 | +| [IC15](https://rrc.cvc.uab.es/?ch=4) | real | Two archives: 1811, 2077 (full) | |
| 9 | +| [CUTE80](http://cs-chan.com/downloads_cute80_dataset.html) | real | \[1\] | |
| 10 | +| [IIIT5k](https://cvit.iiit.ac.in/research/projects/cvit-projects/the-iiit-5k-word-dataset) | real | \[1\] | |
| 11 | +| [SVT](http://vision.ucsd.edu/~kai/svt/) | real | \[1\] | |
| 12 | +| [SVTP](https://openaccess.thecvf.com/content_iccv_2013/html/Phan_Recognizing_Text_with_2013_ICCV_paper.html) | real | \[1\] | |
| 13 | +| [ArT](https://rrc.cvc.uab.es/?ch=14) | real | \[2\] | |
| 14 | +| [COCO-Text v1.4](https://rrc.cvc.uab.es/?ch=5) | real | \[2\] | |
| 15 | +| [LSVT](https://rrc.cvc.uab.es/?ch=16) | real | \[2\] | |
| 16 | +| [MLT19](https://rrc.cvc.uab.es/?ch=15) | real | \[2\] | |
| 17 | +| [RCTW17](https://rctw.vlrlab.net/dataset.html) | real | \[2\] | |
| 18 | +| [ReCTS](https://rrc.cvc.uab.es/?ch=12) | real | \[2\] | |
| 19 | +| [Uber-Text](https://s3-us-west-2.amazonaws.com/uber-common-public/ubertext/index.html) | real | \[2\] | |
| 20 | +| [COCO-Text v2.0](https://bgshih.github.io/cocotext/) | real | Processed with [`coco_2_converter.py`](tools/coco_2_converter.py) | |
| 21 | +| [OpenVINO](https://proceedings.mlr.press/v157/krylov21a.html) | real | [Annotations](https://storage.openvinotoolkit.org/repositories/openvino_training_extensions/datasets/open_images_v5_text/) for a subset of [Open Images](https://github.com/cvdfoundation/open-images-dataset). Processed with [`openvino_converter.py`](tools/openvino_converter.py). | |
| 22 | +| [TextOCR](https://textvqa.org/textocr/) | real | Annotations for a subset of Open Images. Processed with [`textocr_converter.py`](tools/textocr_converter.py). A _horizontal_ version can be generated by passing `--rectify_pose`. | |
| 23 | + |
| 24 | +\[1\] Case-sensitive annotations from [Long and Yao](https://github.com/Jyouhou/Case-Sensitive-Scene-Text-Recognition-Datasets) + [our corrections](https://github.com/baudm/Case-Sensitive-Scene-Text-Recognition-Datasets). Processed with [case_sensitive_str_datasets_converter.py](tools/case_sensitive_str_datasets_converter.py)<br/> |
| 25 | +\[2\] Archives used as-is from [Baek et al.](https://github.com/ku21fan/STR-Fewer-Labels/blob/main/data.md) They are included in the dataset release for convenience. Please refer to their work for more info about the datasets. |
| 26 | + |
| 27 | +The preprocessed archives are available here: [val + test + most of train](https://drive.google.com/drive/folders/1NYuoi7dfJVgo-zUJogh8UQZgIMpLviOE), [TextOCR + OpenVINO](https://drive.google.com/drive/folders/1D9z_YJVa6f-O0juni-yG5jcwnhvYw-qC) |
| 28 | + |
| 29 | +The expected filesystem structure is as follows: |
| 30 | +``` |
| 31 | +data |
| 32 | +├── test |
| 33 | +│ ├── ArT |
| 34 | +│ ├── COCOv1.4 |
| 35 | +│ ├── CUTE80 |
| 36 | +│ ├── IC13_1015 |
| 37 | +│ ├── IC13_1095 # Full IC13 test set. Typically not used for benchmarking but provided here for convenience. |
| 38 | +│ ├── IC13_857 |
| 39 | +│ ├── IC15_1811 |
| 40 | +│ ├── IC15_2077 |
| 41 | +│ ├── IIIT5k |
| 42 | +│ ├── SVT |
| 43 | +│ ├── SVTP |
| 44 | +│ └── Uber |
| 45 | +├── train |
| 46 | +│ ├── real |
| 47 | +│ │ ├── ArT |
| 48 | +│ │ │ ├── train |
| 49 | +│ │ │ └── val |
| 50 | +│ │ ├── COCOv2.0 |
| 51 | +│ │ │ ├── train |
| 52 | +│ │ │ └── val |
| 53 | +│ │ ├── LSVT |
| 54 | +│ │ │ ├── test |
| 55 | +│ │ │ ├── train |
| 56 | +│ │ │ └── val |
| 57 | +│ │ ├── MLT19 |
| 58 | +│ │ │ ├── test |
| 59 | +│ │ │ ├── train |
| 60 | +│ │ │ └── val |
| 61 | +│ │ ├── OpenVINO |
| 62 | +│ │ │ ├── train_1 |
| 63 | +│ │ │ ├── train_2 |
| 64 | +│ │ │ ├── train_5 |
| 65 | +│ │ │ ├── train_f |
| 66 | +│ │ │ └── validation |
| 67 | +│ │ ├── RCTW17 |
| 68 | +│ │ │ ├── test |
| 69 | +│ │ │ ├── train |
| 70 | +│ │ │ └── val |
| 71 | +│ │ ├── ReCTS |
| 72 | +│ │ │ ├── test |
| 73 | +│ │ │ ├── train |
| 74 | +│ │ │ └── val |
| 75 | +│ │ ├── TextOCR |
| 76 | +│ │ │ ├── train |
| 77 | +│ │ │ └── val |
| 78 | +│ │ └── Uber |
| 79 | +│ │ ├── train |
| 80 | +│ │ └── val |
| 81 | +│ └── synth |
| 82 | +│ ├── MJ |
| 83 | +│ │ ├── test |
| 84 | +│ │ ├── train |
| 85 | +│ │ └── val |
| 86 | +│ └── ST |
| 87 | +└── val |
| 88 | + ├── IC13 |
| 89 | + ├── IC15 |
| 90 | + ├── IIIT5k |
| 91 | + └── SVT |
| 92 | +``` |
0 commit comments