|
23 | 23 | - Align the input/output format with RapidTable
|
24 | 24 | - Support automatic model download
|
25 | 25 | - Add a new table classification model from Paddle
|
| 26 | + - Add benchmark scores for the latest PaddleX table recognition model |
26 | 27 |
|
27 | 28 | ### Introduction
|
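28 | 29 | 💖This repository is an inference library for structured recognition of tables in documents. It includes the wired and lineless table recognition models from Alibaba Duguang, a wired table model contributed by llaipython (WeChat), the table classification model built into NetEase Qanything, and more.\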
|
|
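56 | 57 | Surya-Tabled uses its built-in OCR module, and its table model is a row/column recognition model that cannot detect merged cells, which leads to its lower scores.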
|
57 | 58 |
|
58 | 59 | | Method                                                                                                   | TEDS | TEDS-only-structure |
|
59 | | -|:---------------------------------------------------------------------------------------------------------|:-----------:|:-------------------:| |
60 | | -| [surya-tabled(--skip-detect)](https://github.com/VikParuchuri/tabled) | 0.33437 | 0.65865 | |
61 | | -| [surya-tabled](https://github.com/VikParuchuri/tabled) | 0.33940 | 0.67103 | |
62 | | -| [deepdoctection(table-transformer)](https://github.com/deepdoctection/deepdoctection?tab=readme-ov-file) | 0.59975 | 0.69918 | |
63 | | -| [ppstructure_table_master](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure) | 0.61606 | 0.73892 | |
64 | | -| [ppsturcture_table_engine](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure) | 0.67924 | 0.78653 | |
65 | | -| [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) | 0.67310 | 0.81210 | |
66 | | -| [RapidTable(SLANet)](https://github.com/RapidAI/RapidTable) | 0.71654 | 0.81067 | |
67 | | -| table_cls + wired_table_rec v1 + lineless_table_rec | 0.75288 | 0.82574 | |
68 | | -| table_cls + wired_table_rec v2 + lineless_table_rec | 0.77676 | 0.84580 | |
69 | | -| [RapidTable(SLANet-plus)](https://github.com/RapidAI/RapidTable) | 0.84481 | 0.91369 | |
70 | | -| [RapidTable(unitable)](https://github.com/RapidAI/RapidTable) | **0.86200** | **0.91813** | |
| 60 | +|:---------------------------------------------------------------------------------------------------------|:-----------:|:-----------------:| |
| 61 | +| [surya-tabled(--skip-detect)](https://github.com/VikParuchuri/tabled) | 0.33437 | 0.65865 | |
| 62 | +| [surya-tabled](https://github.com/VikParuchuri/tabled) | 0.33940 | 0.67103 | |
| 63 | +| [deepdoctection(table-transformer)](https://github.com/deepdoctection/deepdoctection?tab=readme-ov-file) | 0.59975 | 0.69918 | |
| 64 | +| [ppstructure_table_master](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure) | 0.61606 | 0.73892 | |
| 65 | +| [ppsturcture_table_engine](https://github.com/PaddlePaddle/PaddleOCR/tree/main/ppstructure) | 0.67924 | 0.78653 | |
| 66 | +| [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) | 0.67310 | 0.81210 | |
| 67 | +| [RapidTable(SLANet)](https://github.com/RapidAI/RapidTable) | 0.71654 | 0.81067 | |
| 68 | +| table_cls + wired_table_rec v1 + lineless_table_rec | 0.75288 | 0.82574 | |
| 69 | +| table_cls + wired_table_rec v2 + lineless_table_rec | 0.77676 | 0.84580 | |
| 70 | +| [PaddleX(SLANeXt+RT-DETR)](https://github.com/PaddlePaddle/PaddleX) | 0.79900 | **0.92222** | |
| 71 | +| [RapidTable(SLANet-plus)](https://github.com/RapidAI/RapidTable) | 0.84481 | 0.91369 | |
| 72 | +| [RapidTable(unitable)](https://github.com/RapidAI/RapidTable) | **0.86200** | 0.91813 | |
71 | 73 |
|
72 | 74 | ### Usage recommendations
|
73 | 75 | wired_table_rec_v2 (highest accuracy on wired tables): wired tables in general scenarios (papers, magazines, journals, receipts, vouchers, bills)
|
|
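The bold markers moved in this diff: the new PaddleX row takes the best TEDS-only-structure score, while RapidTable(unitable) keeps the best TEDS. A minimal sketch for re-checking that, assuming the scores below are transcribed correctly from the added (`+`) rows of the table above. `SCORES` and `best_by` are illustrative names, not part of the repository, and the PaddleX label is shortened.

```python
# Scores transcribed from the added (+) rows of the benchmark table.
# Each entry is (method label, TEDS, TEDS-only-structure).
SCORES = [
    ("surya-tabled(--skip-detect)", 0.33437, 0.65865),
    ("surya-tabled", 0.33940, 0.67103),
    ("deepdoctection(table-transformer)", 0.59975, 0.69918),
    ("ppstructure_table_master", 0.61606, 0.73892),
    ("ppsturcture_table_engine", 0.67924, 0.78653),  # label spelled as in the table
    ("StructEqTable", 0.67310, 0.81210),
    ("RapidTable(SLANet)", 0.71654, 0.81067),
    ("table_cls + wired_table_rec v1 + lineless_table_rec", 0.75288, 0.82574),
    ("table_cls + wired_table_rec v2 + lineless_table_rec", 0.77676, 0.84580),
    ("PaddleX", 0.79900, 0.92222),  # shortened label for the new PaddleX row
    ("RapidTable(SLANet-plus)", 0.84481, 0.91369),
    ("RapidTable(unitable)", 0.86200, 0.91813),
]


def best_by(column: int) -> str:
    """Return the label of the row with the highest score in `column`.

    column 1 is TEDS, column 2 is TEDS-only-structure.
    """
    return max(SCORES, key=lambda row: row[column])[0]


best_teds = best_by(1)
best_structure = best_by(2)
print("Best TEDS:", best_teds)
print("Best TEDS-only-structure:", best_structure)
```

With these values, `best_teds` is the RapidTable(unitable) row and `best_structure` is the PaddleX row, which matches the bold markers in the updated table.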