Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,37 @@
|
|
1 |
---
|
2 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: mit
|
3 |
+
datasets:
|
4 |
+
- bsmock/pubtables-1m
|
5 |
+
tags:
|
6 |
+
- table structure recognition
|
7 |
+
- table extraction
|
8 |
---
|
9 |
+
|
10 |
+
# Model Card for TATR-v1.1-Pub
|
11 |
+
|
12 |
+
This repo contains the model weights for TATR (Table Transformer) v1.1 trained on the PubTables-1M dataset, using the training details in the paper: ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716).
|
13 |
+
|
14 |
+
These model weights are intended to be used with [the Microsoft implementation of Table Transformer (TATR)](https://github.com/microsoft/table-transformer).
|
15 |
+
|
16 |
+
This model (v1.1) was trained with additional image cropping compared to [v1.0](https://huggingface.co/bsmock/tatr-pubtables1m-v1.0) and works best on tightly cropped table images (5 pixels or less).
|
17 |
+
It was also trained for more epochs, and as a result it outperforms the original model on PubTables-1M.
|
18 |
+
|
19 |
+
Evaluation metrics in the paper were computed with the PubTables-1M v1.1 dataset, which tightly crops the table images in the test and validation splits.
|
20 |
+
Table images in PubTables-1M v1.0, on the other hand, have ~30 pixels of padding in all three splits (train, test, and val).
|
21 |
+
|
22 |
+
Model weights that can be loaded into the Hugging Face implementation of TATR are coming soon.
|
23 |
+
|
24 |
+
## Model Details
|
25 |
+
|
26 |
+
### Model Description
|
27 |
+
|
28 |
+
- **Developed by:** Brandon Smock and Rohith Pesala, while at Microsoft
|
29 |
+
- **License:** MIT
|
30 |
+
- **Finetuned from model:** DETR ResNet-18
|
31 |
+
|
32 |
+
### Model Sources
|
33 |
+
|
34 |
+
Please see the following for more details:
|
35 |
+
|
36 |
+
- **Repository:** ["https://github.com/microsoft/table-transformer"](https://github.com/microsoft/table-transformer)
|
37 |
+
- **Paper:** ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716)
|