bsmock commited on
Commit
bd1de99
1 Parent(s): 1c66175

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md CHANGED
@@ -1,3 +1,37 @@
1
  ---
2
  license: mit
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ datasets:
4
+ - bsmock/pubtables-1m
5
+ tags:
6
+ - table structure recognition
7
+ - table extraction
8
  ---
9
+
10
+ # Model Card for TATR-v1.1-Pub
11
+
12
+ This repo contains the model weights for TATR (Table Transformer) v1.1 trained on the PubTables-1M dataset, using the training details in the paper: ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716).
13
+
14
+ These model weights are intended to be used with [the Microsoft implementation of Table Transformer (TATR)](https://github.com/microsoft/table-transformer).
15
+
16
+ This model (v1.1) was trained with additional image cropping compared to [v1.0](https://huggingface.co/bsmock/tatr-pubtables1m-v1.0) and works best on tightly cropped table images (5 pixels or less).
17
+ It was also trained for more epochs, and as a result it outperforms the original model on PubTables-1M.
18
+
19
+ Evaluation metrics in the paper were computed with the PubTables-1M v1.1 dataset, which tightly crops the table images in the test and validation splits.
20
+ Table images in PubTables-1M v1.0, on the other hand, have ~30 pixels of padding in all three splits (train, test, and val).
21
+
22
+ Model weights that can be loaded into the Hugging Face implementation of TATR are coming soon.
23
+
24
+ ## Model Details
25
+
26
+ ### Model Description
27
+
28
+ - **Developed by:** Brandon Smock and Rohith Pesala, while at Microsoft
29
+ - **License:** MIT
30
+ - **Finetuned from model:** DETR ResNet-18
31
+
32
+ ### Model Sources
33
+
34
+ Please see the following for more details:
35
+
36
+ - **Repository:** ["https://github.com/microsoft/table-transformer"](https://github.com/microsoft/table-transformer)
37
+ - **Paper:** ["Aligning benchmark datasets for table structure recognition"](https://arxiv.org/abs/2303.00716)