readme: add benchmark results
README.md
CHANGED
@@ -25,6 +25,41 @@ Version 1 of the Journaux-LM was pretrained on the following publicly available

In total, the pretraining corpus has a size of 408GB.

## Benchmarks (Named Entity Recognition)

We compare our Journaux-LM directly to the French Europeana BERT model (Journaux-LM is intended as its successor) on various downstream tasks from the [hmBench](https://github.com/stefan-it/hmBench) repository, which is focused on Named Entity Recognition.
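
For readers who want to set up the same comparison, here is a minimal sketch of loading both checkpoints for token classification with the Hugging Face `transformers` library. Only the Europeana BERT model ID is taken from the links below; the Journaux-LM hub ID and the label count are placeholders, since this README does not state them.

```python
from transformers import AutoModelForTokenClassification, AutoTokenizer

# Baseline: the French Europeana BERT model linked in this README.
BASELINE_ID = "dbmdz/bert-base-french-europeana-cased"

# Placeholder ID for Journaux-LM v1 -- the actual hub name is not given here.
JOURNAUX_ID = "journaux-lm-v1"  # hypothetical

for model_id in (BASELINE_ID, JOURNAUX_ID):
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # num_labels depends on the tag set of the respective hmBench dataset;
    # 9 (BIO tags over four entity types plus "O") is only an example.
    model = AutoModelForTokenClassification.from_pretrained(model_id, num_labels=9)
```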

We report the micro F1-score averaged over 5 runs with different seeds, and we use the best hyper-parameter configuration on the development set of each dataset to report the final test score.
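
A minimal sketch of this selection protocol is shown below; all numbers and configuration names are made up for illustration and are not actual hmBench output.

```python
from statistics import mean

# results[config][split] -> micro F1 per seed (5 seeds each).
# Illustrative values only -- not actual benchmark scores.
results = {
    "bs16_lr3e-05": {
        "dev":  [85.1, 85.6, 85.3, 85.9, 85.4],
        "test": [82.0, 82.4, 82.1, 82.6, 82.2],
    },
    "bs16_lr5e-05": {
        "dev":  [84.7, 85.0, 84.9, 85.2, 84.8],
        "test": [82.5, 82.9, 82.3, 82.7, 82.6],
    },
}

# Pick the configuration with the best seed-averaged dev F1 ...
best_config = max(results, key=lambda c: mean(results[c]["dev"]))

# ... and report its seed-averaged test F1 as the final score.
final_test_f1 = mean(results[best_config]["test"])
print(best_config, round(final_test_f1, 2))  # bs16_lr3e-05 82.26
```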

### Development Set

The results on the development set can be seen in the following table:

| Model \ Dataset     | [AjMC][1] | [ICDAR][2] | [LeTemps][3] | [NewsEye][4] | [HIPE-2020][5] | Avg.  |
|:--------------------|:----------|:-----------|:-------------|:-------------|:---------------|:------|
| [Europeana BERT][6] | 85.70     | 77.63      | 67.14        | 82.68        | 85.98          | 79.83 |
| Journaux-LM v1      | 86.25     | 78.51      | 67.76        | 84.07        | 88.17          | 80.95 |

Our Journaux-LM leads to an average performance boost of 1.12 points (absolute) compared to the French Europeana BERT model.

### Test Set

The final results on the test set can be seen here:

| Model \ Dataset     | [AjMC][1] | [ICDAR][2] | [LeTemps][3] | [NewsEye][4] | [HIPE-2020][5] | Avg.  |
|:--------------------|:----------|:-----------|:-------------|:-------------|:---------------|:------|
| [Europeana BERT][6] | 81.06     | 78.17      | 67.22        | 73.51        | 81.00          | 76.19 |
| Journaux-LM v1      | 83.41     | 77.73      | 67.11        | 74.48        | 83.14          | 77.17 |

Our Journaux-LM beats the French Europeana BERT model by 0.98 points (absolute) on average.
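
The averaged scores and both deltas can be re-derived from the per-dataset results; here is a small sanity check in Python, with the scores copied verbatim from the two tables:

```python
from statistics import mean

dev = {
    "Europeana BERT": [85.70, 77.63, 67.14, 82.68, 85.98],
    "Journaux-LM v1": [86.25, 78.51, 67.76, 84.07, 88.17],
}
test = {
    "Europeana BERT": [81.06, 78.17, 67.22, 73.51, 81.00],
    "Journaux-LM v1": [83.41, 77.73, 67.11, 74.48, 83.14],
}

for split, scores in (("dev", dev), ("test", test)):
    avgs = {model: round(mean(s), 2) for model, s in scores.items()}
    delta = round(avgs["Journaux-LM v1"] - avgs["Europeana BERT"], 2)
    # dev: 80.95 - 79.83 = 1.12; test: 77.17 - 76.19 = 0.98
    print(split, avgs, delta)
```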

[1]: https://github.com/hipe-eval/HIPE-2022-data/blob/main/documentation/README-ajmc.md
[2]: https://github.com/stefan-it/historic-domain-adaptation-icdar
[3]: https://github.com/hipe-eval/HIPE-2022-data/blob/main/documentation/README-letemps.md
[4]: https://github.com/hipe-eval/HIPE-2022-data/blob/main/documentation/README-newseye.md
[5]: https://github.com/hipe-eval/HIPE-2022-data/blob/main/documentation/README-hipe2020.md
[6]: https://huggingface.co/dbmdz/bert-base-french-europeana-cased

# Changelog

* 02.11.2024: Initial version of the model. More details are coming very soon!