TrOCR Kurrent-Model 19th century

Base model: microsoft/trocr-base-handwritten

Train Lines: 292'997
Eval Lines: 7'513
Test Lines: 15'817

Epochs: 19.66 / 20
Eval CER: 0.02827
Test CER: 0.02655
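
For readers unfamiliar with the metric, the sketch below shows one common way to compute a character error rate (CER): the total Levenshtein edit distance between predictions and references, divided by the total number of reference characters. This card does not state how its CER values were computed (for example, whether text was normalized or whether the score was averaged per line), so the function names `levenshtein` and `cer` and the corpus-level aggregation are assumptions for illustration only. Under this definition, a CER of 0.02655 would correspond to roughly 2.7 character errors per 100 reference characters.

```python
from typing import Sequence


def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of character insertions, deletions and substitutions
    needed to turn `hyp` into `ref` (dynamic programming, row by row)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level CER (an assumed aggregation, not confirmed by the card):
    summed edit distance divided by summed reference length."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(levenshtein(r, h) for r, h in zip(references, hypotheses))
    return total_edits / total_chars


if __name__ == "__main__":
    # One substituted character in ten reference characters.
    print(cer(["Haus", "Garten"], ["Hans", "Garten"]))
```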

Fine-tuned on a Kurrent dataset containing:

  • Material from the State Archives of Zurich ("Regierungsratsprotokolle"), provided by the State Archives of Zurich
  • Lecture notes from the Humboldt lectures, provided by the Berlin-Brandenburg Academy of Sciences and Humanities
  • Diary of Eugen Huber, provided by the University of Zurich
  • Handwriting and copies by and of Gottfried Semper
  • Konzilsprotokolle, University of Greifswald (19th century)
  • as well as many other smaller collections/examples

The model has not been extensively tested. Potential biases are still to be identified.
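
A minimal sketch of how a TrOCR checkpoint is typically run on a single text-line image with the `transformers` library is shown below. The repository id of this fine-tuned model is not stated in the card, so the base model id stands in as the default, and `transcribe_line` and its parameters are hypothetical names. The sketch assumes `transformers`, `torch` and `Pillow` are installed and that the input is an image already cropped to one line of handwriting.

```python
# Assumption: the fine-tuned checkpoint's repository id is not given in this
# card, so the base model id is used as a stand-in default.
BASE_MODEL_ID = "microsoft/trocr-base-handwritten"


def transcribe_line(image_path: str, model_id: str = BASE_MODEL_ID) -> str:
    """Transcribe one cropped text-line image with a TrOCR checkpoint."""
    # Heavy dependencies are imported here so that merely defining the
    # function does not load them.
    from PIL import Image
    from transformers import TrOCRProcessor, VisionEncoderDecoderModel

    processor = TrOCRProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)

    # TrOCR expects RGB input; the processor resizes and normalizes it.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Autoregressive decoding with the model's default generation settings.
    generated_ids = model.generate(pixel_values)
    return processor.batch_decode(generated_ids, skip_special_tokens=True)[0]
```

Calling `transcribe_line("line.png", model_id="<this model's repository id>")` would then return the predicted transcription as a string; the file name and the placeholder id are illustrative. Loading the processor and model once and reusing them is preferable when transcribing many lines.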
