Automated Text Recognition – Extracting Data via OCR/HTR

Automated or optical text recognition (OCR) is used to automatically capture text from digital images and thus generate searchable and analyzable data. The Mannheim University Library has many years of experience in digitization and with the use of various text recognition software.

The Research Data Center is happy to support researchers at the University of Mannheim along the entire workflow from digitization to layout and text recognition as well as training specialized models and structuring of the data.


  • Consulting on automated text recognition (OCR) for research projects
  • OCR recommender (work in progress)
  • Open OCR consultation hour: every 2nd Thursday of the month, from 3 to 4 p.m., without registration (link to Zoom meeting:, meeting ID: 682 8185 1819, ID code: 443071).

In our FAQs you will find answers to the most frequently asked questions about automated text recognition and the software used in the OCR-BW.

If the answer you are looking for is not listed, simply contact us by e-mail.

Projects and cooperations

If we can support you or if you have any questions, please do not hesitate to contact us.


Larissa Will, M.A.

Larissa Will, M.A.

Research Data Consultant (Digital Humanities)
University of Mannheim
University Library
Schloss Schneckenhof West – Room SW 273
68161 Mannheim