READ revolutionizes access to handwritten documents

From the Middle Ages to today, from old Greek to modern English, from running text to tables or forms

About

READ's mission is to revolutionize access to archival documents with the support of cutting-edge technology such as Handwritten Text Recognition (HTR) and Keyword Spotting (KWS).

Learn more

Network

READ addresses archives and libraries, humanities scholars, family historians, volunteers - and computer scientists

Learn more

Research

Research in READ comprises exciting fields such as Artificial Intelligence, Pattern Recognition, Machine Learning and Natural Language Processing.

Learn more

Services

READ technology is available via the service platform Transkribus. Upload documents, train a Handwritten Text Recognition (HTR) model, process text and follow the progress of the project.

Learn more

Recent Posts

+ HTR+ reads old Slavonic documents with 3-5 % Character Error Rate

Recently our new HTR+ was tested on different styles of Church Slavonic handwritings by Achim Rabus, who is holding the Chair of Slavic Linguistics at the University of Freiburg in Germany. With Transkribus’ technology the error rates went down to 3 to 5 percent. Superscript letters, abbreviations and word separation are the challenges the HTR+ had to deal with.

A paper on the topic of recognizing handwritten text in Slavic manuscripts with Transkribus is about to be publicised by Achim Rabus. Within this project he discovered the potential of Transkribus when it comes to the digitizing of Church Slavonic manuscripts: the possibility to search in big documents without even having a special model for the individual handwriting and the opportunity to avoid a full manual transcription and instead just correcting the mistakes of the automated transcription makes “digitisation-life” a lot easier.

Part of the models Achim Rabus has trained already contain different hands and provide useful automatic transcripts. Nevertheless the READ-Team is working on further improving Transkribus in the way, that also for documents with mixed handwritings automatic transcripts with low character error rate can be produced.

Cooperation is the key for getting out the biggest benefit for everybody. That is also what Achim Rabus is convinced of and therefore he is happy to share his model with interested people. You can get in touch with him via email: achim.rabus@slavistik.uni-freiburg.de

You can have a look at the draft of the paper Recognizing handwritten text in Slavic manuscripts: A neural-network approach using Transkribus under the following link: https://www.academia.edu/38835297/Recognizing_handwritten_text_in_Slavic_manuscripts_A_neural-network_approach_using_Transkribus_1_Achim_Rabus

Source: Rabus, Achim: Recognizing handwritten text in Slavic manuscripts: A neural-network approach using Transkribus

 

+ Transkribus-support for DIGITENS

Transkribus now helps to produce a digital encyclopaedia, containing articles regarding sociability during the Age of British enlightenment. This should be achieved within the H2020 DIGITENS-project coordinated by the University of Western Brittany (UBO) in Brest, France, which gives young scholars the chance to get familiar with new digital humanities research tools and with the work in archives. At the same time the project opens them up to the opportunity to spend time abroad and therefore supports mobility.

For the DIGITENS-project, it is important to have a standardized workflow in order to work efficiently. This is where Transkribus comes into play. Our software makes it possible to cover the whole workflow from scanning with the ScanTent and the DocScan app up to international cooperation in using Handwritten Text Recognition models. This way Transkribus can give the project the required infrastructure.

To give the scholars an insight into the work with Transkribus, we have organised a workshop in Brest at the UBO on 22nd of May.

For more information about the project, visit the DIGITENS website: https://www.univ-brest.fr/digitens/

The project is coordinated by the GIS Sociability and the research lab HCTI. The DIGITENS encyclopedia will soon be available online at the following address:  http://www.digitens.fr/1/accueil

http://www.digitens.fr/1/accueil