READ revolutionizes access to handwritten documents

From the Middle Ages to today, from old Greek to modern English, from running text to tables or forms

About

READ's mission is to revolutionize access to archival documents with the support of cutting-edge technology such as Handwritten Text Recognition (HTR) and Keyword Spotting (KWS).

Learn more

Network

READ addresses archives and libraries, humanities scholars, family historians, volunteers - and computer scientists

Learn more

Research

Research in READ comprises exciting fields such as Artificial Intelligence, Pattern Recognition, Machine Learning and Natural Language Processing.

Learn more

Services

READ technology is available via the service platform Transkribus. Upload documents, train a Handwritten Text Recognition (HTR) model, process text and follow the progress of the project.

Learn more

Recent Posts

+ Digital Scholarly Editing in Switzerland – new READ events in Zürich

Staatsarchiv Zürich (one of the READ partners) and the University of Zürich (part of the READ MOU network) are organising two events in Zürich.

Firstly, a Transkribus workshop will take place at the University of Zürich in October 2016.  Due to high levels of interest, the workshop will be held twice – on 20 and 21 October.  The Transkribus software, as well as other tools being developed by the READ project, will be presented.  Participants will have the chance to work with their own documents and see what Transkribus is capable of regarding transcriptions and editions.

Second, on 10 November 2016, eight speakers will introduce tools used for digital scholarly editing to a public audience at the University of Zürich.  A keynote speech will be delivered by Tobias Grüning from the University of Rostock who will explain how recurrent neural networks are being used in READ for Handwritten Text Recognition (HTR).  There will also be talks about other digital editing tools and projects including ChartEX, histhub, corpus corporume-Manuscripta and TUSTEP.

Both events are open to all.  More details can be found in the full programme.  For more information and to register, please contact Tobias Hodel.

+ READ partners in Passau – update on our latest meeting

Passau Diocesan Archives took on the task of hosting the latest READ project meeting between 20 and 22 September 2016.   Over 30 individuals from the 14 READ project partners met together in the pretty town of Passau in Southern Germany to discuss the current and future progress of our research into the Handwritten Text Recognition (HTR) of historical documents.

The first part of our meeting took the form of a public symposium before an audience of German archivists and researchers.  To see what went on, check out this short video from Passau Diocesan Archives where staff from the archive talk about the conference and their participation in READ.  The video is in German but non-speakers can scroll down to find out more about the symposium and the READ project meeting.

As the keynote speaker, Gerhard Fürmetz, director of the Bavarian State Archives used his presentation to show how digitisation has changed the way archives work.

Gerhard Fürmetz, Digitalisierung und die Archive – Herausforderungen, Arbeitsfelder und Perspektiven

Gerhard Fürmetz,

Gerhard Fürmetz [Image by Elena Mühlbauer]

We then heard from Dr Herbert Wurster, director of Passau Diocesan Archives.  His archive has a large digitised collection of handwritten sacramental registers and is working with READ to facilitate the searching of these records for information relating to person names and births, marriages and deaths.

Dr Herbert Wurster, Kirchenbücher als Quelle der Forschung und Herausforderung für die automatisierte Verarbeitung

Dr Herbert Wurster

Dr Herbert Wurster [Image by Elena Mühlbauer]

Next up, the coordinator of the READ project Dr Günter Mühlberger from the Digitisation and Digital Preservation Group (DEA) at the University of Innsbruck presented on READ and the Transkribus transcription platform.  His talk described how archives can access Transkribus and outlined the way in which Handwritten Text Recognition engines produce automatic transcriptions of handwritten material.

Dr Günter Mühlberger, Transkribus. Eine Forschungsumgebung zur Erkennung und Anreicherung historicher Dokumente

img_2019

Dr Günter Mühlberger [Image by Elena Mühlbauer]

Finally, Dr Florian Kleber from the Computer Vision Lab at Vienna University of Technology demonstrated how the READ project is working with large and varied datasets of transcribed historical material.  Dr Kleber also explained that research competitions play a vital role in enabling computer scientists to evaluate and improve the effectiveness of their tools.

Dr Florian Kleber, Datasets and competitions as a means to increase excellent research

Dr Florian Kleber

Dr Florian Kleber [Image by Elena Mühlbauer]

Once we said goodbye to everyone at the symposium, the READ project meeting could begin.  During the first sessions, each project partner was given the opportunity to share news about their major achievements, possible setbacks and next steps.  We met for dinner to bring our first day to a close and were welcomed to Passau with a short speech from Klaus Metzl, Vicar General, Very Reverend.

Who wants to count the laptops in the room?

There are always a lot of laptops in the room at a READ meeting! [Image by Elena Mühlbauer]

The next day we formed working groups to tackle questions surrounding technical issues, research competitions and dissemination of the project outputs.  Groups deliberated the best means of producing training data for HTR engines and ways to improve the accuracy of keyword searches of handwritten material.  The development of new tools was also discussed, including a Table Recognition tool which will make it easier for Transkribus users to transcribe text in tables.

In the evening we were treated to a walking tour of Passau and a look behind at the scenes at the new building of the Diocesan Archives. It has been built on stilts to shield precious documents from any flooding from the town’s three rivers and the floor has been painted a liturgical shade of purple!

Passau Diocesan Archives [Image by Laurent Bolli]

unspecified

Sunset in Passau [Image by Laurent Bolli]

We concluded proceedings on our last day with some SWOT analysis – what are the strengths and weaknesses of the READ project?  What might be its opportunities and threats?  After some fruitful discussion, there was just enough time for a group picture before we parted ways!

img_2037

The READ project [Image by Elena Mühlbauer]

Thank you Passau! If you want to find out more about our meeting, take a look back at our twitter feed.  We look forward to continuing the discussions at our next project meeting in Brussels!