Research publications

Journal Articles

Fronhöfer, A. and Mühlbauer, E. (2017) ‘Archivnutzung ohne Limit. Digitalisierung, Onlinestellung und das Projekt READ für barrierefreies Forschen’, Der Archivar, Zeitschrift für Archivwesen 70, pp. 422–7.

Giotis, A., Sfikas, G., Gatos, B. and Nikou, C. (2017) ‘A survey of document image word spotting techniques’, Pattern Recognition 68, pp. 310–32 (DOI: 10.1016/j.patcog.2017.02.023).

Granell, E. and Martínez-Hinarejos, C. (2017) ‘Multimodal Crowdsourcing for Transcribing Handwritten Documents’, IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (2), pp. 409–19 (DOI: 10.1109/taslp.2016.2634123).

Granell, E., Romero, V. and Martínez-Hinarejos, C. (2018) ‘Multimodality, interactivity, and crowdsourcing for document transcription’, Computational Intelligence 34 (2), pp. 398–419 (DOI: 10.1111/coin.12169).

Grüning, T., Leifert, G., Strauß, T. and Labahn, R. (2018) ‘A Two-Stage Method for Text Line Detection in Historical Documents’, arXiv.

Hernández Tornero, C., Romero Gómez, V., Sánchez, J.A., Toselli Rossi, A.H., and Vidal Ruiz, E. (2018) ‘Indexación y reconocimiento automático de texto manuscrito’, Cuadernos AISPI: Estudios De Lenguas Y Literaturas Hispánicas 11, pp. 131–46.

Mühlberger, G. (2017) ‘Archiv 4.0 oder warum die automatisierte Texterkennung alles verändern wird’, Tagungsband Archivtag Wolfsburg 87, Verband Deutscher Archivare.

Puigcerver, J., Toselli, A. and Vidal, E. (2016) ‘Querying out-of-vocabulary words in lexicon-based keyword spotting’, Neural Computing and Applications 28 (9), pp. 2373–82 (DOI: 10.1007/s00521-016-2197-8).

Retsinas, G., Sfikas, G., and Gatos, B. (2017) ‘Transferable Deep Features for Keyword Spotting’, Proceedings 2 (2), p. 89 (DOI: 10.3390/proceedings2020089).

Sánchez, J., Rocha, M., Romero, V. and Villegas, M. (2018) ‘On the Derivational Entropy of Left-to-Right Probabilistic Finite-State Automata and Hidden Markov Models’, Computational Linguistics 44 (1), pp. 17–37 (DOI: 10.1162/COLI_a_00306).

Strauß, T., Leifert, G., Grüning, T. and Labahn, R. (2016) ‘Regular expressions for decoding of neural network outputs’, Neural Networks 79, pp. 1–11 (DOI: 10.1016/j.neunet.2016.03.003).

Toselli, A., Leiva, L., Bordes-Cabrera, I., Hernández-Tornero, C., Bosch, V. and Vidal, E. (2017) ‘Transcribing a 17th-century botanical manuscript: Longitudinal evaluation of document layout detection and interactive transcription’, Digital Scholarship in the Humanities 33 (1), pp. 173–202 (DOI: 10.1016/j.ins.2016.07.063).

Toselli, A., Romero, V. and Vidal, E. (2016) ‘Word graphs size impact on the performance of handwriting document applications’, Neural Computing and Applications 28 (9), pp. 2477–87 (DOI: 10.1007/s00521-016-2336-2).

Zagoris, K., Pratikakis, I. and Gatos, B. (2017) ‘Unsupervised Word Spotting in Historical Handwritten Document Images Using Document-Oriented Local Features’, IEEE Transactions on Image Processing 26 (8), pp. 4032–41 (DOI: 10.1109/TIP.2017.2700721).

Book Chapters

Barlas, G., Zagoris, K. and Pratikakis, I. (2017) ‘Handwritten keyword spotting – The Query by Example (QbE) case’, in Handwriting: Recognition, Development and Analysis, eds. Dantas, B.L., Bezerra, C.Z., Toselli, A.H., and Pirlo, G. (Nova Science Publishers), ISBN: 978-1-53611-937-4.

Fawzi, A., Pastor, M., Martínez-Hinarejos, C.D. (2017) ‘Baseline Detection on Arabic Handwritten Documents’, in Proceedings of the ACM Symposium on Document Engineering, pp. 193–6, ISBN: 978-1-4503-4689-4.

Gatos, B., Louloudis, G., Stamatopoulos, N. and Sfikas, G. (2017) ‘Historical Document Processing’, in Handwriting: Recognition, Development and Analysis, eds. Dantas, B.L., Bezerra, C.Z., Toselli, A.H., and Pirlo, G. (Nova Science Publishers), ISBN: 978-1-53611-937-4.

Granell, E., Romero, V. and Martínez Hinarejos, C.D. (2017) ‘Using Speech and Handwriting in an Interactive Approach for Transcribing Historical Documents’, in Handwriting: Recognition, Development and Analysis, eds. Dantas, B.L., Bezerra, C.Z., Toselli, A.H., and Pirlo, G. (Nova Science Publishers), ISBN: 978-1-53611-937-4.

Louloudis, G., Stamatopoulos, N. and Gatos, B. (2017) ‘Handwriting Segmentation’, in Document Analysis and Text Recognition: Benchmarking State-of-the-Art Systems (World Scientific Publishing Co.), ISBN: 978-981-3229-26-6.

Louloudis, G., Stamatopoulos, N. and Gatos, B. (2017) ‘Writer Identification’, in Document Analysis and Text Recognition: Benchmarking State-of-the-Art Systems (World Scientific Publishing Co.), ISBN: 978-981-3229-26-6.

Noya-García, E., Toselli, A.H., Vidal, E. (2017) ‘Simple and Effective Multi-word Query Spotting in Handwritten Text Images’, in 8th Iberian Conference on Pattern Recognition and Image Analysis, pp. 76–84 (Springer International Publishing), ISBN: 978-3319989310.

Quirós, L., Martínez-Hinarejos, C.D., Toselli, A.H., Vidal, E. (2017) ‘Interactive Layout Detection’, in 8th Iberian Conference on Pattern Recognition and Image Analysis, pp. 161–8 (Springer International Publishing), ISBN: 978-3319989310.

Romero, V., Bosch, V., Hernández, C., Vidal, E., Sánchez, J.A. (2017) ‘A Historical Document Handwriting Transcription End-to-end System’, in 8th Iberian Conference on Pattern Recognition and Image Analysis, pp. 149–57 (Springer International Publishing), ISBN: 978-3319989310.

Romero, V., Fornés, A., Vidal, E., Sánchez, J.A. (2017) ‘Information Extraction in Handwritten Marriage Licenses Books Using the MGGI Methodology’, in 8th Iberian Conference on Pattern Recognition and Image Analysis, pp. 287–94 (Springer International Publishing), ISBN: 978-3319989310.

Sánchez, J.A., Romero, V., Toselli, A.H. and Vidal, E. (2018) ‘Handwritten Text Recognition Competitions with the tranScriptorium Dataset’, in Document Analysis and Text Recognition, pp. 213–39, ISBN: 978-9813229266.

Romero, V., Bosch, V., Hernández, C., Vidal, E., Sánchez, J.A. (2017) ‘A Historical Document Handwriting Transcription End-to-end System’, Pattern Recognition and Image Analysis. IbPRIA, Lecture Notes in Computer Science, vol. 10,255, pp. 149–57, ISBN: 978-3642386275.

Vidal, E. (2017) ‘Advances in Handwritten Keyword Indexing and Search Technologies’, in Kodikologie und Paläographie im Digitalen Zeitalter 4 – Codicology and Palaeography in the Digital Age 4. Schriften des Instituts für Dokumentologie und Editorik 11 (Books on Demand, Norderstedt), pp. 103–19, ISBN: 978-3-7448-3877-1.

Villegas, M., Müller, H., Garcia Seco de Herrera, A., Schaer, R., Bromuri, S., Gilbert, A., Piras, L., Wang, J., Yan, F., Ramisa, A., Dellandrea, E., Gaizauskas, R., Mikolajczyk, K., Puigcerver, J., Toselli, A.H., Sánchez, J.A. and Vidal, E. (2016) ‘General Overview of ImageCLEF at the CLEF 2016 Labs’, in Experimental IR Meets Multilinguality, Multimodality, and Interaction, Lecture Notes in Computer Science, vol. 9,822, pp. 267–85 (Springer International Publishing), ISBN: 978-3319989310.

Conference Proceedings

Bluche, T., Hamel, S., Kermorvant, C., Puigcerver, J., Stutzmann, D., Toselli, A.H., Vidal, E. (2017) ‘Preparatory KWS Experiments for Large-Scale Indexing of a Vast Medieval Manuscript Collection in the HIMANIS Project’, ICDAR, pp. 312–17 (DOI: 10.1109/ICDAR.2017.59).

Bosch, V., Romero, V., Toselli, A.H., Vidal, E. (2018) ‘Text Line Extraction Based on Distance Map Features and Dynamic Programming’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 357–62 (DOI: 10.1109/ICFHR-2018.2018.00069).

Bryan, M., Hodel, T., Philipp, N. (2018) ‘Generierung von Trainingsdaten für die Handschrifterkennung aus TEI annotierten Dokumenten – Ein Erfahrungsbericht aus dem EU-Projekt READ’, GI-Workshop: Im Spannungsfeld zwischen Tool-Building und Forschung auf Augenhöhe – Informatik und die Digital Humanities.

Martínez-Hinarejos, C.D., Granell-Romero, E. and Romero-Gómez, V. (2018) ‘Comparing different feedback modalities in assisted transcription of manuscripts’, IAPR International Workshop on Document Analysis Systems (DAS), pp. 115–20 (DOI: 10.1109/DAS.2018.13).

Calvo-Zaragoza, J. (2017) ‘Handwritten Music Recognition for Mensural Notation: Formulation, Data and Baseline Results’, ICDAR, pp. 1081–6 (DOI: 10.1109/ICDAR.2017.179).

Calvo-Zaragoza, J., Toselli, A.H. and Vidal, E. (2016) ‘Early Handwritten Music Recognition with Hidden Markov Models’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 319–24 (DOI: 10.1109/ICFHR.2016.0067).

Calvo-Zaragoza, J., Toselli, A.H., Vidal, E. (2018) ‘Probabilistic Music-Symbols Spotting in handwritten scores’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 558–63 (DOI: 10.1109/ICFHR-2018.2018.00103).

Christlein, V., Gropp, M., Fiel, S. and Maier, A. (2017) ‘Unsupervised Feature Learning for Writer Identification and Writer Retrieval’, ICDAR, pp. 991–7 (DOI: 10.1109/ICDAR.2017.165).

Diem, M., Kleber, F., Fiel, S., Grüning, T. and Gatos, B. (2017) ‘cBAD: Competition on Baseline Detection’, ICDAR, pp. 1355–60 (DOI: 10.1109/ICDAR.2017.222).

Lang, E., Puigcerver, J., Toselli, A.H. and Vidal, E. (2018) ‘Probabilistic Indexing and Search for Information Extraction on Handwritten German Parish Records’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 44–9 (DOI: 10.1109/ICFHR-2018.2018.00017).

Fiel, S., Kleber, F., Diem, M., Christlein, V., Louloudis, G., Stamatopoulos, N. and Gatos, B. (2017) ‘Competition on Historical Document Writer Identification’, ICDAR, pp. 1377–82 (DOI: 10.1109/ICDAR.2017.225).

Fornés, A., Romero, V., Baró, A., Toledo, J.I., Sánchez, J.A., Vidal, E., Lladós, J. (2017) ‘Competition on Information Extraction in Historical Handwritten Records’, ICDAR, pp. 1389–94 (DOI: 10.1109/ICDAR.2017.227).

Gatos, B., Louloudis, G., Stamatopoulos, N. and Sfikas, G. (2017) ‘Historical Document Processing’, 17th ACM Symposium on Document Engineering.

Granell-Romero, E., Martínez-Hinarejos, C.D., Romero-Gómez, V. (2018) ‘Advances on the Transcription of Historical Manuscripts based on Multimodality, Interactivity and Crowdsourcing’, IberSPEECH 2018, pp. 174–8 (DOI: 10.21437/IberSPEECH.2018-35).

Grüning, T., Labahn, R., Diem, M., Kleber, F. and Fiel, S. (2018) ‘READ-BAD: A New Dataset and Evaluation Scheme for Baseline Detection in Archival Documents’, IAPR International Workshop on Document Analysis Systems (DAS), pp. 351–6.

Kahle, P., Colutto, S., Hackl, G. and Mühlberger, G. (2017) ‘Transkribus – a Platform for Transcription, Recognition and Retrieval of Document Images’, IAPR International Conference on Document Analysis and Recognition (ICDAR), IEEE, pp. 19–24 (DOI: 10.1109/ICDAR.2017.307).

Kleber, F., Diem, M. and Sablatnig, R. (2016) ‘Accuracy of Gradient based Skew Estimation’, IAPR International Workshop on Document Analysis Systems (DAS), pp. 19–20.

Kleber, F., Diem, M., Dejean, H., Meunier, J.-L., Lang, E. (2018) ‘Matching Table Structures of Historical Register Books using Association Graphs’, 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), IEEE, pp. 217–22 (DOI: 10.1109/ICFHR-2018.2018.00046).

Kleber, F., Diem, M., Hollaus, F. and Fiel, S. (2017) ‘Mass Digitization of Archival Documents using Mobile Phones’, IAPR International Conference on Document Analysis and Recognition (ICDAR), pp. 65–70.

Martín-Albo, D., Leiva, L.A. and Plamondon, R. (2016) ‘On the Design of Personal Digital Bodyguards: Impact of Hardware Resolution on Handwriting Analysis’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 174–9 (DOI: 10.1109/ICFHR.2016.0043).

Meunier, J.-L. (2017) ‘Joint Structured Learning and Predictions under Logical Constraints in Conditional Random Fields’, CAp 2017, Conférence sur l’apprentissage.

Meunier, J.-L. (2017) ‘PyStruct Extension for Typed CRF Graphs’, IAPR International Conference on Document Analysis and Recognition (ICDAR), IEEE (DOI: 10.1109/ICDAR.2017.305).

Meunier, J.-L. and Déjean, H. (2017) ‘Transkribus Python Toolkit’, IAPR International Conference on Document Analysis and Recognition (ICDAR), IEEE.

Oliveira, S.A., di Lenardo, I., Kaplan, F. (2017) ‘Machine Vision algorithms on cadaster plans’, Premiere Annual Conference of the International Alliance of Digital Humanities Organizations.

Pratikakis, I., Zagoris, K., Barlas, G. and Gatos, B. (2017) ‘Competition on Document Image Binarization (DIBCO 2017)’, IAPR International Conference on Document Analysis and Recognition (ICDAR), pp. 1395–1403 (DOI: 10.1109/ICDAR.2017.90).

Pratikakis, I., Zagoris, K., Gatos, B., Puigcerver, J., Toselli, A.H., Vidal, E. (2016) ‘Handwritten Keyword Spotting Competition (H-KWS 2016)’, ICFHR, pp. 613–18 (DOI: 10.1109/ICFHR.2016.0117).

Puigcerver, J. (2017) ‘Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?’, ICDAR, pp. 67–72 (DOI: 10.1109/ICDAR.2017.20).

Quirós, L., Bosch, V., Serrano, L., Toselli, A.H., Vidal, E. (2018) ‘From HMMs to RNNs: Computer-assisted Transcription of a Handwritten Notarial Records Collection’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 116–21 (DOI: 10.1109/ICFHR-2018.2018.00029).

Retsinas, G., Stamatopoulos, N., Louloudis, G., Sfikas, G. and Gatos, B. (2017) ‘Nonlinear Manifold Embedding on Keyword Spotting Using t-SNE’, ICDAR, pp. 487–92 (DOI: 10.1109/ICDAR.2017.86).

Romero, V., Fornés, A., Sánchez, J.A. and Vidal, E. (2016) ‘Using the MGGI Methodology for Category-based Language Modeling in Handwritten Marriage Licenses Books’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 331–6 (DOI: 10.1109/ICFHR.2016.0069).

Romero, V., Toselli, A.H., Bosch, V., Sánchez, J.A., Vidal, E. (2018) ‘Automatic Alignment of Handwritten Images and Transcripts for Training Handwritten Text Recognition Systems’, IAPR International Workshop on Document Analysis Systems (DAS), pp. 328–33 (DOI: 10.1109/DAS.2018.41).

Romero, V., Toselli, A.H., Sánchez, J.A. and Vidal, E. (2016) ‘Handwriting Transcription and Keyword Spotting in Historical Daily Records Documents’, IAPR International Workshop on Document Analysis Systems (DAS), pp. 275–80 (DOI: 10.1109/DAS.2016.70).

Romero, V., Vidal, E., Toselli, A.H., Puigcerver, J. and Leiva, L.A. (2016) ‘Computer Assisted Transcription and Indexing of Handwritten Historical Documents Demonstration’, IAPR International Workshop on Document Analysis Systems (DAS).

Sánchez, J.A. and Pal, U. (2016) ‘Handwritten Text Recognition for Bengali’, 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 542–7 (DOI: 10.1109/ICFHR.2016.0105).

Sánchez, J.A., Romero, V., Toselli, A.H., Vidal, E. (2016) ‘Competition on Handwritten Text Recognition on the READ Dataset’, ICFHR, pp. 630–35 (DOI: 10.1109/ICFHR.2016.0120).

Sánchez, J.A., Romero, V., Toselli, A.H., Villegas, M., Vidal, E. (2017) ‘Competition on Handwritten Text Recognition on the READ Dataset’, ICDAR, pp. 1383–8 (DOI: 10.1109/ICDAR.2017.226).

Sfikas, G., Gatos, B., and Nikou, C. (2017) ‘SemiCCA: A New Semi-Supervised Probabilistic CCA Model for Keyword Spotting’, ICIP (DOI: 10.1109/ICIP.2017.8296453).

Sfikas, G., Retsinas, G. and Gatos, B. (2017) ‘A PHOC Decoder for Lexicon-Free Handwritten Word Recognition’, ICDAR, pp. 513–18 (DOI: 10.1109/ICDAR.2017.90).

Strauß, T., Grüning, T., Leifert, G., Labahn, R. (2016) ‘CITlab ARGUS for Keyword Search in Historical Handwritten Documents: Description of CITlab’s System for the ImageCLEF 2016 Handwritten Scanned Document Retrieval Task’, CLEF2016 Working Notes, vol. 1,609, pp. 399–412.

Strauß, T., Weidemann, M., Michael, J., Leifert, G., Grüning, T. and Labahn, R. (2017) ‘System Description of CITlab’s Recognition & Retrieval Engine for ICDAR2017 Competition on Information Extraction in Historical Handwritten Records’, pp. 1–4.

Toselli, A.H., Puigcerver, J., Vidal, E. (2016) ‘Two Methods to Improve Confidence Scores for Lexicon-free Word Spotting in Handwritten Text’, ICFHR, pp. 349–54 (DOI: 10.1109/ICFHR.2016.0072).

Toselli, A.H., Granell, E., Puigcerver, J. (2018) ‘Keyword Spotting for Large-Scale Indexing and Search in Massive Document Collections’, IAPR International Workshop on Document Analysis Systems (DAS).

Villegas, M., Puigcerver, J., Toselli, A.H., Sánchez, J.A. and Vidal, E. (2016) ‘Overview of the ImageCLEF 2016 Handwritten Scanned Document Retrieval Task’, CLEF, CEUR Workshop Proceedings, pp. 233–53.

Villegas, M., Toselli, A.H., Romero, V. and Vidal, E. (2016) ‘Exploiting Existing Modern Transcripts for Historical Handwritten Text Recognition’, International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 66–71 (DOI: 10.1109/ICFHR.2016.0025).

Zagoris, K. and Pratikakis, I. (2017) ‘Bio-Inspired Modeling for the Enhancement of Historical Handwritten Documents’, ICDAR, IEEE Computer Society Press, pp. 288–93 (DOI: 10.1109/ICDAR.2017.55).

Other

Strauß, T. (2017) ‘Decoding the output of neural networks – A discriminative approach’, Universität Rostock, thesis (DOI: 10.18453/rosdok_id00001919).