Loading…

Historical Documents and automatic text recognition

With this special issue of the Journal of Data Mining and Digital Humanities (JDMDH), we bring together in one single volume several experiments, projects and reflections related to automatic text recognition on Historical documents.Many projects now include automatic text acquisition in their data...

Full description

Saved in:
Bibliographic Details
Published in:Journal of Data Mining and Digital Humanities 2024
Main Authors: Pinche, Ariane, Stokes, Peter Anthony
Format: Text Resource
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:With this special issue of the Journal of Data Mining and Digital Humanities (JDMDH), we bring together in one single volume several experiments, projects and reflections related to automatic text recognition on Historical documents.Many projects now include automatic text acquisition in their data processing chain. The integration of this technology into increasingly powerful processing chains has led to an automation of tasks that affects the role of the researcher in the textual production process. This new data-intensive practice makes it urgent to collect and harmonise the corpora necessary for the constitution of training sets, but also tomake them available for exploitation. This issue is an opportunity to propose articles combining philological and technical questions to make a scientific assessment of the use of automatic text recognition for ancient documents, its results, its contributions and the new practices induced by its use in the processof editing and exploring texts. We hope that practical aspects will be questioned on this occasion, while raising methodological challenges and its impact on research data.The special issue on Automatic Text Recognition (ATR) is dedicated to providing a comprehensive overview of the use of ATR in the humanities field, particularly concerning historical documents in the early 2020s. This issue presents a fusion of engineering and philological aspects, catering to both beginners and experienced users interested in launching projects with ATR. The collection encompasses a diverse array of approaches, covering topics such as data creation or collection for training generic models, reaching specific objectives, technical and HTR machine architecture, segmentation methods, and image processing. Grâce à ce numéro spécial du Journal of Data Mining and Digital Humanities (JDMDH), nous rassemblons en un seul volume plusieurs expériences, projets et réflexions liés à la reconnaissance automatique de texte sur des documents historiques.De nombreux projets incluent désormais l'acquisition automatique de textes dans leur chaîne de traitement des données. L'intégration de cette technologie dans des chaînes de traitement de plus en plus performantes a conduit à une automatisation des tâches qui affecte le rôle du chercheur dans le processus de production textuelle. Cette nouvelle pratique gourmande en données rend urgente la collecte et l'harmonisation des corpus nécessaires à la constitution de jeux d'entraînement, mais
ISSN:2416-5999