Loading…
Multi-scene ancient chinese text recognition
•With the lack of related research, the problem of multi-scene ancient Chinese text recognition (MACR) is first proposed.•An multi-scene ancient Chinese text (MACT) dataset is established, which includes a testing set collected from real scenes and a synthetic training set.•A baseline method is used...
Saved in:
Published in: | Neurocomputing (Amsterdam) 2020-02, Vol.377, p.64-72 |
---|---|
Main Authors: | , , , , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | •With the lack of related research, the problem of multi-scene ancient Chinese text recognition (MACR) is first proposed.•An multi-scene ancient Chinese text (MACT) dataset is established, which includes a testing set collected from real scenes and a synthetic training set.•A baseline method is used for experiments with 66.94% top-1 accuracy, which far surpasses subjective human recognition performance, and an improved method based on multi-model ensemble (MME) is proposed, which achieves 73.36% top-1 accuracy.
Multi-scene ancient Chinese text recognition (MACR) is a challenging task for ordinary people without relevant professional knowledge. Due to language barriers and the lack of related open datasets, there is little research on MACR. In this paper, a multi-scene ancient Chinese text (MACT) dataset, formed by handwritten text, calligraphy, natural scene text in ancient fonts, is established that includes synthetic samples generated for training and real scene samples collected for testing. We first perform experiments on some CNN structures as the baseline method, and the top-1 recognition result, 66.94%, is approximately 13.96% higher than subjective human recognition results. Furthermore, based on these models and confidence score from the baseline, a multi-model ensemble (MME) method is proposed, which adopts auxiliary datasets and a feature extraction method to augment data before training, utilizes different hyper-parameters to optimize in training, and integrates multiple models to improve recognition accuracy. The top-1 accuracy results of the MME method reach 73.36% and other top-n results also greatly surpass the baseline results. The MACT dataset is publicly available on the website11https://1drv.ms/u/s!AqAU14ep3HF7bhNy8KcOtjfEpbI. https://pan.baidu.com/s/1y-KdY-mzVfwDyKxNles6vw, Passward: 7dqu.. |
---|---|
ISSN: | 0925-2312 1872-8286 |
DOI: | 10.1016/j.neucom.2019.10.029 |