论文标题
随机线条擦除和课程学习的日本日本kuzushiji文档的自动转录
Automated Transcription for Pre-Modern Japanese Kuzushiji Documents by Random Lines Erasure and Curriculum Learning
论文作者
论文摘要
由于复杂的布局/背景和编写样式的难度,例如草书和连接的角色,因此认识到日本历史文档的整页是一个具有挑战性的问题。以前的大多数方法将识别过程分为字符分割和识别。但是,这些方法仅提供字符边界框和类,而无需文本转录。在本文中,我们将以前的人类识别系统从多行扩大到Kuzushiji文档的整页。人类风格的识别系统模拟了阅读过程中的人眼运动。对于缺乏培训数据,我们提出了一种随机的文本擦除方法,该方法随机擦除文本线并扭曲文档。对于全页文档的识别系统的收敛问题,我们采用课程学习,将识别系统从简单级别(几个文本文档)到困难级别(全页文档)逐步训练识别系统。我们在Kaggle的Kuzushiji识别竞赛数据集上测试了步骤训练方法和随机文本擦除方法。实验的结果证明了我们提出的方法的有效性。这些结果与Kuzushiji认可竞赛的其他参与者竞争。
Recognizing the full-page of Japanese historical documents is a challenging problem due to the complex layout/background and difficulty of writing styles, such as cursive and connected characters. Most of the previous methods divided the recognition process into character segmentation and recognition. However, those methods provide only character bounding boxes and classes without text transcription. In this paper, we enlarge our previous humaninspired recognition system from multiple lines to the full-page of Kuzushiji documents. The human-inspired recognition system simulates human eye movement during the reading process. For the lack of training data, we propose a random text line erasure approach that randomly erases text lines and distorts documents. For the convergence problem of the recognition system for fullpage documents, we employ curriculum learning that trains the recognition system step by step from the easy level (several text lines of documents) to the difficult level (full-page documents). We tested the step training approach and random text line erasure approach on the dataset of the Kuzushiji recognition competition on Kaggle. The results of the experiments demonstrate the effectiveness of our proposed approaches. These results are competitive with other participants of the Kuzushiji recognition competition.