CASIA OpenIR  > 模式识别国家重点实验室  > 模式分析与学习
Simultaneous Script Identification and Handwriting Recognition via Multi-Task Learning of Recurrent Neural Networks
Chen, Zhuo1,2; Wu, Yichao1,2; Yin, Fei1; Liu, Chenglin1,2
2017
Conference NameInternational Conference on Document Analysis and Recognition
Conference Date2017.11.9-15
Conference PlaceKyoto, Japan
AbstractIn this paper, we propose a method for simultaneous script identification and handwritten text line recognition in multi-task learning framework. Firstly, we use Separable Multi-Dimensional Long Short-Term Memory (SepMDLSTM) to encode the input text line images based on convolutional feature extraction. Then, the extracted features are fed into two classification modules for script identification and multi-script text recognition, respectively. All the network parameters are trained end-to-end by multi-task learning where the script identification task and the text recognition task are aimed to minimize the Negative Log Likelihood (NLL) loss and Connectionist Temporal Classification (CTC) loss, respectively. We evaluated the performance of the proposed method on handwritten text line datasets of three languages, namely, IAM (English), Rimes (French) and IFN/ENIT (Arabic). Experimental results demonstrate the multitask learning framework performs superiorly for both script identification and text recognition. Particularly, the accuracy of script identification is higher than 99.9% and the character error rate (CER) of text recognition is even lower than that of some single-script text recognition systems.
Other Abstract该工作提出了一种在多任务学习框架(multi-task)中同时进行语种分类与文本识别的方法。
首先,我们使用卷积神经网络与可分离的MDLSTM对输入的文本行图像进行编码。然后提取出的特征分别输入到语种分类模块与多语种文本识别模块中。
网络的全部参数都是通过多任务学习进行端到端的更新,其中语种分类模块与文本识别模块的损失函数分别为负似然对数(NLL)与连接时序分类(CTC)。
在IAM(英语)、Rimes(法语)和IFN/ENIT(阿拉伯语)三个数据集进行实验,我们的网络的语种分类正确率为99.92%,字符集识别正确率分别为88.85,91.71,85.06。
KeywordMulti-task Learning Sepmdlstm Script Identification Language Identification Handwritten Text Recognition
Language英语
Document Type会议论文
Identifierhttp://ir.ia.ac.cn/handle/173211/20012
Collection模式识别国家重点实验室_模式分析与学习
Affiliation1.中国科学院自动化研究所
2.中国科学院大学
Recommended Citation
GB/T 7714
Chen, Zhuo,Wu, Yichao,Yin, Fei,et al. Simultaneous Script Identification and Handwriting Recognition via Multi-Task Learning of Recurrent Neural Networks[C],2017.
Files in This Item: Download All
File Name/Size DocType Version Access License
ICDAR2017-Simultaneo(467KB)会议论文 开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Chen, Zhuo]'s Articles
[Wu, Yichao]'s Articles
[Yin, Fei]'s Articles
Baidu academic
Similar articles in Baidu academic
[Chen, Zhuo]'s Articles
[Wu, Yichao]'s Articles
[Yin, Fei]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Chen, Zhuo]'s Articles
[Wu, Yichao]'s Articles
[Yin, Fei]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: ICDAR2017-Simultaneous Script Identification and Recognition via MTL of RNN-ChenZ.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.