Knowledge Commons of Institute of Automation,CAS
Cross-modal Prototype Learning for Zero-shot Handwriting Recognition | |
Ao, Xiang1,2![]() ![]() ![]() ![]() ![]() | |
2019-09 | |
会议名称 | 15th International Conference on Document Analysis and Recognition |
会议日期 | 20-25 Septemper 2019 |
会议地点 | Sydney, Australia |
摘要 | In contrast to machine recognizers that rely on training with large handwriting data, humans can recognize handwriting accurately on learning from few samples, and can even generalize to handwritten characters from printed samples. Simulating this ability in machine recognition is important to alleviate the burden of labeling large handwriting data, especially for large category set as in Chinese text. In this paper, inspired by human learning, we propose a cross-modal prototype learning (CMPL) method for zero-shot online handwritten character recognition: for unseen categories, handwritten characters can be recognized without learning from handwritten samples, but instead from printed characters. Particularly, the printed characters (one for each class) are embedded into a convolutional neural network (CNN) feature space to obtain prototypes representing each class, while the online handwriting trajectories are embedded with a recurrent neural network (RNN). Via cross-modal joint learning, handwritten characters can be recognized according to the printed prototypes. For unseen categories, handwritten characters can be recognized by only feeding a printed sample per category. Experiments on a benchmark Chinese handwriting database have shown the effectiveness and potential of the proposed method for zero-shot handwriting recognition. |
关键词 | printed character handwritten character cross-modal prototype learning zero-shot |
收录类别 | EI |
七大方向——子方向分类 | 文字识别与文档分析 |
国重实验室规划方向分类 | 小样本高噪声数据学习 |
是否有论文关联数据集需要存交 | 否 |
文献类型 | 会议论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/56731 |
专题 | 多模态人工智能系统全国重点实验室_模式分析与学习 |
作者单位 | 1.National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences, 95 Zhongguancun East Road, Beijing 100190, P.R. China 2.University of Chinese Academy of Sciences, Beijing, P.R. China 3.CAS Center for Excellence in Brain Science and Intelligence Technology, Beijing 100049, P.R. China |
第一作者单位 | 模式识别国家重点实验室 |
推荐引用方式 GB/T 7714 | Ao, Xiang,Zhang, Xu-Yao,Yang, Hong-Ming,et al. Cross-modal Prototype Learning for Zero-shot Handwriting Recognition[C],2019. |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
Cross-modal Prototyp(226KB) | 会议论文 | 开放获取 | CC BY-NC-SA | 浏览 下载 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论