Knowledge Commons of Institute of Automation, CAS
CASIA-onDo: A New Database for Online Handwritten Document Analysis
Yu-Ting Yang (1,2); Yan-Ming Zhang (1); Xiao-Long Yun (1,2); Fei Yin (1); Cheng-Lin Liu (1,2)
2022-04-10
Conference name | Asian Conference on Pattern Recognition |
Conference date | 2021-12-09 |
Conference location | Jeju Island, South Korea |
Publisher | Lecture Notes in Computer Science |
Abstract | In this paper, we introduce an online handwritten document database (CASIA-onDo) that serves as a standard database for developing and evaluating methods for online handwritten document layout analysis. It consists of 2,012 documents containing a total of 841,159 online strokes. The database covers Chinese and English and was produced by 200 writers. Six types of content occur in the documents: text, formulas, diagrams, tables, figures, and lists. The distribution of the content types is close to that found in real documents. Thanks to its detailed annotations, CASIA-onDo supports different layout analysis tasks in both online and offline settings. First, the semantic-level annotation supports classification tasks such as text/non-text classification, table/non-table classification, and multi-class stroke classification. Second, the instance-level annotation supports segmentation tasks such as text line separation and formula segmentation. Third, the variety of writing styles supports handwriting recognition and writer clustering tasks. In addition, we perform preliminary experiments with a state-of-the-art method to provide a benchmark on this database. More techniques can be evaluated on this challenging database in the future. |
Keywords | Online handwritten document; Document layout analysis; Stroke classification; Database |
Indexed by | EI |
Document type | Conference paper |
Item identifier | http://ir.ia.ac.cn/handle/173211/48859 |
Collection | State Key Laboratory of Multimodal Artificial Intelligence Systems_Pattern Analysis and Learning |
Author affiliations | 1. National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences; 2. School of Artificial Intelligence, University of Chinese Academy of Sciences |
First author affiliation | National Laboratory of Pattern Recognition |
Recommended citation (GB/T 7714) | Yu-Ting Yang, Yan-Ming Zhang, Xiao-Long Yun, et al. CASIA-onDo: A New Database for Online Handwritten Document Analysis[C]: Lecture Notes in Computer Science, 2022. |
Files in this item |
File name/size | Document type | Version type | Access type | License |
CASIA-onDo A New Dat(2006KB) | Conference paper | | Open access | CC BY-NC-SA |