CASIA-onDo: A New Database for Online Handwritten Document Analysis
Yu-Ting Yang (1,2); Yan-Ming Zhang (1); Xiao-Long Yun (1,2); Fei Yin (1); Cheng-Lin Liu (1,2)
2022-04-10
Conference name: Asian Conference on Pattern Recognition
Conference date: 2021-12-09
Conference location: Jeju Island, South Korea
Publisher: Lecture Notes in Computer Science
Abstract

In this paper, we introduce an online handwritten document database (CASIA-onDo), which serves as a standard database for developing and evaluating methods for online handwritten document layout analysis. It consists of 2,012 documents containing a total of 841,159 online strokes. The database covers Chinese and English and was produced by 200 writers. Six types of content occur in the documents: text, formulas, diagrams, tables, figures, and lists. The distribution of these types is close to that found in practice. Thanks to its detailed annotations, CASIA-onDo can support different layout analysis tasks in online or offline settings. First, based on the semantic-level annotation, it can be used for classification tasks such as text/non-text classification, table/non-table classification, and multi-class stroke classification. Second, based on the instance-level annotation, it can be used for segmentation tasks such as text line separation and formula segmentation. Third, owing to its varied writing styles, it can be used for handwriting recognition and writer clustering. In addition, we perform preliminary experiments to provide a benchmark on this database with a state-of-the-art method. More techniques can be evaluated on this challenging database in the future.

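Keywords: Online handwritten document; Document layout analysis; Stroke classification; Database
Indexed by: EI
Document type: Conference paper
Item identifier: http://ir.ia.ac.cn/handle/173211/48859
Collection: State Key Laboratory of Multimodal Artificial Intelligence Systems_Pattern Analysis and Learning
Author affiliations: 1. National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences
2. School of Artificial Intelligence, University of Chinese Academy of Sciences
First author affiliation: National Laboratory of Pattern Recognition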
Recommended citation
GB/T 7714
Yu-Ting Yang, Yan-Ming Zhang, Xiao-Long Yun, et al. CASIA-onDo: A New Database for Online Handwritten Document Analysis[C]: Lecture Notes in Computer Science, 2022.
Files in this item
File name/size: CASIA-onDo A New Dat (2006KB)
Document type: Conference paper
Access type: Open access
License: CC BY-NC-SA
File name: CASIA-onDo A New Database for Online Handwritten Document Analysis.pdf
Format: Adobe PDF

Unless otherwise stated, all content in this system is protected by copyright, and all rights are reserved.