Knowledge Commons of Institute of Automation,CAS
A robust approach to text line grouping in online handwritten Japanese documents | |
Zhou, Xiang-Dong; Wang, Da-Han; Liu, Cheng-Lin![]() | |
发表期刊 | PATTERN RECOGNITION
![]() |
2009-09-01 | |
卷号 | 42期号:9页码:2077-2088 |
文章类型 | Article |
摘要 | In this paper, we present an effective approach for grouping text lines in online handwritten Japanese documents by combining temporal and spatial information. With decision functions optimized by supervised learning, the approach has few artificial parameters and utilizes little prior knowledge. First, the strokes in the document are grouped into text line strings according to off-stroke distances. Each text line string, which may contain multiple lines, is segmented by optimizing a cost function trained by the minimum classification error (MCE) method. At the temporal merge stage, over-segmented text lines (caused by stroke classification errors) are merged with a support vector machine (SVM) classifier for making merge/non-merge decisions. Last, a spatial merge module corrects the segmentation errors caused by delayed strokes. Misclassified text/non-text strokes (stroke type classification precedes text line grouping) can be corrected at the temporal merge stage. To evaluate the performance of text line grouping, we provide a set of performance metrics for evaluating from multiple aspects. In experiments on a large number of free form documents in the Tokyo University of Agriculture and Technology (TUAT) Kondate database, the proposed approach achieves the entity detection metric (EDM) rate of 0.8992 and the edit-distance rate (EDR) of 0.1114. For grouping of pure text strokes, the performance reaches EDM of 0.9591 and EDR of 0.0669. (C) 2008 Elsevier Ltd. All rights reserved. |
关键词 | Online Handwritten Documents Text Line Grouping Mce Training Temporal Merge Spatial Merge |
WOS标题词 | Science & Technology ; Technology |
关键词[WOS] | SPEECH RECOGNITION |
收录类别 | SCI |
语种 | 英语 |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000267089000036 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/3058 |
专题 | 多模态人工智能系统全国重点实验室_模式分析与学习 |
作者单位 | Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China |
第一作者单位 | 模式识别国家重点实验室 |
推荐引用方式 GB/T 7714 | Zhou, Xiang-Dong,Wang, Da-Han,Liu, Cheng-Lin. A robust approach to text line grouping in online handwritten Japanese documents[J]. PATTERN RECOGNITION,2009,42(9):2077-2088. |
APA | Zhou, Xiang-Dong,Wang, Da-Han,&Liu, Cheng-Lin.(2009).A robust approach to text line grouping in online handwritten Japanese documents.PATTERN RECOGNITION,42(9),2077-2088. |
MLA | Zhou, Xiang-Dong,et al."A robust approach to text line grouping in online handwritten Japanese documents".PATTERN RECOGNITION 42.9(2009):2077-2088. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论