Knowledge Commons of Institute of Automation,CAS
Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents | |
Zhang, Heng; Wang, Da-Han; Liu, Cheng-Lin | |
发表期刊 | PATTERN RECOGNITION |
2014-05-01 | |
卷号 | 47期号:5页码:1880-1890 |
文章类型 | Article |
摘要 | In keyword spotting from handwritten documents by text query, the word similarity is usually computed by combining character similarities, which are desired to approximate the logarithm of the character probabilities. In this paper, we propose to directly estimate the posterior probability (also called confidence) of candidate characters based on the N-best paths from the candidate segmentation-recognition lattice. On evaluating the candidate segmentation-recognition paths by combining multiple contexts, the scores of the N-best paths are transformed to posterior probabilities using soft-max. The parameter of soft-max (confidence parameter) is estimated from the character confusion network, which is constructed by aligning different paths using a string matching algorithm. The posterior probability of a candidate character is the summation of the probabilities of the paths that pass through the candidate character. We compare the proposed posterior probability estimation method with some reference methods including the word confidence measure and the text line recognition method. Experimental results of keyword spotting on a large database CASIA-OLHWDB of unconstrained online Chinese handwriting demonstrate the effectiveness of the proposed method. (C) 2013 Elsevier Ltd. All rights reserved. |
关键词 | Online Chinese Handwritten Documents Keyword Spotting Posterior Probability N-best List Confidence Measure Confusion Network |
WOS标题词 | Science & Technology ; Technology |
关键词[WOS] | CONTINUOUS SPEECH RECOGNITION ; NETWORKS ; SEGMENTATION ; CONSENSUS ; STRATEGY |
收录类别 | SCI |
语种 | 英语 |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000331667400009 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/3084 |
专题 | 多模态人工智能系统全国重点实验室_模式分析与学习 |
作者单位 | Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China |
第一作者单位 | 模式识别国家重点实验室 |
推荐引用方式 GB/T 7714 | Zhang, Heng,Wang, Da-Han,Liu, Cheng-Lin. Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents[J]. PATTERN RECOGNITION,2014,47(5):1880-1890. |
APA | Zhang, Heng,Wang, Da-Han,&Liu, Cheng-Lin.(2014).Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents.PATTERN RECOGNITION,47(5),1880-1890. |
MLA | Zhang, Heng,et al."Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents".PATTERN RECOGNITION 47.5(2014):1880-1890. |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
PR-Nbest.pdf(3340KB) | 期刊论文 | 作者接受稿 | 开放获取 | CC BY-NC-SA | 浏览 下载 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论