CASIA OpenIR  > 09年以前成果
Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern
Xia, Yong; Xiao, Bai-Hua; Wang, Chun-Heng; Li, Yao-Dong; Huang, DS; Li, K; Irwin, GW
2006
发表期刊INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION
卷号345页码:497-506
文章类型Article
摘要Segmentation based on character recognition is one of the most popular methods of segmenting mixed Chinese/English documents. However, the rejection to outliers is always the bottleneck of this method. A new method is provided to alleviate the problem in this paper. We will give language attribute of each segment as possible as we can and then merge or split segment according to the language attribute. First of all, we construct a mixed OCR engine for Chinese radical and English character and some English character-pairs. Furthermore, English negative samples are trained to improve the capability of rejection to outliers. Finally, language determination of segments based on the mixed OCR engine and complexity analysis of local pattern is conducted. Encouraging performance has been obtained according to the test results.
WOS标题词Science & Technology ; Technology
关键词[WOS]MULTILAYER PERCEPTRONS
收录类别ISTP ; SCI
语种英语
WOS研究方向Automation & Control Systems ; Computer Science ; Engineering
WOS类目Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Engineering, Electrical & Electronic
WOS记录号WOS:000240385300051
引用统计
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/9235
专题09年以前成果
作者单位Chinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China
推荐引用方式
GB/T 7714
Xia, Yong,Xiao, Bai-Hua,Wang, Chun-Heng,et al. Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern[J]. INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION,2006,345:497-506.
APA Xia, Yong.,Xiao, Bai-Hua.,Wang, Chun-Heng.,Li, Yao-Dong.,Huang, DS.,...&Irwin, GW.(2006).Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern.INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION,345,497-506.
MLA Xia, Yong,et al."Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern".INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION 345(2006):497-506.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Xia, Yong]的文章
[Xiao, Bai-Hua]的文章
[Wang, Chun-Heng]的文章
百度学术
百度学术中相似的文章
[Xia, Yong]的文章
[Xiao, Bai-Hua]的文章
[Wang, Chun-Heng]的文章
必应学术
必应学术中相似的文章
[Xia, Yong]的文章
[Xiao, Bai-Hua]的文章
[Wang, Chun-Heng]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。