CASIA OpenIR  > 09年以前成果
Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern
Xia, Yong; Xiao, Bai-Hua; Wang, Chun-Heng; Li, Yao-Dong; Huang, DS; Li, K; Irwin, GW
Source PublicationINTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION
2006
Volume345Pages:497-506
SubtypeArticle
AbstractSegmentation based on character recognition is one of the most popular methods of segmenting mixed Chinese/English documents. However, the rejection to outliers is always the bottleneck of this method. A new method is provided to alleviate the problem in this paper. We will give language attribute of each segment as possible as we can and then merge or split segment according to the language attribute. First of all, we construct a mixed OCR engine for Chinese radical and English character and some English character-pairs. Furthermore, English negative samples are trained to improve the capability of rejection to outliers. Finally, language determination of segments based on the mixed OCR engine and complexity analysis of local pattern is conducted. Encouraging performance has been obtained according to the test results.
WOS HeadingsScience & Technology ; Technology
WOS KeywordMULTILAYER PERCEPTRONS
Indexed ByISTP ; SCI
Language英语
WOS Research AreaAutomation & Control Systems ; Computer Science ; Engineering
WOS SubjectAutomation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Engineering, Electrical & Electronic
WOS IDWOS:000240385300051
Citation statistics
Cited Times:3[WOS]   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://ir.ia.ac.cn/handle/173211/9235
Collection09年以前成果
AffiliationChinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China
Recommended Citation
GB/T 7714
Xia, Yong,Xiao, Bai-Hua,Wang, Chun-Heng,et al. Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern[J]. INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION,2006,345:497-506.
APA Xia, Yong.,Xiao, Bai-Hua.,Wang, Chun-Heng.,Li, Yao-Dong.,Huang, DS.,...&Irwin, GW.(2006).Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern.INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION,345,497-506.
MLA Xia, Yong,et al."Segmentation of mixed Chinese/English documents based on Chinese radicals recognition and complexity analysis in local segment pattern".INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION 345(2006):497-506.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Xia, Yong]'s Articles
[Xiao, Bai-Hua]'s Articles
[Wang, Chun-Heng]'s Articles
Baidu academic
Similar articles in Baidu academic
[Xia, Yong]'s Articles
[Xiao, Bai-Hua]'s Articles
[Wang, Chun-Heng]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Xia, Yong]'s Articles
[Xiao, Bai-Hua]'s Articles
[Wang, Chun-Heng]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.