Deep Contextual Stroke Pooling for Scene Character Recognition | |
Zhang, Zhong1,2; Wang, Hong1,2; Liu, Shuang1,2; Xiao, Baihua3 | |
发表期刊 | IEEE ACCESS |
2018 | |
卷号 | 6页码:16454-16463 |
文章类型 | Article |
摘要 | Characters, as a kind of symbols carrying rich semantic information, are composed of strokes arranged in a certain structure and are of great significance in our daily life. In this paper, we are concerned with the problem of scene character recognition, and study the problem from the perspective of feature representation. We propose a novel pooling method termed deep contextual stroke pooling (DCSP) for scene character recognition. The proposed DCSP discovers the most prominent stroke information by using stroke detectors and captures the spatial context of discriminative strokes by learning contextual factor. Specifically, we first utilize the convolutional summing map in one convolutional layer to select discriminative strokes and use the convolutional activation features of discriminative strokes to train stroke detectors. Then, we propose the contextual factor to represent the co-occurrence probability of the stroke and its location. Finally, in the response regions, we incorporate the contextual factor into the detector scores and obtain the deep contextual confidence vectors of scene characters. Extensive experiments are conducted on three databases, i.e., ICDAR2003, Chars74k, and SVIIN, and the experimental results demonstrate that our method achieves higher accuracies than the state-of-the-art methods. |
关键词 | Scene Character Recognition Deep Contextual Stroke Pooling Contextual Factor |
WOS标题词 | Science & Technology ; Technology |
DOI | 10.1109/ACCESS.2018.2817342 |
关键词[WOS] | TEXT RECOGNITION ; IMAGE ; REPRESENTATION ; GESTURES |
收录类别 | SCI |
语种 | 英语 |
项目资助者 | National Natural Science Foundation of China(61501327 ; Natural Science Foundation of Tianjin(17JCZDJC30600 ; Open Projects Program of the National Laboratory of Pattern Recognition(201700001 ; China Scholarship Council(201708120039 ; 61711530240) ; 15JCQNJC01700) ; 201800002) ; 201708120040) |
WOS研究方向 | Computer Science ; Engineering ; Telecommunications |
WOS类目 | Computer Science, Information Systems ; Engineering, Electrical & Electronic ; Telecommunications |
WOS记录号 | WOS:000429991600001 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/21998 |
专题 | 复杂系统管理与控制国家重点实验室_影像分析与机器视觉 |
作者单位 | 1.Tianjin Normal Univ, Tianjin Key Lab Wireless Mobile Commun & Power Tr, Tianjin 300387, Peoples R China 2.Tianjin Normal Univ, Coll Elect & Commun Engn, Tianjin 300387, Peoples R China 3.Chinese Acad Sci, Inst Automat, State Key Lab Management & Intelligent Control Co, Beijing 100190, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Zhong,Wang, Hong,Liu, Shuang,et al. Deep Contextual Stroke Pooling for Scene Character Recognition[J]. IEEE ACCESS,2018,6:16454-16463. |
APA | Zhang, Zhong,Wang, Hong,Liu, Shuang,&Xiao, Baihua.(2018).Deep Contextual Stroke Pooling for Scene Character Recognition.IEEE ACCESS,6,16454-16463. |
MLA | Zhang, Zhong,et al."Deep Contextual Stroke Pooling for Scene Character Recognition".IEEE ACCESS 6(2018):16454-16463. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论