Knowledge Commons of Institute of Automation,CAS
Multi-Orientation Scene Text Detection with Adaptive Clustering | |
Yin, Xu-Cheng; Pei, Wei-Yi; Zhang, Jun; Hao, Hong-Wei | |
发表期刊 | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE |
2015-09-01 | |
卷号 | 37期号:9;9页码:1930-1937 |
文章类型 | Article |
摘要 | Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks, while most current research efforts only focus on horizontal or near horizontal scene text. In this paper, first we present a unified distance metric learning framework for adaptive hierarchical clustering, which can simultaneously learn similarity weights (to adaptively combine different feature similarities) and the clustering threshold (to automatically determine the number of clusters). Then, we propose an effective multi-orientation scene text detection system, which constructs text candidates by grouping characters based on this adaptive clustering. Our text candidates construction method consists of several sequential coarse-to-fine grouping steps: morphology-based grouping via single-link clustering, orientation-based grouping via divisive hierarchical clustering, and projection-based grouping also via divisive clustering. The effectiveness of our proposed system is evaluated on several public scene text databases, e.g., ICDAR Robust Reading Competition data sets (2011 and 2013), MSRA-TD500 and NEOCR. Specifically, on the multi-orientation text data set MSRA-TD500, the f measure of our system is 71 percent, much better than the state-of-the-art performance. We also construct and release a practical challenging multi-orientation scene text data set (USTB-SV1K), which is available at http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/. |
关键词 | Scene Text Detection Multi-orientation Adaptive Hierarchical Clustering Coarse-to-fine Grouping |
WOS标题词 | Science & Technology ; Technology |
关键词[WOS] | LINE SEGMENTATION ; IMAGES ; DOCUMENTS ; CLASSIFICATION ; FRAMEWORK |
收录类别 | SCI |
语种 | 英语 |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000359216600015 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/40849 |
专题 | 复杂系统认知与决策实验室_听觉模型与认知计算 |
推荐引用方式 GB/T 7714 | Yin, Xu-Cheng,Pei, Wei-Yi,Zhang, Jun,et al. Multi-Orientation Scene Text Detection with Adaptive Clustering[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2015,37(9;9):1930-1937. |
APA | Yin, Xu-Cheng,Pei, Wei-Yi,Zhang, Jun,&Hao, Hong-Wei.(2015).Multi-Orientation Scene Text Detection with Adaptive Clustering.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,37(9;9),1930-1937. |
MLA | Yin, Xu-Cheng,et al."Multi-Orientation Scene Text Detection with Adaptive Clustering".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 37.9;9(2015):1930-1937. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论