CASIA OpenIR  > 模式识别国家重点实验室  > 模式分析与学习
Semantic-Aware Video Text Detection
Wei Feng1,2; Fei Yin1,2; Xu-Yao Zhang1,2; Cheng-Lin Liu1,2
2021-06
Conference NameInternaltional Conference on Computer Vision and Pattern Recogintion
Conference Date2021.6.19
Conference Place线上会议
Abstract

Most existing video text detection methods track texts with appearance features, which are easily influenced by the change of perspective and illumination. Compared with appearance features, semantic features are more robust cues for matching text instances. In this paper, we propose an end-to-end trainable video text detector that tracks texts based on semantic features. First, we introduce a new character center segmentation branch to extract semantic features, which encode the category and position of characters. Then we propose a novel appearance-semantic-geometry descriptor to track text instances, in which semantic features can improve the robustness against appearance changes. To overcome the lack of character-level annotations, we propose a novel weakly-supervised character center detection module, which \textbf{only uses word-level annotated real images} to generate character-level labels. The proposed method achieves state-of-the-art performance on three video text benchmarks ICDAR 2013 Video, Minetto and RT-1K, and two Chinese scene text benchmarks CASIA10K and MSRA-TD500.

Indexed ByEI
Document Type会议论文
Identifierhttp://ir.ia.ac.cn/handle/173211/45047
Collection模式识别国家重点实验室_模式分析与学习
Affiliation1.National Laboratory of Pattern Recognition, Institutaion of Automation of Chinese Academy of Sciences
2.School of Artificial Intelligence, University of Chinese Academy of Sciences
3.CAS Center for Excellence of Brain Science and Intelligence Technology
Recommended Citation
GB/T 7714
Wei Feng,Fei Yin,Xu-Yao Zhang,et al. Semantic-Aware Video Text Detection[C],2021.
Files in This Item: Download All
File Name/Size DocType Version Access License
03057.pdf(2299KB)会议论文 开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wei Feng]'s Articles
[Fei Yin]'s Articles
[Xu-Yao Zhang]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wei Feng]'s Articles
[Fei Yin]'s Articles
[Xu-Yao Zhang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wei Feng]'s Articles
[Fei Yin]'s Articles
[Xu-Yao Zhang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 03057.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.