Knowledge Commons of Institute of Automation, CAS
Text Detection, Tracking and Recognition in Video: A Comprehensive Survey
Yin, Xu-Cheng [1,2]; Zuo, Ze-Yu [3]; Tian, Shu [3]; Liu, Cheng-Lin [4]
Journal | IEEE TRANSACTIONS ON IMAGE PROCESSING |
Publication date | 2016-06-01 |
Volume | 25; Issue: 6; Pages: 2752-2773 |
Article type | Article |
Abstract | The intelligent analysis of video data is currently in wide demand because a video is a major source of sensory data in our lives. Text is a prominent and direct source of information in video, while the recent surveys of text detection and recognition in imagery focus mainly on text extraction from scene images. Here, this paper presents a comprehensive survey of text detection, tracking, and recognition in video with three major contributions. First, a generic framework is proposed for video text extraction that uniformly describes detection, tracking, recognition, and their relations and interactions. Second, within this framework, a variety of methods, systems, and evaluation protocols of video text extraction are summarized, compared, and analyzed. Existing text tracking techniques, tracking-based detection and recognition techniques are specifically highlighted. Third, related applications, prominent challenges, and future directions for video text extraction (especially from scene videos and web videos) are also thoroughly discussed. |
Keywords | Text Tracking ; Tracking Based Text Detection ; Tracking Based Text Recognition ; Video Text Extraction ; Scene Text |
WOS headings | Science & Technology ; Technology |
DOI | 10.1109/TIP.2016.2554321 |
Keywords [WOS] | NATURAL SCENE IMAGES ; COMPRESSED VIDEO ; PERFORMANCE EVALUATION ; NEURAL-NETWORKS ; LICENSE PLATES ; DIGITAL VIDEO ; EXTRACTION ; LOCALIZATION ; FEATURES ; FRAMES |
Indexed by | SCI |
Language | English |
Funding organization | National Natural Science Foundation of China (61411136002 ; 61473036) |
WOS research area | Computer Science ; Engineering |
WOS subject category | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS accession number | WOS:000375303000002 |
Document type | Journal article |
Item identifier | http://ir.ia.ac.cn/handle/173211/12208 |
Collection | State Key Laboratory of Multimodal Artificial Intelligence Systems_Pattern Analysis and Learning |
Author affiliations | 1. Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Beijing 100083, Peoples R China; 2. Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing Key Lab Mat Sci Knowledge Engn, Beijing 100083, Peoples R China; 3. Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Comp Sci & Technol, Beijing 100083, Peoples R China; 4. Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China |
Recommended citation (GB/T 7714) | Yin, Xu-Cheng,Zuo, Ze-Yu,Tian, Shu,et al. Text Detection, Tracking and Recognition in Video: A Comprehensive Survey[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2016,25(6):2752-2773. |
APA | Yin, Xu-Cheng,Zuo, Ze-Yu,Tian, Shu,&Liu, Cheng-Lin.(2016).Text Detection, Tracking and Recognition in Video: A Comprehensive Survey.IEEE TRANSACTIONS ON IMAGE PROCESSING,25(6),2752-2773. |
MLA | Yin, Xu-Cheng,et al."Text Detection, Tracking and Recognition in Video: A Comprehensive Survey".IEEE TRANSACTIONS ON IMAGE PROCESSING 25.6(2016):2752-2773. |