Text Detection, Tracking and Recognition in Video: A Comprehensive Survey

doi:10.1109/TIP.2016.2554321

CASIA OpenIR > 多模态人工智能系统全国重点实验室 > 模式分析与学习

	Text Detection, Tracking and Recognition in Video: A Comprehensive Survey
	Yin, Xu-Cheng 1,2; Zuo, Ze-Yu 3; Tian, Shu 3; Liu, Cheng-Lin4
发表期刊	IEEE TRANSACTIONS ON IMAGE PROCESSING
	2016-06-01
卷号	25 期号:6 页码:2752-2773
文章类型	Article
摘要	The intelligent analysis of video data is currently in wide demand because a video is a major source of sensory data in our lives. Text is a prominent and direct source of information in video, while the recent surveys of text detection and recognition in imagery focus mainly on text extraction from scene images. Here, this paper presents a comprehensive survey of text detection, tracking, and recognition in video with three major contributions. First, a generic framework is proposed for video text extraction that uniformly describes detection, tracking, recognition, and their relations and interactions. Second, within this framework, a variety of methods, systems, and evaluation protocols of video text extraction are summarized, compared, and analyzed. Existing text tracking techniques, tracking-based detection and recognition techniques are specifically highlighted. Third, related applications, prominent challenges, and future directions for video text extraction (especially from scene videos and web videos) are also thoroughly discussed.
关键词	Text Tracking Tracking Based Text Detection Tracking Based Text Recognition Video Text Extraction Scene Text
WOS标题词	Science & Technology ; Technology
DOI	10.1109/TIP.2016.2554321
关键词[WOS]	NATURAL SCENE IMAGES ; COMPRESSED VIDEO ; PERFORMANCE EVALUATION ; NEURAL-NETWORKS ; LICENSE PLATES ; DIGITAL VIDEO ; EXTRACTION ; LOCALIZATION ; FEATURES ; FRAMES
收录类别	SCI
语种	英语
项目资助者	National Natural Science Foundation of China(61411136002 ; 61473036)
WOS研究方向	Computer Science ; Engineering
WOS类目	Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS记录号	WOS:000375303000002
引用统计	被引频次：123[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	http://ir.ia.ac.cn/handle/173211/12208
专题	多模态人工智能系统全国重点实验室_模式分析与学习
作者单位	1.Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Beijing 100083, Peoples R China 2.Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing Key Lab Mat Sci Knowledge Engn, Beijing 100083, Peoples R China 3.Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Dept Comp Sci & Technol, Beijing 100083, Peoples R China 4.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
推荐引用方式 GB/T 7714	Yin, Xu-Cheng,Zuo, Ze-Yu,Tian, Shu,et al. Text Detection, Tracking and Recognition in Video: A Comprehensive Survey[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2016,25(6):2752-2773.
APA	Yin, Xu-Cheng,Zuo, Ze-Yu,Tian, Shu,&Liu, Cheng-Lin.(2016).Text Detection, Tracking and Recognition in Video: A Comprehensive Survey.IEEE TRANSACTIONS ON IMAGE PROCESSING,25(6),2752-2773.
MLA	Yin, Xu-Cheng,et al."Text Detection, Tracking and Recognition in Video: A Comprehensive Survey".IEEE TRANSACTIONS ON IMAGE PROCESSING 25.6(2016):2752-2773.