A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos

doi:10.1109/TPAMI.2017.2692763

	A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos
	Tian, Shu; Yin, Xu-Cheng; Su, Ya; Hao, Hong-Wei
发表期刊	IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
	2018-03-01
卷号	40 期号:3 页码:542-554
文章类型	Article
摘要	Video text extraction plays an important role for multimedia understanding and retrieval. Most previous research efforts are conducted within individual frames. A few of recent methods, which pay attention to text tracking using multiple frames, however, do not effectively mine the relations among text detection, tracking and recognition. In this paper, we propose a generic Bayesian-based framework of Tracking based Text Detection And Recognition (T(2)DAR) from web videos for embedded captions, which is composed of three major components, i.e., text tracking, tracking based text detection, and tracking based text recognition. In this unified framework, text tracking is first conducted by tracking-by-detection. Tracking trajectories are then revised and refined with detection or recognition results. Text detection or recognition is finally improved with multi-frame integration. Moreover, a challenging video text (embedded caption text) database (USTB-VidTEXT) is constructed and publicly available. A variety of experiments on this dataset verify that our proposed approach largely improves the performance of text detection and recognition from web videos.
关键词	Video Text Extraction Text Tracking Tracking Based Text Detection Tracking Based Text Recognition Embedded Captions
WOS标题词	Science & Technology ; Technology
DOI	10.1109/TPAMI.2017.2692763
关键词[WOS]	NATURAL SCENE IMAGES ; READING TEXT ; SEGMENTATION ; EXTRACTION
收录类别	SCI
语种	英语
WOS研究方向	Computer Science ; Engineering
WOS类目	Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS记录号	WOS:000424465900003
引用统计	被引频次：43[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	http://ir.ia.ac.cn/handle/173211/40787
专题	复杂系统认知与决策实验室_听觉模型与认知计算
推荐引用方式 GB/T 7714	Tian, Shu,Yin, Xu-Cheng,Su, Ya,et al. A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2018,40(3):542-554.
APA	Tian, Shu,Yin, Xu-Cheng,Su, Ya,&Hao, Hong-Wei.(2018).A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,40(3),542-554.
MLA	Tian, Shu,et al."A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 40.3(2018):542-554.