Knowledge Commons of Institute of Automation, CAS
A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos | |
Tian, Shu; Yin, Xu-Cheng; Su, Ya; Hao, Hong-Wei | |
Journal | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE |
2018-03-01 | |
Volume | 40, Issue: 3, Pages: 542-554 |
Article type | Article |
Abstract | Video text extraction plays an important role in multimedia understanding and retrieval. Most previous research has been conducted within individual frames. A few recent methods track text across multiple frames but do not effectively exploit the relations among text detection, tracking, and recognition. In this paper, we propose a generic Bayesian framework for Tracking based Text Detection And Recognition (T²DAR) of embedded captions in web videos. The framework is composed of three major components: text tracking, tracking based text detection, and tracking based text recognition. In this unified framework, text tracking is first conducted by tracking-by-detection. Tracking trajectories are then revised and refined with detection or recognition results. Text detection or recognition is finally improved with multi-frame integration. Moreover, a challenging video text (embedded caption text) database, USTB-VidTEXT, is constructed and made publicly available. A variety of experiments on this dataset verify that the proposed approach substantially improves the performance of text detection and recognition from web videos. |
Keywords | Video Text Extraction ; Text Tracking ; Tracking Based Text Detection ; Tracking Based Text Recognition ; Embedded Captions |
WOS headings | Science & Technology ; Technology |
DOI | 10.1109/TPAMI.2017.2692763 |
Keywords [WOS] | NATURAL SCENE IMAGES ; READING TEXT ; SEGMENTATION ; EXTRACTION |
Indexed by | SCI |
Language | English |
WOS research area | Computer Science ; Engineering |
WOS category | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS accession number | WOS:000424465900003 |
Document type | Journal article |
Item identifier | http://ir.ia.ac.cn/handle/173211/40787 |
Collection | Laboratory of Cognition and Decision Intelligence for Complex Systems_Auditory Models and Cognitive Computing |
Recommended citation (GB/T 7714) | Tian, Shu,Yin, Xu-Cheng,Su, Ya,et al. A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2018,40(3):542-554. |
APA | Tian, Shu,Yin, Xu-Cheng,Su, Ya,&Hao, Hong-Wei.(2018).A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,40(3),542-554. |
MLA | Tian, Shu,et al."A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 40.3(2018):542-554. |
Files in this item | There are no files associated with this item. |