CASIA OpenIR

浏览/检索结果: 共16条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:300/75  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Graph-based Multimodal Ranking Models for Multimodal Summarization 期刊论文
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 卷号: 20, 期号: 4, 页码: 21
作者:  Zhu, Junnan;  Xiang, Lu;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(4193Kb)  |  收藏  |  浏览/下载:256/48  |  提交时间:2021/12/28
Multimodal summarization  single-modal  multimodal ranking  unsupervised  
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:234/46  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:198/28  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Long video question answering: A Matching-guided Attention Model 期刊论文
PATTERN RECOGNITION, 2020, 卷号: 102, 期号: 1, 页码: 11
作者:  Wang, Weining;  Huang, Yan;  Wang, Liang
浏览  |  Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:342/67  |  提交时间:2020/06/02
Long video QA  Matching-guided attention  
Structure Preserving Convolutional Attention for Image Captioning 期刊论文
APPLIED SCIENCES-BASEL, 2019, 卷号: 9, 期号: 14, 页码: 10
作者:  Lu, Shichen;  Hu, Ruimin;  Liu, Jing;  Guo, Longteng;  Zheng, Fei
Adobe PDF(2351Kb)  |  收藏  |  浏览/下载:259/37  |  提交时间:2019/12/16
image captioning  attention  spatial structure  deep learning  computer vision  
Scene text detection and recognition with advances in deep learning: a survey 期刊论文
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 卷号: 22, 期号: 2, 页码: 143-162
作者:  Liu, Xiyan;  Meng, Gaofeng;  Pan, Chunhong
Adobe PDF(2418Kb)  |  收藏  |  浏览/下载:275/30  |  提交时间:2019/07/11
Natural image  Text detection  Text recognition  Survey  
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 卷号: 31, 期号: 5, 页码: 996-1009
作者:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
浏览  |  Adobe PDF(2826Kb)  |  收藏  |  浏览/下载:376/98  |  提交时间:2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision  
Surface defect saliency of magnetic tile 期刊论文
The visual computer, 2018, 卷号: 34, 期号: 8, 页码: 1-12
作者:  Yibin Huang;  Congying qiu;  Kui yuan
浏览  |  Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:498/204  |  提交时间:2019/04/22
Saliency  Surface Defect  Inspection  
A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 卷号: 40, 期号: 3, 页码: 542-554
作者:  Tian, Shu;  Yin, Xu-Cheng;  Su, Ya;  Hao, Hong-Wei
收藏  |  浏览/下载:60/0  |  提交时间:2020/10/27
Video Text Extraction  Text Tracking  Tracking Based Text Detection  Tracking Based Text Recognition  Embedded Captions