CASIA OpenIR

Browse/Search Results:  1-10 of 28 Help

Selected(0)Clear Items/Page:    Sort:
基于注意与记忆机制的视觉描述 学位论文
, 中国科学院自动化研究所: 中国科学院自动化研究所, 2019
Authors:  王君波
Adobe PDF(6335Kb)  |  Favorite  |  View/Download:42/1  |  Submit date:2020/01/07
视觉描述  注意与记忆机制  长序列建模  模态相关性  关系学习  
Cross-Modality Bridging and Knowledge Transferring for Image Understanding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 10, 页码: 2675-2685
Authors:  Yan, Chenggang;  Li, Liang;  Zhang, Chunjie;  Liu, Bingtao;  Zhang, Yongdong;  Dai, Qionghai
Favorite  |  View/Download:5/0  |  Submit date:2019/12/16
Object and scene recognition  image semantic search  cross-modality bridging  multi-task learning  knowledge transferring  
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 卷号: 31, 期号: 5, 页码: 996-1009
Authors:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
View  |  Adobe PDF(2826Kb)  |  Favorite  |  View/Download:30/5  |  Submit date:2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision  
Towards Sentence-Level Brain Decoding with Distributed Representations 会议论文
, Honolulu, Hawaii, USA, 2019
Authors:  Sun, Jingyuan;  Wang, Shaonan;  Zhang, Jiajun;  Zong, Chengqing
View  |  Adobe PDF(507Kb)  |  Favorite  |  View/Download:77/2  |  Submit date:2019/02/25
A Unified Framework for Multimodal Domain Adaptation 会议论文
, Seoul, Republic of Korea, October 22–26, 2018
Authors:  Qi,Fan;  Yang,Xiaoshan;  Xu,Changsheng
View  |  Adobe PDF(3378Kb)  |  Favorite  |  View/Download:270/183  |  Submit date:2018/10/10
RGB-D-based Human Motion Recognition with Deep Learning: A Survey 期刊论文
Computer Vision and Image Understanding, 2018, 卷号: PP, 期号: 1, 页码: 1-22
Authors:  Pichao Wang;  Wanqing Li;  Philip Ogunbona;  Jun Wan;  Sergio Escalera
View  |  Adobe PDF(5390Kb)  |  Favorite  |  View/Download:62/3  |  Submit date:2018/10/04
Human Motion Recognition  Rgb-d Data  Deep Learning  Survey  
Read, Watch, Listen and Summarize: Multi-modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE Transactions on Knowledge and Data Engineering, 2018, 卷号: 1, 期号: 1, 页码: 1
Authors:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
View  |  Adobe PDF(2826Kb)  |  Favorite  |  View/Download:31/0  |  Submit date:2019/02/25
Summarization  
Read, Watch, Listen and Summarize: Multi-modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE Transactions on Knowledge and Data Engineering, 2018, 卷号: 1, 期号: 1, 页码: 1
Authors:  Haoran Li;  Junnan Zhu;  Cong Ma;  Jiajun Zhang;  Chengqing Zong
View  |  Adobe PDF(2826Kb)  |  Favorite  |  View/Download:43/5  |  Submit date:2019/01/25
Multimedia  Summarization  Multi-modal  Cross-modal  Natural Language Processing  Computer Vision  
Integrating both Visual and Audio Cues for Enhanced Video Caption 会议论文
, Hilton New Orleans Riverside, American, 2018.2.1
Authors:  Wangli Hao;  Zhaoxiang Zhang;  He Guan
View  |  Adobe PDF(528Kb)  |  Favorite  |  View/Download:16/2  |  Submit date:2019/06/17
Discriminative Multimodal Embedding for Event Classication 期刊论文
Journal of Nerual Computing, 2017, 卷号: Volume, 期号: Issue, 页码: pp
Authors:  Qi,Fan;  Yang,Xiaoshan;  Zhang,Tianzhu;  Xu,Changsheng
View  |  Adobe PDF(4696Kb)  |  Favorite  |  View/Download:79/29  |  Submit date:2018/10/10
Event Classi cation  Multimodal Embedding