CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共11条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 1-15
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  收藏  |  浏览/下载:109/20  |  提交时间:2023/04/25
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:255/48  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:321/44  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Knowledge-driven Egocentric Multimodal Activity Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 卷号: 16, 期号: 4, 页码: 21
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Sang, Jitao;  Xu, Changsheng
Adobe PDF(1875Kb)  |  收藏  |  浏览/下载:362/49  |  提交时间:2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Image Captioning by Asking Questions 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 卷号: 15, 期号: 2, 页码: 19
作者:  Yang, Xiaoshan;  XU, Changsheng
收藏  |  浏览/下载:309/0  |  提交时间:2019/12/16
Image captioning  visual question answering  attention networks  
Discriminative Multimodal Embedding for Event Classication 期刊论文
Journal of Nerual Computing, 2017, 卷号: Volume, 期号: Issue, 页码: pp
作者:  Qi,Fan;  Yang,Xiaoshan;  Zhang,Tianzhu;  Xu,Changsheng
浏览  |  Adobe PDF(4696Kb)  |  收藏  |  浏览/下载:311/97  |  提交时间:2018/10/10
Event Classi cation  Multimodal Embedding  
A Unified Framework for Multimodal Domain Adaptation 会议论文
, Seoul, Republic of Korea, October 22–26, 2018
作者:  Qi,Fan;  Yang,Xiaoshan;  Xu,Changsheng
浏览  |  Adobe PDF(3378Kb)  |  收藏  |  浏览/下载:719/336  |  提交时间:2018/10/10
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 9, 页码: 2360-2370
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:507/139  |  提交时间:2018/02/07
Multimedia Storytelling  Video Analysis  Deep Learning  
Deep Relative Tracking 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 卷号: 26, 期号: 4, 页码: 1845-1858
作者:  Gao, Junyu;  Zhang, Tianzhu;  Yang, Xiaoshan;  Xu, Changsheng;  Changsheng Xu
浏览  |  Adobe PDF(5252Kb)  |  收藏  |  浏览/下载:579/237  |  提交时间:2017/02/23
Visual Tracking  Deep Learning  Relative Model  
Semantic Feature Mining for Video Event Understanding 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 卷号: 12, 期号: 4, 页码: 55:1-55:22
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(910Kb)  |  收藏  |  浏览/下载:485/159  |  提交时间:2016/12/26
Video Recognition  Event