CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 9 items in total, showing items 1-9

Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors: Song, Yaguang; Yang, Xiaoshan; Wang, Yaowei; Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 175/47  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
The Model May Fit You: User-Generalized Cross-Modal Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 24, Pages: 2998-3012
Authors: Ma, Xinhong; Yang, Xiaoshan; Gao, Junyu; Xu, Changsheng
Adobe PDF(6549Kb)  |  Views/Downloads: 259/48  |  Submitted: 2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
A unified framework for multi-modal federated learning (Journal article)
NEUROCOMPUTING, 2022, Volume: 480, Pages: 110-118
Authors: Xiong, Baochen; Yang, Xiaoshan; Qi, Fan; Xu, Changsheng
Views/Downloads: 249/0  |  Submitted: 2022/06/06
Multi-modal  Federated learning  Co-attention  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 2386-2397
Authors: Wang, Wei; Gao, Junyu; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF(2165Kb)  |  Views/Downloads: 326/45  |  Submitted: 2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Health Status Prediction with Local-Global Heterogeneous Behavior Graph (Journal article)
ACM Transactions on Multimedia Computing Communications and Applications, 2021, Volume: 0, Issue: 0, Pages: 0
Authors: Ma, Xuan; Yang, Xiaoshan; Gao, Junyu; Xu, Changsheng
Adobe PDF(1170Kb)  |  Views/Downloads: 240/67  |  Submitted: 2021/06/16
Health Status Prediction  Graph Neural Networks  Individual Behavior  
Knowledge-driven Egocentric Multimodal Activity Recognition (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, Volume: 16, Issue: 4, Pages: 21
Authors: Huang, Yi; Yang, Xiaoshan; Gao, Junyu; Sang, Jitao; Xu, Changsheng
Adobe PDF(1875Kb)  |  Views/Downloads: 369/49  |  Submitted: 2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Discriminative Multimodal Embedding for Event Classification (Journal article)
Journal of Neural Computing, 2017, Volume: Volume, Issue: Issue, Pages: pp
Authors: Qi, Fan; Yang, Xiaoshan; Zhang, Tianzhu; Xu, Changsheng
Adobe PDF(4696Kb)  |  Views/Downloads: 312/97  |  Submitted: 2018/10/10
Event Classification  Multimodal Embedding  
Cross-Domain Feature Learning in Multimedia (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, Volume: 17, Issue: 1, Pages: 64-78
Authors: Yang, Xiaoshan; Zhang, Tianzhu; Xu, Changsheng; Xu CS(徐常胜)
Adobe PDF(3097Kb)  |  Views/Downloads: 371/96  |  Submitted: 2016/06/27
Cross-domain  Deep Learning  Feature Learning  Multi-modal  
Automatic Visual Concept Learning for Social Event Understanding (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, Volume: 17, Issue: 3, Pages: 346-358
Authors: Yang, Xiaoshan; Zhang, Tianzhu; Xu, Changsheng; Hossain, M. Shamim
Adobe PDF(2027Kb)  |  Views/Downloads: 375/117  |  Submitted: 2015/09/21
Event Analysis  Video Recognition