CASIA OpenIR
(This search is based on works claimed by the user)

Browse/search results: 34 items in total, showing items 1-10

Reducing Vision-Answer Biases for Multiple-Choice VQA [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4621-4634
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  Views/Downloads: 82/1  |  Submitted: 2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation [Journal article]
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, Volume: 19, Issue: 1s, Pages: 1-23
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(1381Kb)  |  Views/Downloads: 210/66  |  Submitted: 2023/06/12
Food recommendation  recipe calories  heterogeneous graph  self-supervised learning  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering [Journal article]
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 196/49  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2273-2286
Authors:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(2409Kb)  |  Views/Downloads: 366/74  |  Submitted: 2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Exploring the Representativity of Art Paintings [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 2794-2805
Authors:  Deng, Yingying;  Tang, Fan;  Dong, Weiming;  Ma, Chongyang;  Huang, Feiyue;  Deussen, Oliver;  Xu, Changsheng
Adobe PDF(5313Kb)  |  Views/Downloads: 281/41  |  Submitted: 2021/11/03
Painting  Art  Image color analysis  Feature extraction  Task analysis  Engineering profession  Electronic mail  Representativity  style enhancement  feature representation  artwork evaluation  
Adversarial Multimodal Network for Movie Story Question Answering [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 1744-1756
Authors:  Yuan, Zhaoquan;  Sun, Siyuan;  Duan, Lixin;  Li, Changsheng;  Wu, Xiao;  Xu, Changsheng
Views/Downloads: 192/0  |  Submitted: 2021/08/15
Knowledge discovery  Motion pictures  Visualization  Task analysis  Generators  Gallium nitride  Natural languages  Movie question answering  adversarial network  multimodal understanding  
Knowledge-driven Egocentric Multimodal Activity Recognition [Journal article]
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, Volume: 16, Issue: 4, Pages: 21
Authors:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Sang, Jitao;  Xu, Changsheng
Adobe PDF(1875Kb)  |  Views/Downloads: 423/67  |  Submitted: 2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Knowledge-aware Attentive Wasserstein Adversarial Dialogue Response Generation [Journal article]
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, Volume: 11, Issue: 4, Pages: 20
Authors:  Zhang, Yingying;  Fang, Quan;  Qian, Shengsheng;  Xu, Changsheng
Adobe PDF(1626Kb)  |  Views/Downloads: 350/65  |  Submitted: 2021/01/06
Dialogue system  co-attention  adversarial learning  external knowledge  
Unified Cross-domain Classification via Geometric and Statistical Adaptations [Journal article]
PATTERN RECOGNITION, 2021, Volume: 110, Pages: 9
Authors:  Liu, Weifeng;  Li, Jinfeng;  Liu, Baodi;  Guan, Weili;  Zhou, Yicong;  Xu, Changsheng
Views/Downloads: 223/0  |  Submitted: 2021/01/06
Domain adaptation  Statistical adaptation  Maximum mean discrepancy (MMD)  Geometric adaptation  Nystrom method  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 9, Pages: 2419-2431
Authors:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2142Kb)  |  Views/Downloads: 374/48  |  Submitted: 2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition