CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:87/3  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:411/4  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Learning to Model Relationships for Zero-Shot Video Classification 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 卷号: 43, 期号: 10, 页码: 3476-3491
作者:  Gao, Junyu;  Zhang, Tianzhu;  Xu, Changsheng
收藏  |  浏览/下载:285/0  |  提交时间:2021/11/04
Zero-shot video classification  graph neural networks  zero-shot learning  deep attention model  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:355/47  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Knowledge-driven Egocentric Multimodal Activity Recognition 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 卷号: 16, 期号: 4, 页码: 21
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Sang, Jitao;  Xu, Changsheng
Adobe PDF(1875Kb)  |  收藏  |  浏览/下载:427/68  |  提交时间:2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 12, 页码: 3101-3114
作者:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(4322Kb)  |  收藏  |  浏览/下载:345/66  |  提交时间:2021/03/01
Semantics  Correlation  Aircraft propulsion  Deep learning  Bridges  Aircraft  Task analysis  Cross-modal retrieval  adversarial hashing  multi-level correlation  
Multimodal graph convolutional networks for high quality content recognition 期刊论文
NEUROCOMPUTING, 2020, 卷号: 412, 页码: 42-51
作者:  Wang, Jinguang;  Hu, Jun;  Qian, Shengsheng;  Fang, Quan;  Xu, Changsheng
收藏  |  浏览/下载:257/0  |  提交时间:2021/01/07
High quality content recognition  Graph convolutional networks  Positive unlabeled learning  
Knowledge-aware Attentive Wasserstein Adversarial Dialogue Response Generation 期刊论文
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 卷号: 11, 期号: 4, 页码: 20
作者:  Zhang, Yingying;  Fang, Quan;  Qian, Shengsheng;  Xu, Changsheng
Adobe PDF(1626Kb)  |  收藏  |  浏览/下载:355/66  |  提交时间:2021/01/06
Dialogue system  co-attention  adversarial learning  external knowledge  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 9, 页码: 2419-2431
作者:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2142Kb)  |  收藏  |  浏览/下载:378/49  |  提交时间:2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition