CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共63条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931
作者:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/07/08
Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning 会议论文
, Chengdu, China, 2021-10
作者:  Zhang X(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(5740Kb)  |  收藏  |  浏览/下载:31/7  |  提交时间:2024/07/08
VQACL: A Novel Visual Question Answering Continual Learning Setting 会议论文
, Canada, 2023
作者:  Zhang X(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(1199Kb)  |  收藏  |  浏览/下载:29/6  |  提交时间:2024/07/08
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation 期刊论文
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, 卷号: 19, 期号: 1s, 页码: 1-23
作者:  Song, Yaguang;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(1381Kb)  |  收藏  |  浏览/下载:219/67  |  提交时间:2023/06/12
Food recommendation  recipe calories  heterogeneous graph  selfsupervised learning  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:202/50  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 1-15
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  收藏  |  浏览/下载:147/33  |  提交时间:2023/04/25
Weakly-supervised video object grounding via causal intervention 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 卷号: 45, 期号: 3, 页码: 3933 - 3948
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(4558Kb)  |  收藏  |  浏览/下载:150/61  |  提交时间:2023/04/25
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:422/6  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Attribute-Induced Bias Eliminating for Transductive Zero-Shot Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 1933-1942
作者:  Yao, Hantao;  Min, Shaobo;  Zhang, Yongdong;  Xu, Changsheng
收藏  |  浏览/下载:244/0  |  提交时间:2022/06/10
Semantics  Visualization  Bridges  Training  Knowledge transfer  Image recognition  Topology  Transductive Zero-Shot Learning  Graph Attribute Embedding  Attribute-Induced Bias Eliminating  Semantic-Visual Alignment  
Joint Expression Synthesis and Representation Learning for Facial Expression Recognition 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 3, 页码: 1681-1695
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(4827Kb)  |  收藏  |  浏览/下载:273/5  |  提交时间:2022/06/06
Face recognition  Task analysis  Generative adversarial networks  Image synthesis  Image recognition  Faces  Training  Facial expression recognition  facial image synthesis  generative adversarial network  representation learning