CASIA OpenIR  > 多模态人工智能系统全国重点实验室  > 多媒体计算
Browse Items

Browse/Search Results:  1-10 of 626 Help

Filters                
Selected(0)Clear Items/Page:    Sort:
Prompting Large Language Models for Automatic Question Tagging 期刊论文
Machine Intelligence Research, 2024, 页码: 0
Authors:  Nuojia Xu;  Dizhan Xue;  Shengsheng Qian;  Quan Fang;  Jun Hu
Adobe PDF(1493Kb)  |  Favorite  |  View/Download:8/2  |  Submit date:2024/06/04
Community Question Answering  Machine Learning  Large Language Model  Prompt Learning  Question Tagging  
Tri-relational multi-faceted graph neural networks for automatic question tagging 期刊论文
Neurocomputing, 2024, 卷号: 576, 页码: 127250
Authors:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  Favorite  |  View/Download:6/2  |  Submit date:2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
Authors:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
Favorite  |  View/Download:12/0  |  Submit date:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
Semantic-Context Graph Network for Point-Based 3D Object Detection 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6474-6486
Authors:  Dong, Shuwei;  Kong, Xiaoyu;  Pan, Xingjia;  Tang, Fan;  Li, Wei;  Chang, Yi;  Dong, Weiming
Favorite  |  View/Download:114/0  |  Submit date:2023/12/21
3D object detection  graph neural networks  information entanglement  
A Unified Arbitrary Style Transfer Framework via Adaptive Contrastive Learning 期刊论文
ACM TRANSACTIONS ON GRAPHICS, 2023, 卷号: 42, 期号: 5, 页码: 16
Authors:  Zhang, Yuxin;  Tang, Fan;  Dong, Weiming;  Huang, Haibin;  Ma, Chongyang;  Lee, Tong-Yee;  Xu, Changsheng
Favorite  |  View/Download:124/0  |  Submit date:2023/12/21
Arbitrary style transfer  contrastive learning  style encoding  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Favorite  |  View/Download:62/0  |  Submit date:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
跨模态多视角自监督的个性化食谱推荐异构图网络 期刊论文
计算机辅助设计与图形学学报, 2023, 卷号: 35, 期号: 3, 页码: 413-422
Authors:  宋亚光;  杨小汕;  徐常胜
Adobe PDF(854Kb)  |  Favorite  |  View/Download:206/79  |  Submit date:2023/06/26
食物推荐  异构图  自监督学习  多视角  跨模态  
Relative Alignment Network for Source-Free Multimodal Video Domain Adaptation 会议论文
MM '22: Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, Portugal, 2022.10.10—2022.10.14
Authors:  Huang Yi;  Yang Xiaoshan;  Zhang Ji;  Xu Changsheng
Adobe PDF(1264Kb)  |  Favorite  |  View/Download:193/80  |  Submit date:2023/06/21
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation 期刊论文
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, 卷号: 19, 期号: 1s, 页码: 1-23
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(1381Kb)  |  Favorite  |  View/Download:191/61  |  Submit date:2023/06/12
Food recommendation  recipe calories  heterogeneous graph  selfsupervised learning  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 页码: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Favorite  |  View/Download:185/47  |  Submit date:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation