CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共3条,第1-3条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Unbiased Visual Question Answering by Leveraging Instrumental Variable 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6648-6662
作者:  Pan, Yonghua;  Liu, Jing;  Jin, Lu;  Li, Zechao
收藏  |  浏览/下载:1/0  |  提交时间:2024/07/22
Visualization  Correlation  Instruments  Training  Predictive models  Color  Generators  Visual question answering  instrumental variable  causal inference  out of distribution  
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:  Wang, Wenxuan;  He, Xingjian;  Zhang, Yisi;  Guo, Longteng;  Shen, Jiachen;  Li, Jiangyun;  Liu, Jing
收藏  |  浏览/下载:5/0  |  提交时间:2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:169/36  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer