CASIA OpenIR

Browse/Search results: 11 items in total, showing items 1-10

Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  Views/Downloads: 164/44  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  Views/Downloads: 121/22  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval (journal article)
IEEE Transactions on Multimedia, 2022, Pages: 1-15
Authors:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  Views/Downloads: 100/19  |  Submitted: 2023/04/25
Weakly-Supervised Video Object Grounding Via Learning Uni-Modal Associations (journal article)
IEEE Transactions on Multimedia, 2022, Volume: 25, Pages: 1-12
Authors:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(5406Kb)  |  Views/Downloads: 86/23  |  Submitted: 2023/04/25
Visualization  Grounding  Task analysis  Prototypes  Annotations  Uncertainty  Proposals  Cross-modal retrieval  weakly-supervised learning  video object grounding  uni-modal association  
Dense Modality Interaction Network for Audio-Visual Event Localization (journal article)
IEEE Transactions on Multimedia, 2022, Pages: 1-1
Authors:  Liu, Shuo;  Quan, Weize;  Wang, Chaoqun;  Liu, Yuan;  Liu, Bin;  Yan, Dong-Ming
Views/Downloads: 29/0  |  Submitted: 2023/04/25
Deep Modality Assistance Co-Training Network for Semi-Supervised Multi-Label Semantic Decoding (journal article)
IEEE Transactions on Multimedia, 2022, Volume: 24, Pages: 3287-3299
Authors:  Dan Li;  Changde Du;  Haibao Wang;  Qiongyi Zhou;  Huiguang He
Adobe PDF(2627Kb)  |  Views/Downloads: 229/80  |  Submitted: 2023/01/17
A Reconstruction-based Visual-Acoustic-Semantic Embedding Method for Speech-Image Retrieval (journal article)
IEEE Transactions on Multimedia, 2022, Pages: 14
Authors:  Cheng, Wenlong;  Tang, Wei;  Huang, Yan;  Luo, Yiwen;  Wang, Liang
Adobe PDF(1628Kb)  |  Views/Downloads: 233/87  |  Submitted: 2022/06/14
CNDesc: Cross Normalization for Local Descriptors Learning (journal article)
IEEE Transactions on Multimedia, 2022, Volume: xx, Issue: xx, Pages: xx
Authors:  Changwei Wang;  Rongtao Xu;  Shibiao Xu;  Weiliang Meng;  Xiaopeng Zhang
Adobe PDF(13183Kb)  |  Views/Downloads: 179/24  |  Submitted: 2022/04/15
Local descriptors  cross normalization  densely connected backbone  distribution consistent loss  
You are What You Eat: Exploring Rich Recipe Information for Cross-Region Food Analysis (journal article)
IEEE Transactions on Multimedia, 2017, Issue: 99, Pages: 1-1
Authors:  Weiqing Min;  Bing-Kun Bao;  Shuhuan Mei;  Yaohui Zhu;  Yong Rui;  Shuqiang Jiang
Adobe PDF(26270Kb)  |  Views/Downloads: 244/65  |  Submitted: 2018/01/04
Unsupervised Web Topic Detection Using A Ranked Clustering-Like Pattern Across Similarity Cascades (journal article)
IEEE Transactions on Multimedia, 2015, Volume: 17, Issue: 6, Pages: 843-853
Authors:  Junbiao Pang;  Fei Jia;  Chunjie Zhang;  Weigang Zhang;  Qingming Huang;  Baocai Yin
Adobe PDF(1761Kb)  |  Views/Downloads: 316/128  |  Submitted: 2017/09/19
Maximal Clique  Poisson Deconvolution  Similarity Cascade  Unsupervised Ranking  Web Topic Detection