CASIA OpenIR

Browse/Search results: 57 items in total, showing items 1-10

Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF (2397Kb)  |  Views/Downloads: 160/43  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF (7741Kb)  |  Views/Downloads: 118/20  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer  
XANet: An Efficient Remote Sensing Image Segmentation Model Using Element-Wise Attention Enhancement and Multi-Scale Attention Fusion (Journal article)
Remote Sensing, 2023, Volume: 15, Issue: 1, Pages: 25
Authors:  Liang, Chenbin;  Xiao, Baihua;  Cheng, Bo;  Dong, Yunyun
Adobe PDF (63859Kb)  |  Views/Downloads: 265/25  |  Submitted: 2023/02/22
semantic segmentation  attention mechanism  cross-attention  feature fusion  
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1-17
Authors:  Du CD(杜长德);  Fu KC(付铠成);  Li JP(李劲鹏);  He HG(何晖光)
Adobe PDF (4669Kb)  |  Views/Downloads: 378/65  |  Submitted: 2023/05/05
ArtCap: A Dataset for Image Captioning of Fine Art Paintings (Journal article)
IEEE Transactions on Computational Social Systems, 2022, Pages: 12
Authors:  Lu, Yue;  Guo, Chao;  Dai, Xingyuan;  Wang, Fei-Yue
Adobe PDF (5137Kb)  |  Views/Downloads: 221/43  |  Submitted: 2023/02/22
Dataset construction  image captioning  painting captioning  
Dual-View Conditional Variational Auto-Encoder for Emotional Dialogue Generation (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, Volume: 21, Issue: 3, Pages: 18
Authors:  Li, Mei;  Zhang, Jiajun;  Lu, Xiang;  Zong, Chengqing
Adobe PDF (1187Kb)  |  Views/Downloads: 272/55  |  Submitted: 2022/06/10
Sentiment  dialogue  neural networks  
Learning Video Moment Retrieval Without a Single Annotated Video (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2022, Volume: 32, Issue: 3, Pages: 1646-1657
Authors:  Gao, Junyu;  Xu, Changsheng
Views/Downloads: 195/0  |  Submitted: 2022/06/06
Visualization  Task analysis  Generators  Training  Graph neural networks  Semantics  Detectors  Video moment retrieval  graph neural network  unpaired learning  
Using Pre-trained Language Model to Enhance Active Learning for Sentence Matching (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, Volume: 21, Issue: 2, Pages: 19
Authors:  Bai, Guirong;  He, Shizhu;  Liu, Kang;  Zhao, Jun
Adobe PDF (4097Kb)  |  Views/Downloads: 255/52  |  Submitted: 2022/06/10
Sentence matching  active learning  pre-trained language model  
Learning Hierarchical Video Graph Networks for One-Stop Video Delivery (Journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2022, Volume: 18, Issue: 1, Pages: 1-23
Authors:  Song, Yaguang;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF (7608Kb)  |  Views/Downloads: 133/41  |  Submitted: 2023/04/25
Cross modal  video retrieval  deep learning  graph neural networks  
Cross-Modality Synergy Network for Referring Expression Comprehension and Segmentation (Journal article)
Neurocomputing, 2022, Volume: 467, Pages: 99-114
Authors:  Li, Qianzhong;  Zhang, Yujia;  Sun, Shiying;  Wu, Jinting;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF (4555Kb)  |  Views/Downloads: 301/44  |  Submitted: 2021/12/28
Referring expression comprehension  Referring expression segmentation  Cross-modality synergy  Attention mechanism