CASIA OpenIR

浏览/检索结果: 共98条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:55/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:45/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:70/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Knowledge-Embedded Mutual Guidance for Visual Reasoning 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 页码: 13
作者:  Zheng, Wenbo;  Yan, Lan;  Chen, Long;  Li, Qiang;  Wang, Fei-Yue
收藏  |  浏览/下载:90/0  |  提交时间:2023/11/16
Attention model  joint learning  knowledge embedding  visual reasoning  
Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 卷号: 19, 期号: 4, 页码: 17
作者:  Ma, Xuan;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:57/0  |  提交时间:2023/11/17
Knowledge reasoning  multi-modal commonsense inference  graph neural network  
Dynamic High-Resolution Network for Semantic Segmentation in Remote-Sensing Images 期刊论文
REMOTE SENSING, 2023, 卷号: 15, 期号: 9, 页码: 28
作者:  Guo, Shichen;  Yang, Qi;  Xiang, Shiming;  Wang, Pengfei;  Wang, Xuezhi
收藏  |  浏览/下载:28/0  |  提交时间:2023/11/17
semantic segmentation  remote-sensing image  neural architecture search  sparse regularization  HRNet  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:117/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 5, 期号: 1, 页码: 5 - 15
作者:  Zhentao Tang;  Yuanheng Zhu;  Dongbin Zhao;  Simon M. Lucas
Adobe PDF(7686Kb)  |  收藏  |  浏览/下载:216/60  |  提交时间:2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:88/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training