CASIA OpenIR

Browse/Search results: 40 records in total, showing records 1-10

Multi-modal spatial relational attention networks for visual question answering (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 13
Authors:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
Views/Downloads: 47/0  |  Submitted: 2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre-training strategy  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 36/11  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 12
Authors:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
Views/Downloads: 74/0  |  Submitted: 2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Knowledge-Embedded Mutual Guidance for Visual Reasoning (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2023, Pages: 13
Authors:  Zheng, Wenbo;  Yan, Lan;  Chen, Long;  Li, Qiang;  Wang, Fei-Yue
Views/Downloads: 94/0  |  Submitted: 2023/11/16
Attention model  joint learning  knowledge embedding  visual reasoning  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  Views/Downloads: 118/20  |  Submitted: 2023/05/03
Text-guided sounding-video generation  Video-audio representation  Contrastive learning  Transformer  
Can Digital Intelligence and Cyber-Physical-Social Systems Achieve Global Food Security and Sustainability? (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 11, Pages: 2070-2080
Authors:  Yanfen Wang;  Mengzhen Kang;  Yali Liu;  Juanjuan Li;  Kai Xue;  Xiujuan Wang;  Jianqing Du;  Yonglin Tian;  Qinghua Ni;  Fei-Yue Wang
Adobe PDF(9632Kb)  |  Views/Downloads: 189/108  |  Submitted: 2023/09/22
Carbon-water balance  decision-support  digital intelligence (DI)  foundation models  planning  
Auditory Feature Driven Model Predictive Control for Sound Source Approaching (Journal article)
International Journal of Control, Automation, and Systems, 2023, Volume: 22, Issue: 2, Pages: 1-14
Authors:  Wang, Zhiqing;  Zou, Wei;  Zhang, Wei;  Ma, Hongxuan;  Zhang, Chi;  Guo, Yuxin
Adobe PDF(7966Kb)  |  Views/Downloads: 167/47  |  Submitted: 2023/06/20
Source approaching control  interaural time difference  robotic audition  sound source localization  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
Views/Downloads: 92/0  |  Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training  
ASCL: Adversarial supervised contrastive learning for defense against word substitution attacks (Journal article)
NEUROCOMPUTING, 2022, Volume: 510, Pages: 59-68
Authors:  Shi, Jiahui;  Li, Linjing;  Zeng, Daniel
Adobe PDF(1054Kb)  |  Views/Downloads: 215/25  |  Submitted: 2022/11/14
Adversarial example  Adversarial training  Model robustness  Contrastive learning  Natural language processing  
Navigating Diverse Salient Features for Vehicle Re-Identification (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Pages: 10
Authors:  Qian, Wen;  He, Zhiqun;  Chen, Chen;  Peng, Silong
Adobe PDF(795Kb)  |  Views/Downloads: 260/51  |  Submitted: 2022/09/19
Navigation  Task analysis  Image color analysis  Boosting  Feature extraction  Benchmark testing  Space vehicles  Vehicle re-identification  suppress-and-explore mode  grid-based salient navigation  cross-space constraints