CASIA OpenIR

浏览/检索结果: 共42条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:66/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:44/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Accurate Lung Nodule Segmentation With Detailed Representation Transfer and Soft Mask Supervision 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Wang, Changwei;  Xu, Rongtao;  Xu, Shibiao;  Meng, Weiliang;  Xiao, Jun;  Zhang, Xiaopeng
Adobe PDF(4178Kb)  |  收藏  |  浏览/下载:114/1  |  提交时间:2023/12/21
Detailed representation transfer  lung nodules segmentation  medical images segmentation  soft mask  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:89/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:105/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
Knowledge-Embedded Mutual Guidance for Visual Reasoning 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 页码: 13
作者:  Zheng, Wenbo;  Yan, Lan;  Chen, Long;  Li, Qiang;  Wang, Fei-Yue
收藏  |  浏览/下载:109/0  |  提交时间:2023/11/16
Attention model  joint learning  knowledge embedding  visual reasoning  
Can Digital Intelligence and Cyber-Physical-Social Systems Achieve Global Food Security and Sustainability? 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 11, 页码: 2070-2080
作者:  Yanfen Wang;  Mengzhen Kang;  Yali Liu;  Juanjuan Li;  Kai Xue;  Xiujuan Wang;  Jianqing Du;  Yonglin Tian;  Qinghua Ni;  Fei-Yue Wang
Adobe PDF(9632Kb)  |  收藏  |  浏览/下载:210/117  |  提交时间:2023/09/22
Carbon-water balance  decision-support  digital intelligence (DI)  foundation models  planning  
Auditory Feature Driven Model Predictive Control for Sound Source Approaching 期刊论文
International Journal of Control, Automation, and Systems, 2023, 卷号: 22, 期号: 2, 页码: 1-14
作者:  Wang, Zhiqing;  Zou, Wei;  Zhang, Wei;  Ma, Hongxuan;  Zhang, Chi;  Guo, Yuxin
Adobe PDF(7966Kb)  |  收藏  |  浏览/下载:183/51  |  提交时间:2023/06/20
Source approaching control, interaural time difference, robotic audition, sound source localization.  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:136/24  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Decentralized Autonomous Operations and Organizations in TransVerse: Federated Intelligence for Smart Mobility 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 卷号: 53, 期号: 4, 页码: 2062-2072
作者:  Zhao, Chen;  Dai, Xingyuan;  Lv, Yisheng;  Niu, Jinglong;  Lin, Yilun
Adobe PDF(1921Kb)  |  收藏  |  浏览/下载:206/1  |  提交时间:2023/02/22
Intelligent Transportation Systems (ITS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Cyber–Physical–Social Systems (CPSS)