CASIA OpenIR

Browse/Search results: 23 items in total, showing items 1-10

Multi-modal spatial relational attention networks for visual question answering (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 13
Authors: Yao, Haibo; Wang, Lipeng; Cai, Chengtao; Sun, Yuxin; Zhang, Zhi; Luo, Yongkang
Views/Downloads: 46/0  |  Submitted: 2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre-training strategy  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 12
Authors: Zheng, Wenbo; Yan, Lan; Wang, Fei-Yue
Views/Downloads: 72/0  |  Submitted: 2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Knowledge-Embedded Mutual Guidance for Visual Reasoning (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2023, Pages: 13
Authors: Zheng, Wenbo; Yan, Lan; Chen, Long; Li, Qiang; Wang, Fei-Yue
Views/Downloads: 92/0  |  Submitted: 2023/11/16
Attention model  joint learning  knowledge embedding  visual reasoning  
Medical visual question answering with symmetric interaction attention and cross-modal gating (Journal article)
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, Volume: 85, Pages: 10
Authors: Chen, Zhi; Zou, Beiji; Dai, Yulan; Zhu, Chengzhang; Kong, Guilan; Zhang, Wensheng
Views/Downloads: 73/0  |  Submitted: 2023/11/17
Medical visual question answering  Self-attention  Information interaction  Cross-modal gating  
Hierarchical Attention Networks for Fact-based Visual Question Answering (Journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, Pages: 18
Authors: Yao, Haibo; Luo, Yongkang; Zhang, Zhi; Yang, Jianhang; Cai, Chengtao
Views/Downloads: 60/0  |  Submitted: 2023/11/17
Fact-based Visual Question Answering  Hierarchical attention networks  Self-attention  Multiple attention interaction  Positional encoding  
SiamON: Siamese Occlusion-Aware Network for Visual Tracking (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 1, Pages: 186-199
Authors: Fan, Chao; Yu, Hongyuan; Huang, Yan; Shan, Caifeng; Wang, Liang; Li, Chenglong
Views/Downloads: 88/0  |  Submitted: 2023/03/20
Visual tracking  occlusion aware  attention  siamese network  
Geometry Sensitive Cross-Modal Reasoning for Composed Query Based Image Retrieval (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 1000-1011
Authors: Zhang, Feifei; Xu, Mingliang; Xu, Changsheng
Views/Downloads: 226/0  |  Submitted: 2022/02/16
Visualization  Image retrieval  Semantics  Cognition  Geometry  Task analysis  Electronic mail  Composed query based image retrieval  semantic gap  spatial structure  inter-modal attention  text-guided visual reasoning  
Glaucoma Detection with Retinal Fundus Images Using Segmentation and Classification (Journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 6, Pages: 563-580
Authors: Thisara Shyamalee; Dulani Meedeniya
Adobe PDF(3581Kb)  |  Views/Downloads: 2/1  |  Submitted: 2024/04/23
Attention U-Net  segmentation  classification  Inception-v3  visual geometry group 19 (VGG19)  residual neural network 50 (ResNet50)  glaucoma  fundus images  
Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images (Journal article)
NEURAL PROCESSING LETTERS, 2021, Issue: 53, Pages: 18
Authors: Gao, Zishu; Li, En; Wang, Zhe; Yang, Guodong; Lu, Jiwu; Ouyang, Bo; Xu, Dawei; Liang, Zize
Adobe PDF(1338Kb)  |  Views/Downloads: 248/50  |  Submitted: 2021/03/01
Object reconstruction  Convolutional LSTM  Visual attention  Robotic application  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3518-3529
Authors: Liu, Fei; Liu, Jing; Fang, Zhiwei; Hong, Richang; Lu, Hanqing
Adobe PDF(2891Kb)  |  Views/Downloads: 261/58  |  Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions