CASIA OpenIR

Browse/Search Results: 126 items in total, showing items 1-10

AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Pages: 1-15
Authors:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF (16839 Kb)  |  Views/Downloads: 39/7  |  Submitted: 2024/02/23
Multi-modal spatial relational attention networks for visual question answering (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 13
Authors:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
Views/Downloads: 45/0  |  Submitted: 2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre-training strategy
VQAPT: A New visual question answering model for personality traits in social media images (Journal article)
PATTERN RECOGNITION LETTERS, 2023, Volume: 175, Pages: 66-73
Authors:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
Views/Downloads: 32/0  |  Submitted: 2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 12
Authors:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
Views/Downloads: 70/0  |  Submitted: 2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Medical visual question answering with symmetric interaction attention and cross-modal gating (Journal article)
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, Volume: 85, Pages: 10
Authors:  Chen, Zhi;  Zou, Beiji;  Dai, Yulan;  Zhu, Chengzhang;  Kong, Guilan;  Zhang, Wensheng
Views/Downloads: 70/0  |  Submitted: 2023/11/17
Medical visual question answering  Self-attention  Information interaction  Cross-modal gating  
Hierarchical Attention Networks for Fact-based Visual Question Answering (Journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, Pages: 18
Authors:  Yao, Haibo;  Luo, Yongkang;  Zhang, Zhi;  Yang, Jianhang;  Cai, Chengtao
Views/Downloads: 58/0  |  Submitted: 2023/11/17
Fact-based Visual Question Answering  Hierarchical attention networks  Self-attention  Multiple attention interaction  Positional encoding  
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram (Conference paper)
Macao, China, 2023-7-19
Authors:  Zhang Ming-Liang;  Yin Fei;  Liu Cheng-Lin
Adobe PDF (1110 Kb)  |  Views/Downloads: 33/10  |  Submitted: 2024/04/03
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF (2397 Kb)  |  Views/Downloads: 157/42  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Dense Attention: A Densely Connected Attention Mechanism for Vision Transformer (Conference paper)
Queensland, Australia, June 18-23, 2023
Authors:  Nannan Li;  Yaran Chen;  Dongbin Zhao
Adobe PDF (3683 Kb)  |  Views/Downloads: 122/33  |  Submitted: 2023/06/28
BViT: Broad Attention-Based Vision Transformer (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-12
Authors:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF (2171 Kb)  |  Views/Downloads: 172/47  |  Submitted: 2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer