CASIA OpenIR

Browse/Search results: 18 items in total, showing items 1-10

Multi-modal spatial relational attention networks for visual question answering (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 13
Authors: Yao, Haibo; Wang, Lipeng; Cai, Chengtao; Sun, Yuxin; Zhang, Zhi; Luo, Yongkang
Views/Downloads: 49/0 | Submitted: 2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre-training strategy
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 12
Authors: Zheng, Wenbo; Yan, Lan; Wang, Fei-Yue
Views/Downloads: 75/0 | Submitted: 2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
SLAN: Similarity-aware aggregation network for embedding out-of-knowledge-graph entities (Journal article)
NEUROCOMPUTING, 2022, Volume: 491, Pages: 186-196
Authors: Li, Mingda; Sun, Zhengya; Zhang, Wensheng
Adobe PDF (931 KB) | Views/Downloads: 295/49 | Submitted: 2022/06/10
Knowledge graph embedding  Out-of-knowledge-graph entities  Knowledge graph completion  Similarity search  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2273-2286
Authors: Huang, Yi; Yang, Xiaoshan; Gao, Junyun; Xu, Changsheng
Adobe PDF (2409 KB) | Views/Downloads: 310/62 | Submitted: 2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 0
Authors: Liu, Fei; Liu, Jing; Hong, Richang; Lu, Hanqing
Adobe PDF (3550 KB) | Views/Downloads: 307/75 | Submitted: 2022/01/27
video question answering  attention mechanism  metric learning  
KM4: Visual reasoning via Knowledge Embedding Memory Model with Mutual Modulation (Journal article)
INFORMATION FUSION, 2021, Volume: 67, Pages: 14-28
Authors: Zheng, Wenbo; Yan, Lan; Gou, Chao; Wang, Fei-Yue
Views/Downloads: 274/0 | Submitted: 2021/03/01
Visual reasoning  Knowledge-based representation learning  Memory network  Knowledge embedding  
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, Issue: 30, Pages: 1840-1852
Authors: Jing, Ya; Wang, Wei; Wang, Liang; Tan, Tieniu
Adobe PDF (4532 KB) | Views/Downloads: 312/50 | Submitted: 2021/03/08
Graph neural networks  Visualization  Semantics  Task analysis  Feature extraction  Annotations  Recurrent neural networks  Image-text matching  cross-modal retrieval  person search  graph neural network  
Extracting Effective Image Attributes with Refined Universal Detection (Journal article)
SENSORS, 2021, Volume: 21, Issue: 1, Pages: 16
Authors: Yu, Qiang; Xiao, Xinyu; Zhang, Chunxia; Song, Lifei; Pan, Chunhong
Adobe PDF (2391 KB) | Views/Downloads: 298/53 | Submitted: 2021/03/01
attribute extraction  Refined Universal Detection  word tree  image captioning  
Unsupervised Video Summarization via Relation-Aware Assignment Learning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3203-3214
Authors: Gao, Junyu; Yang, Xiaoshan; Zhang, Yingying; Xu, Changsheng
Adobe PDF (3649 KB) | Views/Downloads: 295/60 | Submitted: 2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 2386-2397
Authors: Wang, Wei; Gao, Junyu; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (2165 KB) | Views/Downloads: 309/43 | Submitted: 2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy