CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:118/27  |  提交时间:2023/06/21
Human Parsing With Part-Aware Relation Modeling 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 25, 页码: 2601-2612
作者:  Zhang, Xiaomei;  Chen, Yingying;  Tang, Ming;  Wang, Jinqiao;  Zhu, Xiangyu;  Lei, Zhen
Adobe PDF(6053Kb)  |  收藏  |  浏览/下载:93/4  |  提交时间:2023/11/17
Human parsing  modeling  part-aware relation  
Generalized zero-shot emotion recognition from body gestures 期刊论文
APPLIED INTELLIGENCE, 2021, 页码: 19
作者:  Wu, Jinting;  Zhang, Yujia;  Sun, Shiying;  Li, Qianzhong;  Zhao, Xiaoguang
Adobe PDF(2059Kb)  |  收藏  |  浏览/下载:264/55  |  提交时间:2021/12/28
Generalized zero-shot learning  Emotion recognition  Body gesture recognition  Prototype learning  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:247/32  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:290/74  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering 会议论文
, 线上, 2021-10
作者:  Liu, Fei;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(1174Kb)  |  收藏  |  浏览/下载:167/34  |  提交时间:2022/06/15
Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering 会议论文
, 线上, 2020-10
作者:  Liu, Fei;  Liu, Jing;  Zhu, Xinxin;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2797Kb)  |  收藏  |  浏览/下载:306/165  |  提交时间:2022/06/15
Long video question answering: A Matching-guided Attention Model 期刊论文
PATTERN RECOGNITION, 2020, 卷号: 102, 期号: 1, 页码: 11
作者:  Wang, Weining;  Huang, Yan;  Wang, Liang
浏览  |  Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:336/66  |  提交时间:2020/06/02
Long video QA  Matching-guided attention  
Improving short-text representation in convolutional networks by dependency parsing 期刊论文
KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 卷号: 61, 期号: 1, 页码: 463-484
作者:  Zhang, Siheng;  Zhang, Wensheng;  Niu, Jinghao
浏览  |  Adobe PDF(1426Kb)  |  收藏  |  浏览/下载:264/71  |  提交时间:2019/12/16
Convolutional neural network  Dependency parsing  Question answering system  Question classification  Semantic equivalence  
Blind image quality assessment via learnable attention-based pooling 期刊论文
PATTERN RECOGNITION, 2019, 卷号: 91, 页码: 332-344
作者:  Gu, Jie;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
浏览  |  Adobe PDF(3081Kb)  |  收藏  |  浏览/下载:479/182  |  提交时间:2019/05/15
Image quality assessment  Perceptual image quality  Visual attention  Convolutional neural network  Learnable pooling