CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Generalized zero-shot emotion recognition from body gestures 期刊论文
APPLIED INTELLIGENCE, 2021, 页码: 19
作者:  Wu, Jinting;  Zhang, Yujia;  Sun, Shiying;  Li, Qianzhong;  Zhao, Xiaoguang
Adobe PDF(2059Kb)  |  收藏  |  浏览/下载:299/61  |  提交时间:2021/12/28
Generalized zero-shot learning  Emotion recognition  Body gesture recognition  Prototype learning  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:280/35  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Learning to Model Relationships for Zero-Shot Video Classification 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 卷号: 43, 期号: 10, 页码: 3476-3491
作者:  Gao, Junyu;  Zhang, Tianzhu;  Xu, Changsheng
收藏  |  浏览/下载:249/0  |  提交时间:2021/11/04
Zero-shot video classification  graph neural networks  zero-shot learning  deep attention model  
Heterogeneous Relational Graph Neural Networks with Adaptive Objective for End-to-End Task-Oriented Dialogue 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2021, 卷号: 227, 期号: 2021, 页码: 107186
作者:  Liu, Qingbin;  Bai, Guirong;  He, Shizhu;  Liu, Cao;  Liu, Kang;  Zhao, Jun
Adobe PDF(1021Kb)  |  收藏  |  浏览/下载:300/61  |  提交时间:2021/11/02
End-to-end task-oriented dialogue  Heterogeneous relational graph neural networks  Shared-private parameterization  Hierarchical attention mechanism  Adaptive objective  
DATA: Differentiable ArchiTecture Approximation With Distribution Guided Sampling 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 卷号: 43, 期号: 9, 页码: 2905-2920
作者:  Zhang, Xinbang;  Chang, Jianlong;  Guo, Yiwen;  Meng, Gaofeng;  Xiang, Shiming;  Lin, Zhouchen;  Pan, Chunhong
Adobe PDF(1346Kb)  |  收藏  |  浏览/下载:304/50  |  提交时间:2021/11/02
Computer architecture  Search problems  Optimization  Task analysis  Bridges  Binary codes  Estimation  Neural architecture search(NAS)  ensemble gumbel-softmax  distribution guided sampling  
Question-Guided Erasing-Based Spatiotemporal Attention Learning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 0
作者:  Liu, Fei;  Liu, Jing;  Hong, Richang;  Lu, Hanqing
Adobe PDF(3550Kb)  |  收藏  |  浏览/下载:318/78  |  提交时间:2022/01/27
video question answering  attention mechanism  metric learning  
Rethinking semantic-visual alignment in zero-shot object detection via a softplus margin focal loss 期刊论文
Neurocomputing, 2021, 卷号: 449, 页码: 117-135
作者:  Li, Qianzhong;  Zhang, Yujia;  Sun, Shiying;  Zhao, Xiaoguang;  Li, Kang;  Tan, Min
Adobe PDF(8753Kb)  |  收藏  |  浏览/下载:270/34  |  提交时间:2021/08/15
Zero-shot object detection  Softplus margin focal loss  Semantic-visual alignment  Auto-encoder architecture  
End -to -end video text detection with online tracking 期刊论文
PATTERN RECOGNITION, 2021, 卷号: 113, 页码: 12
作者:  Yu, Hongyuan;  Huang, Yan;  Pi, Lihong;  Zhang, Chengquan;  Li, Xuan;  Wang, Liang
Adobe PDF(4997Kb)  |  收藏  |  浏览/下载:312/57  |  提交时间:2021/05/06
End-to-end  Video text detection  Online tracking  
Robot learning through observation via coarse-to-fine grained video summarization 期刊论文
APPLIED SOFT COMPUTING, 2021, 卷号: 99, 期号: /, 页码: 106913
作者:  Zhang, Yujia;  Li, Qianzhong;  Zhao, Xiaoguang;  Tan, Min
Adobe PDF(5989Kb)  |  收藏  |  浏览/下载:346/71  |  提交时间:2021/03/08
Robotic vision  Learning through observation  Coarse-to-fine video summarization  
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 期号: 30, 页码: 1840-1852
作者:  Jing, Ya;  Wang, Wei;  Wang, Liang;  Tan, Tieniu
Adobe PDF(4532Kb)  |  收藏  |  浏览/下载:317/51  |  提交时间:2021/03/08
Graph neural networks  Visualization  Semantics  Task analysis  Feature extraction  Annotations  Recurrent neural networks  Image-text matching  cross-modal retrieval  person search  graph neural network