CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:401/1  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
End -to -end video text detection with online tracking 期刊论文
PATTERN RECOGNITION, 2021, 卷号: 113, 页码: 12
作者:  Yu, Hongyuan;  Huang, Yan;  Pi, Lihong;  Zhang, Chengquan;  Li, Xuan;  Wang, Liang
Adobe PDF(4997Kb)  |  收藏  |  浏览/下载:362/71  |  提交时间:2021/05/06
End-to-end  Video text detection  Online tracking  
FA-GAN: Face Augmentation GAN for Deformation-Invariant Face Recognition 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 卷号: 16, 期号: 0, 页码: 2341-2355
作者:  Luo, Mandi;  Cao, Jie;  Ma, Xin;  Zhang, Xiaoyu;  He, Ran
Adobe PDF(4742Kb)  |  收藏  |  浏览/下载:377/62  |  提交时间:2021/04/21
Face recognition  Strain  Geometry  Frequency division multiplexing  Training  Task analysis  Semantics  Face augmentation  deformation-invariant face recognition  face disentanglement  graph convolutional networks  
Learning Aligned Image-Text Representations Using Graph Attentive Relational Network 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 期号: 30, 页码: 1840-1852
作者:  Jing, Ya;  Wang, Wei;  Wang, Liang;  Tan, Tieniu
Adobe PDF(4532Kb)  |  收藏  |  浏览/下载:357/60  |  提交时间:2021/03/08
Graph neural networks  Visualization  Semantics  Task analysis  Feature extraction  Annotations  Recurrent neural networks  Image-text matching  cross-modal retrieval  person search  graph neural network  
Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack learning network 期刊论文
PATTERN RECOGNITION, 2020, 卷号: 107, 期号: 107511, 页码: 12
作者:  Si, Chenyang;  Jing, Ya;  Wang, Wei;  Wang, Liang;  Tan, Tieniu
Adobe PDF(2378Kb)  |  收藏  |  浏览/下载:385/73  |  提交时间:2020/08/31
Skeleton-based action recognition  Hierarchical spatial reasoning  Temporal stack learning  Clip-based incremental loss  
Recurrent Prediction with Spatio-temporal Attention for Crowd Attribute Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2019, 卷号: 30, 期号: Early Access, 页码: 1 - 1
作者:  Li, Qiaozhe;  Zhao, Xin;  He, Ran;  Huang, Kaiqi
浏览  |  Adobe PDF(2648Kb)  |  收藏  |  浏览/下载:417/107  |  提交时间:2020/01/14
Crowd video understanding , Attribute recognition , Attention mechanism , Multi-label classification