CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:415/4  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:376/75  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
SSAP: Single-Shot Instance Segmentation With Affinity Pyramid 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 卷号: 31, 期号: 2, 页码: 661-673
作者:  Gao, Naiyu;  Shan, Yanhu;  Wang, Yupei;  Zhao, Xin;  Huang, Kaiqi
Adobe PDF(4190Kb)  |  收藏  |  浏览/下载:322/53  |  提交时间:2021/03/29
Instance segmentation  panoptic segmentation  pixel-pair affinity  graph partition  
Skeleton-based action recognition with hierarchical spatial reasoning and temporal stack learning network 期刊论文
PATTERN RECOGNITION, 2020, 卷号: 107, 期号: 107511, 页码: 12
作者:  Si, Chenyang;  Jing, Ya;  Wang, Wei;  Wang, Liang;  Tan, Tieniu
Adobe PDF(2378Kb)  |  收藏  |  浏览/下载:389/74  |  提交时间:2020/08/31
Skeleton-based action recognition  Hierarchical spatial reasoning  Temporal stack learning  Clip-based incremental loss  
Long video question answering: A Matching-guided Attention Model 期刊论文
PATTERN RECOGNITION, 2020, 卷号: 102, 期号: 1, 页码: 11
作者:  Wang, Weining;  Huang, Yan;  Wang, Liang
浏览  |  Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:396/80  |  提交时间:2020/06/02
Long video QA  Matching-guided attention  
Recurrent Prediction with Spatio-temporal Attention for Crowd Attribute Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2019, 卷号: 30, 期号: Early Access, 页码: 1 - 1
作者:  Li, Qiaozhe;  Zhao, Xin;  He, Ran;  Huang, Kaiqi
浏览  |  Adobe PDF(2648Kb)  |  收藏  |  浏览/下载:422/107  |  提交时间:2020/01/14
Crowd video understanding , Attribute recognition , Attention mechanism , Multi-label classification