CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:415/4  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 9, 页码: 2207-2220
作者:  Guyue, Hu;  Bo, Cui;  Shan, Yu
Adobe PDF(4803Kb)  |  收藏  |  浏览/下载:335/67  |  提交时间:2020/09/28
Skeleton-based Action Recognition  Frequency Attention  Synchronous Local and Non-local Learning  Soft-margin Focal Loss  Pesudo Multi-task Learning  
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 9, 页码: 2360-2370
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:544/148  |  提交时间:2018/02/07
Multimedia Storytelling  Video Analysis  Deep Learning  
Cross-Modal Retrieval via Deep and Bidirectional Representation Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 7, 页码: 1363-1377
作者:  He, Yonghao;  Xiang, Shiming;  Kang, Cuicui;  Wang, Jian;  Pan, Chunhong;  Xiang,Shiming
浏览  |  Adobe PDF(11388Kb)  |  收藏  |  浏览/下载:520/145  |  提交时间:2016/06/22
Bidirectional Modeling  Convolutional Neural Network  Cross-modal Retrieval  Representation Learning  Word Embedding