CASIA OpenIR

浏览/检索结果: 共17条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2986-2997
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  收藏  |  浏览/下载:416/4  |  提交时间:2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:347/69  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:356/47  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 9, 页码: 2419-2431
作者:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2142Kb)  |  收藏  |  浏览/下载:380/49  |  提交时间:2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition  
Online Multimodal Multiexpert Learning for Social Event Tracking 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 10, 页码: 2733-2748
作者:  Shengsheng Qian;  Tianzhu Zhang;  Changsheng Xu
浏览  |  Adobe PDF(4742Kb)  |  收藏  |  浏览/下载:349/100  |  提交时间:2019/09/25
Social Event Tracking  Topic Model  Social Media  Topic Evolution  Multimodality  
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 9, 页码: 2360-2370
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:544/148  |  提交时间:2018/02/07
Multimedia Storytelling  Video Analysis  Deep Learning  
Label Distribution-Based Facial Attractiveness Computation by Deep Residual Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 8, 页码: 2196-2208
作者:  Fan, Yang-Yu;  Liu, Shu;  Li, Bo;  Guo, Zhe;  Samal, Ashok;  Wan, Jun;  Li, Stan Z.
浏览  |  Adobe PDF(1377Kb)  |  收藏  |  浏览/下载:387/78  |  提交时间:2018/01/04
Facial attractiveness computation  deep residual network  label distribution  feature fusion  SCUT-FBP  
Cross-Modal Hashing via Rank-Order Preserving 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 卷号: 19, 期号: 3, 页码: 571-585
作者:  Kun Ding;  Bin Fan;  Chunlei Huo;  Shiming Xiang;  Chunhong Pan;  Huo CL(霍春雷)
Adobe PDF(1451Kb)  |  收藏  |  浏览/下载:861/336  |  提交时间:2016/10/24
Cross-modal Similarity Search  Cross-modal Hashing (Cmh)  Rank-order Preserving  
Multimodal Web Aesthetics Assessment Based on Structural SVM and Multitask Fusion Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 6, 页码: 1062-1076
作者:  Wu, Ou;  Zuo, Haiqiang;  Hu, Weiming;  Li, Bing;  Ou Wu
浏览  |  Adobe PDF(757Kb)  |  收藏  |  浏览/下载:544/184  |  提交时间:2016/10/20
Aesthetic Features  Fusion  Local Features  Multitask Learning  Visual Aesthetics  Web Pages  
Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 8, 页码: 1616-1627
作者:  Ding, Xinmiao;  Li, Bing;  Xiong, Weihua;  Guo, Wen;  Hu, Weiming;  Wang, Bo
Adobe PDF(549Kb)  |  收藏  |  浏览/下载:472/158  |  提交时间:2016/10/20
Image Annotation  Instance Context  Label Context  Multi-instance  Multi-label