CASIA OpenIR

Browse/Search Results: 17 items in total, showing items 1-10

Visual Question Answering With Dense Inter- and Intra-Modality Interactions (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, vol. 23, pp. 3518-3529
Authors: Liu, Fei; Liu, Jing; Fang, Zhiwei; Hong, Richang; Lu, Hanqing
Adobe PDF (2891Kb) | Views/Downloads: 273/58 | Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Unsupervised Video Summarization via Relation-Aware Assignment Learning (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, vol. 23, pp. 3203-3214
Authors: Gao, Junyu; Yang, Xiaoshan; Zhang, Yingying; Xu, Changsheng
Adobe PDF (3649Kb) | Views/Downloads: 303/60 | Submitted: 2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, vol. 23, pp. 2386-2397
Authors: Wang, Wei; Gao, Junyu; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (2165Kb) | Views/Downloads: 316/43 | Submitted: 2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Weakly Semantic Guided Action Recognition (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, vol. 21, no. 10, pp. 2504-2517
Authors: Yu, Tingzhao; Wang, Lingfeng; Da, Cheng; Gu, Huxiang; Xiang, Shiming; Pan, Chunhong
Adobe PDF (18774Kb) | Views/Downloads: 424/108 | Submitted: 2019/05/15
Semantic guided module  action recognition  cross domain  3D convolution  attention model  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, vol. 21, no. 9, pp. 2419-2431
Authors: Ma, Xinhong; Zhang, Tianzhu; Xu, Changsheng
Adobe PDF (2142Kb) | Views/Downloads: 332/41 | Submitted: 2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition  
Three-Dimensional Attention-Based Deep Ranking Model for Video Highlight Detection (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, vol. 20, no. 10, pp. 2693-2705
Authors: Jiao, Yifan; Li, Zhetao; Huang, Shucheng; Yang, Xiaoshan; Liu, Bin; Zhang, Tianzhu
Adobe PDF (4692Kb) | Views/Downloads: 512/183 | Submitted: 2018/10/10
Video Highlight Detection  Attention Model  Deep Ranking  
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, vol. 20, no. 9, pp. 2360-2370
Authors: Yang, Xiaoshan; Zhang, Tianzhu; Xu, Changsheng
Adobe PDF (2281Kb) | Views/Downloads: 502/138 | Submitted: 2018/02/07
Multimedia Storytelling  Video Analysis  Deep Learning  
EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, vol. 20, no. 5, pp. 1038-1050
Authors: Zhang, Yifan; Cao, Congqi; Cheng, Jian; Lu, Hanqing
Adobe PDF (1260Kb) | Views/Downloads: 921/355 | Submitted: 2018/05/05
Benchmark  Dataset  Egocentric Vision  Gesture Recognition  First-person View  
Online Multimodal Multiexpert Learning for Social Event Tracking (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, vol. 20, no. 10, pp. 2733-2748
Authors: Shengsheng Qian; Tianzhu Zhang; Changsheng Xu
Adobe PDF (4742Kb) | Views/Downloads: 311/92 | Submitted: 2019/09/25
Social Event Tracking  Topic Model  Social Media  Topic Evolution  Multimodality  
Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, vol. 18, no. 8, pp. 1616-1627
Authors: Ding, Xinmiao; Li, Bing; Xiong, Weihua; Guo, Wen; Hu, Weiming; Wang, Bo
Adobe PDF (549Kb) | Views/Downloads: 433/143 | Submitted: 2016/10/20
Image Annotation  Instance Context  Label Context  Multi-instance  Multi-label