CASIA OpenIR

浏览/检索结果: 共20条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:338/65  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Single-Image Specular Highlight Removal via Real-World Dataset Construction 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 页码: 12
作者:  Wu ZQ(吴仲琦);  Zhuang CQ(庄传青);  Shi J(石剑);  Guo JW(郭建伟);  Xiao J(肖俊);  Zhang XP(张晓鹏);  Yan DM(严冬明)
Adobe PDF(49307Kb)  |  收藏  |  浏览/下载:247/95  |  提交时间:2022/06/15
Specular highlight removal, PSD-Dataset, Deep learning  
Exploring the Representativity of Art Paintings 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2794-2805
作者:  Deng, Yingying;  Tang, Fan;  Dong, Weiming;  Ma, Chongyang;  Huang, Feiyue;  Deussen, Oliver;  Xu, Changsheng
Adobe PDF(5313Kb)  |  收藏  |  浏览/下载:257/39  |  提交时间:2021/11/03
Painting  Art  Image color analysis  Feature extraction  Task analysis  Engineering profession  Electronic mail  Representativity  style enhancement  feature representation  artwork evaluation  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:328/45  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Online Multimodal Multiexpert Learning for Social Event Tracking 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 10, 页码: 2733-2748
作者:  Shengsheng Qian;  Tianzhu Zhang;  Changsheng Xu
浏览  |  Adobe PDF(4742Kb)  |  收藏  |  浏览/下载:319/95  |  提交时间:2019/09/25
Social Event Tracking  Topic Model  Social Media  Topic Evolution  Multimodality  
EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 5, 页码: 1038-1050
作者:  Zhang, Yifan;  Cao, Congqi;  Cheng, Jian;  Lu, Hanqing
浏览  |  Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:932/359  |  提交时间:2018/05/05
Benchmark  Dataset  Egocentric Vision  Gesture Recognition  First-person View  
Label Distribution-Based Facial Attractiveness Computation by Deep Residual Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 8, 页码: 2196-2208
作者:  Fan, Yang-Yu;  Liu, Shu;  Li, Bo;  Guo, Zhe;  Samal, Ashok;  Wan, Jun;  Li, Stan Z.
浏览  |  Adobe PDF(1377Kb)  |  收藏  |  浏览/下载:361/72  |  提交时间:2018/01/04
Facial attractiveness computation  deep residual network  label distribution  feature fusion  SCUT-FBP  
Multimodal Web Aesthetics Assessment Based on Structural SVM and Multitask Fusion Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 6, 页码: 1062-1076
作者:  Wu, Ou;  Zuo, Haiqiang;  Hu, Weiming;  Li, Bing;  Ou Wu
浏览  |  Adobe PDF(757Kb)  |  收藏  |  浏览/下载:516/175  |  提交时间:2016/10/20
Aesthetic Features  Fusion  Local Features  Multitask Learning  Visual Aesthetics  Web Pages  
Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 8, 页码: 1616-1627
作者:  Ding, Xinmiao;  Li, Bing;  Xiong, Weihua;  Guo, Wen;  Hu, Weiming;  Wang, Bo
Adobe PDF(549Kb)  |  收藏  |  浏览/下载:444/148  |  提交时间:2016/10/20
Image Annotation  Instance Context  Label Context  Multi-instance  Multi-label  
Cross-Modal Retrieval via Deep and Bidirectional Representation Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 7, 页码: 1363-1377
作者:  He, Yonghao;  Xiang, Shiming;  Kang, Cuicui;  Wang, Jian;  Pan, Chunhong;  Xiang,Shiming
浏览  |  Adobe PDF(11388Kb)  |  收藏  |  浏览/下载:492/133  |  提交时间:2016/06/22
Bidirectional Modeling  Convolutional Neural Network  Cross-modal Retrieval  Representation Learning  Word Embedding