CASIA OpenIR

浏览/检索结果: 共16条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 234-244
作者:  Wang, Jinguang;  Qian, Shengsheng;  Hu, Jun;  Hong, Richang
收藏  |  浏览/下载:6/0  |  提交时间:2024/03/26
Fake news detection  multi-modal learning  social media  
Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 1439-1451
作者:  Shen, Tianyu;  Li, Deqi;  Wang, Fei-Yue;  Huang, Hua
收藏  |  浏览/下载:25/0  |  提交时间:2023/11/17
Three-dimensional displays  Pose estimation  Feature extraction  Location awareness  Cameras  Semantics  Solid modeling  Human depth perceiving  multi-person 3d pose estimation  multi-scale representation  occlusion handling  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:306/42  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Metadata Connector: Exploiting Hashtag and Tag for Cross-OSN Event Search 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 510-523
作者:  Gao, Yuqi;  Sang, Jitao;  Fu, Chengpeng;  Wang, Zhengjia;  Ren, Tongwei;  Xu, Changsheng
收藏  |  浏览/下载:160/0  |  提交时间:2021/03/08
Twitter  Tagging  Videos  YouTube  Flickr  Games  Social media search  cross-OSN application  social multimedia  
Knowledge-Based Topic Model for Multi-Modal Social Event Analysis 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2098-2110
作者:  Xue, Feng;  Hong, Richang;  He, Xiangnan;  Wang, Jianwei;  Qian, Shengsheng;  Xu, Changsheng
收藏  |  浏览/下载:215/0  |  提交时间:2020/08/31
Analytical models  Knowledge based systems  Social networking (online)  Data mining  Data models  Internet  Knowledge engineering  Knowledge embedding  multi-modal  topic coherence  event classification  
Deep Hierarchical Encoder-Decoder Network for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 11, 页码: 2942-2956
作者:  Xiao, Xinyu;  Wang, Lingfeng;  Ding, Kun;  Xiang, Shiming;  Pan, Chunhong
收藏  |  浏览/下载:295/0  |  提交时间:2020/03/30
Visualization  Semantics  Hidden Markov models  Decoding  Logic gates  Training  Computer architecture  Deep hierarchical structure  encoder-decoder  LSTM  image captioning  retrieval  vision-sentence  
Weakly Semantic Guided Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 10, 页码: 2504-2517
作者:  Yu, Tingzhao;  Wang, Lingfeng;  Da, Cheng;  Gu, Huxiang;  Xiang, Shiming;  Pan, Chunhong
浏览  |  Adobe PDF(18774Kb)  |  收藏  |  浏览/下载:418/108  |  提交时间:2019/05/15
Semantic guided module  action recognition  cross domain  3D convolution  attention model  
Iterative Manifold Embedding Layer Learned by Incomplete Data for Large-Scale Image Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 卷号: 21, 期号: 6, 页码: 1551-1562
作者:  Xu, Jian;  Wang, Chunheng;  Qi, Chengzuo;  Shi, Cunzhao;  Xiao, Baihua
浏览  |  Adobe PDF(4727Kb)  |  收藏  |  浏览/下载:334/75  |  提交时间:2019/07/11
Iterative manifold embedding layer  image retrieval  incomplete data  
Being a Supercook: Joint Food Attributes and Multimodal Content Modeling for Recipe Retrieval and Exploration 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 卷号: 19, 期号: 5, 页码: 1100-1113
作者:  Min, Weiqing;  Jiang, Shuqiang;  Sang, Jitao;  Wang, Huayang;  Liu, Xinda;  Herranz, Luis
收藏  |  浏览/下载:155/0  |  提交时间:2017/09/12
Cuisine Classification  Recipe Image Retrieval  Ingredient Inference  Multitask Deep Belief Network  
Cross-Modal Retrieval via Deep and Bidirectional Representation Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 卷号: 18, 期号: 7, 页码: 1363-1377
作者:  He, Yonghao;  Xiang, Shiming;  Kang, Cuicui;  Wang, Jian;  Pan, Chunhong;  Xiang,Shiming
Adobe PDF(11388Kb)  |  收藏  |  浏览/下载:480/128  |  提交时间:2016/06/22
Bidirectional Modeling  Convolutional Neural Network  Cross-modal Retrieval  Representation Learning  Word Embedding