CASIA OpenIR

Browse/search results: 27 records in total, showing records 1-10

Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2986-2997
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  Views/Downloads: 407/3  |  Submitted: 2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2273-2286
Authors:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  Views/Downloads: 369/74  |  Submitted: 2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Instance GNN: A Learning Framework for Joint Symbol Segmentation and Recognition in Online Handwritten Diagrams  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2580-2594
Authors:  Yun, Xiao-Long;  Zhang, Yan-Ming;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(3236Kb)  |  Views/Downloads: 316/3  |  Submitted: 2022/07/25
Handwriting recognition  Task analysis  Grammar  Semantics  Image segmentation  Trajectory  Text recognition  Online handwritten diagram recognition  symbol segmentation  symbol recognition  freehand sketch analysis  graph neural networks  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 8, Pages: 2149-2162
Authors:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  Views/Downloads: 233/36  |  Submitted: 2020/08/31
Image captioning  Multi-pass decoding  Rumination  
WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 2, Pages: 380-393
Authors:  Zhang, Shifeng;  Xie, Yiliang;  Wan, Jun;  Xia, Hansheng;  Li, Stan Z.;  Guo, Guodong
Adobe PDF(6651Kb)  |  Views/Downloads: 354/57  |  Submitted: 2020/04/07
Benchmark testing  Detectors  Training  Urban areas  Cameras  Task analysis  Deep learning  Pedestrian detection  dataset  rich diversity  high density  
Enhancing Image Watermarking With Adaptive Embedding Parameter and PSNR Guarantee  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 10, Pages: 2447-2460
Authors:  Huang, Ying;  Niu, Baoning;  Guan, Hu;  Zhang, Shuwu
Adobe PDF(2733Kb)  |  Views/Downloads: 414/92  |  Submitted: 2019/12/16
Adaptive watermarking  differential quantization  image watermarking  spread spectrum  
Weakly Semantic Guided Action Recognition  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 10, Pages: 2504-2517
Authors:  Yu, Tingzhao;  Wang, Lingfeng;  Da, Cheng;  Gu, Huxiang;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(18774Kb)  |  Views/Downloads: 460/114  |  Submitted: 2019/05/15
Semantic guided module  action recognition  cross domain  3D convolution  attention model  
Multiview Label Sharing for Visual Representations and Classifications  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 4, Pages: 903-913
Authors:  Zhang, Chunjie;  Cheng, Jian;  Tian, Qi
Adobe PDF(615Kb)  |  Views/Downloads: 381/107  |  Submitted: 2018/10/10
Multi-view Learning  Linear Transformation  Shared Space  Image Representation  Visual Classification  
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 9, Pages: 2360-2370
Authors:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2281Kb)  |  Views/Downloads: 540/147  |  Submitted: 2018/02/07
Multimedia Storytelling  Video Analysis  Deep Learning  
Label Distribution-Based Facial Attractiveness Computation by Deep Residual Learning  |  Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 8, Pages: 2196-2208
Authors:  Fan, Yang-Yu;  Liu, Shu;  Li, Bo;  Guo, Zhe;  Samal, Ashok;  Wan, Jun;  Li, Stan Z.
Adobe PDF(1377Kb)  |  Views/Downloads: 385/77  |  Submitted: 2018/01/04
Facial attractiveness computation  deep residual network  label distribution  feature fusion  SCUT-FBP