CASIA OpenIR

Browse/Search Results: 27 items in total, showing items 1-10

Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2986-2997
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(5681Kb)  |  Views/Downloads: 401/1  |  Submitted: 2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability  
Instance GNN: A Learning Framework for Joint Symbol Segmentation and Recognition in Online Handwritten Diagrams (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2580-2594
Authors:  Yun, Xiao-Long;  Zhang, Yan-Ming;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(3236Kb)  |  Views/Downloads: 313/3  |  Submitted: 2022/07/25
Handwriting recognition  Task analysis  Grammar  Semantics  Image segmentation  Trajectory  Text recognition  Online handwritten diagram recognition  symbol segmentation  symbol recognition  freehand sketch analysis  graph neural networks  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3518-3529
Authors:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  Views/Downloads: 330/73  |  Submitted: 2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, Volume: 22, Issue: 8, Pages: 2149-2162
Authors:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  Views/Downloads: 232/36  |  Submitted: 2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Effective Image Retrieval via Multilinear Multi-Index Fusion (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 11, Pages: 2878-2890
Authors:  Zhang, Zhizhong;  Xie, Yuan;  Zhang, Wensheng;  Tian, Qi
Adobe PDF(1024Kb)  |  Views/Downloads: 440/106  |  Submitted: 2020/03/30
Visualization  Image representation  Optimization  Buildings  Indexing  Image retrieval  multi-index fusion  tensor multi-rank  person re-identification  
Deep Multi-Modality Adversarial Networks for Unsupervised Domain Adaptation (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 9, Pages: 2419-2431
Authors:  Ma, Xinhong;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2142Kb)  |  Views/Downloads: 374/48  |  Submitted: 2019/12/16
Unsupervised domain adaptation  triplet loss  stacked attention  multi-modality  social event recognition  
Enhancing Image Watermarking With Adaptive Embedding Parameter and PSNR Guarantee (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 10, Pages: 2447-2460
Authors:  Huang, Ying;  Niu, Baoning;  Guan, Hu;  Zhang, Shuwu
Adobe PDF(2733Kb)  |  Views/Downloads: 413/92  |  Submitted: 2019/12/16
Adaptive watermarking  differential quantization  image watermarking  spread spectrum  
Weakly Semantic Guided Action Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 10, Pages: 2504-2517
Authors:  Yu, Tingzhao;  Wang, Lingfeng;  Da, Cheng;  Gu, Huxiang;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(18774Kb)  |  Views/Downloads: 459/114  |  Submitted: 2019/05/15
Semantic guided module  action recognition  cross domain  3D convolution  attention model  
EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 5, Pages: 1038-1050
Authors:  Zhang, Yifan;  Cao, Congqi;  Cheng, Jian;  Lu, Hanqing
Adobe PDF(1260Kb)  |  Views/Downloads: 967/366  |  Submitted: 2018/05/05
Benchmark  Dataset  Egocentric Vision  Gesture Recognition  First-person View  
Text2Video: An End-to-end Learning Framework for Expressing Text With Videos (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, Volume: 20, Issue: 9, Pages: 2360-2370
Authors:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(2281Kb)  |  Views/Downloads: 539/147  |  Submitted: 2018/02/07
Multimedia Storytelling  Video Analysis  Deep Learning