CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:  Wang, Wenxuan;  He, Xingjian;  Zhang, Yisi;  Guo, Longteng;  Shen, Jiachen;  Li, Jiangyun;  Liu, Jing
收藏  |  浏览/下载:10/0  |  提交时间:2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:352/83  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:250/41  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Multiview Label Sharing for Visual Representations and Classifications 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 4, 页码: 903-913
作者:  Zhang, Chunjie;  Cheng, Jian;  Tian, Qi
Adobe PDF(615Kb)  |  收藏  |  浏览/下载:394/110  |  提交时间:2018/10/10
Multi-view Learning  Linear Transformation  Shared Space  Image Representation  Visual Classification  
EgoGesture: A New Dataset and Benchmark for Egocentric Hand Gesture Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 5, 页码: 1038-1050
作者:  Zhang, Yifan;  Cao, Congqi;  Cheng, Jian;  Lu, Hanqing
浏览  |  Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:996/373  |  提交时间:2018/05/05
Benchmark  Dataset  Egocentric Vision  Gesture Recognition  First-person View