CASIA OpenIR

浏览/检索结果: 共40条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Calibration & Reconstruction: Deep Integrated Language for Referring Image Segmentation 会议论文
Proceedings of the 2024 International Conference on Multimedia Retrieval, Phuket, Thailand, 2024/03/08
作者:  Yichen Yan;  Xingjian He;  Sihan Chen;  Jing Liu
Adobe PDF(2868Kb)  |  收藏  |  浏览/下载:24/9  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image Segmentation 会议论文
, Tianjin, China, 2024/08/05
作者:  Yichen Yan;  Xingjian He;  Sihan Chen;  Shichen Lu;  Jing Liu
Adobe PDF(1978Kb)  |  收藏  |  浏览/下载:23/9  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Exploiting Curriculum Learning in Unsupervised Neural Machine Translation 会议论文
, Online, November 7–11, 2021
作者:  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(866Kb)  |  收藏  |  浏览/下载:64/23  |  提交时间:2024/06/13
Select the Best Translation from Different Systems Without Reference 会议论文
, 中国,敦煌, 2019-9
作者:  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(582Kb)  |  收藏  |  浏览/下载:60/20  |  提交时间:2024/06/13
TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training 会议论文
, New Orleans Convention Center ,America, 2022-11-28至 2022-12-9
作者:  Yulong Liu;  Guibo Zhu;  Bin Zhu;  Qi Song;  Guojing Ge;  Haoran Chen;  Guanhui Qiao;  Ru Peng;  Lingxiang Wu;  Jinqiao Wang
Adobe PDF(2408Kb)  |  收藏  |  浏览/下载:50/10  |  提交时间:2024/06/06
Mst: Masked self-supervised transformer for visual representation 会议论文
, 北京(虚拟会议), 2021
作者:  Li, Zhaowen;  Chen, Zhiyang;  Yang, Fan;  Li, Wei;  Zhu, Yousong;  Zhao, Chaoyang;  Zhao, Rui;  Deng, Rui;  Tang, Ming;  Wang, Jinqiao
Adobe PDF(1021Kb)  |  收藏  |  浏览/下载:60/17  |  提交时间:2024/05/30
The Devil is in Details: Delving Into Lite FFN Design for Vision Transformers 会议论文
, Seoul, Korea, 2024-4-14
作者:  Chen, Zhiyang;  Zhu, Yousong;  Li, Zhaowen;  Yang, Fan;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Adobe PDF(407Kb)  |  收藏  |  浏览/下载:61/17  |  提交时间:2024/05/28
Vision Transformer  Light-Weight Structure  Feed-Forward Networks  
WL-MSR: Watch and Listen for Multimodal Subtitle Recognition 会议论文
, Greece, 2023-6-4
作者:  Liu, Jiawei;  Wang, Hao;  Wang, Weining;  He, Xingjian;  Liu, Jing
Adobe PDF(1673Kb)  |  收藏  |  浏览/下载:183/41  |  提交时间:2023/07/06
Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering 会议论文
, 线上, 2020-10
作者:  Liu, Fei;  Liu, Jing;  Zhu, Xinxin;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2797Kb)  |  收藏  |  浏览/下载:372/188  |  提交时间:2022/06/15
Densely Connected Attention Flow for Visual Question Answering 会议论文
, 中国澳门, 2019-8
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang
Adobe PDF(681Kb)  |  收藏  |  浏览/下载:170/69  |  提交时间:2022/06/14