CASIA OpenIR

Browse/Search results: 47 items in total, showing items 1-10

CLIP-Driven hierarchical fusion for referring image segmentation (Conference paper)
Kunming, China, 2024/03/08
Authors:  Yichen Yan;  Xingjian He;  Jing Liu
Adobe PDF (5233Kb)  |  Views/Downloads: 54/12  |  Submitted: 2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? (Conference paper)
Seattle, USA, 2024-6
Authors:  chen yuxin;  ma zongyang;  zhang ziqi;  qi zhongang;  yuan chunfeng;  li bing;  pu junfu;  shan ying;  qi xiaojuan;  hu weiming
Adobe PDF (1070Kb)  |  Views/Downloads: 53/13  |  Submitted: 2024/06/25
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval (Conference paper)
Vancouver, Canada, 2023-6
Authors:  chen yuxin;  ma zongyang;  zhang ziqi;  qi zhongang;  yuan chunfeng;  shan ying;  li bing;  hu weiming;  qie xiaohu;  wu jianping
Adobe PDF (1379Kb)  |  Views/Downloads: 38/10  |  Submitted: 2024/06/25
Generating Relevant Article Comments via Variational Multi-Layer Fusion (Conference paper)
Yokohama, Japan, 2024-7
Authors:  Zou HY(邹瀚仪);  Xu HF(徐会芳);  Kong QC(孔庆超);  Cao YL(曹艺琳);  Mao WJ(毛文吉)
Adobe PDF (354Kb)  |  Views/Downloads: 45/15  |  Submitted: 2024/06/24
article comment generation  variational auto-encoder  relevant information extraction  multi-layer fusion  
Controllable News Comment Generation based on Attribute Level Contrastive Learning (Conference paper)
Charlotte, NC, USA, 2023-10
Authors:  Zou HY(邹瀚仪);  Xu N(徐楠);  Kong QC(孔庆超);  Mao WJ(毛文吉)
Adobe PDF (317Kb)  |  Views/Downloads: 39/12  |  Submitted: 2024/06/24
controllable text generation  news comment generation  attribute level constraint  contrastive learning  
Learning to Understand Traffic Signs (Conference paper)
Chengdu, Sichuan, October 20-24, 2021
Authors:  Guo, Yunfei;  Feng, Wei;  Yin, Fei;  Xue, Tao;  Mei, Shuqi;  Liu, Cheng-Lin
Adobe PDF (3271Kb)  |  Views/Downloads: 59/24  |  Submitted: 2024/06/13
traffic sign understanding  semantic description  multi-task learning  
Regularizing Vector Embedding in Bottom-Up Human Pose Estimation (Conference paper)
Tel Aviv, Israel, October 23-27, 2022
Authors:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Tang Ming;  Wang Jinqiao
Adobe PDF (1793Kb)  |  Views/Downloads: 44/19  |  Submitted: 2024/06/04
UniGen: Unified Generative Pre-training for Multilingual Multimodal Representation (Conference paper)
Waseda University, Tokyo, Japan, 2024.03.15-2024.03.18
Authors:  Zheyuan, Tian;  Guan, Luo;  Bo, Wang;  Bing, Li;  Weiming, Hu
Adobe PDF (975Kb)  |  Views/Downloads: 75/19  |  Submitted: 2024/05/31
MST: Masked self-supervised transformer for visual representation (Conference paper)
Beijing (virtual conference), 2021
Authors:  Li, Zhaowen;  Chen, Zhiyang;  Yang, Fan;  Li, Wei;  Zhu, Yousong;  Zhao, Chaoyang;  Zhao, Rui;  Deng, Rui;  Tang, Ming;  Wang, Jinqiao
Adobe PDF (1021Kb)  |  Views/Downloads: 65/18  |  Submitted: 2024/05/30
Neighbor-view Enhanced Model for Vision and Language Navigation (Conference paper)
Proceedings of the ACM International Conference on Multimedia, Chengdu, China, 2021-10-20
Authors:  Dong An;  Yuankai Qi;  Yan Huang;  Qi Wu;  Liang Wang;  Tieniu Tan
Adobe PDF (2412Kb)  |  Views/Downloads: 56/23  |  Submitted: 2024/05/28