CASIA OpenIR

Browse/Search Results: 59 items in total, showing items 1-10

Boosting Performance on 3D Object Detection with a Plug-in Discrimination Module (Conference paper)
, Singapore, 2024.02.23
Authors: Yi Yang; Zhang Zhang
Adobe PDF(373Kb) | Views/Downloads: 12/7 | Submitted: 2024/06/11
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models (Conference paper)
, VANCOUVER, CANADA, 2024-2-20 to 2024-2-27
Authors: Zhaopeng Gu; Bingke Zhu; Guibo Zhu; Yingying Chen; Ming Tang; Jinqiao Wang
Adobe PDF(4846Kb) | Views/Downloads: 29/1 | Submitted: 2024/06/06
UniGen: Unified Generative Pre-training for Multilingual Multimodal Representation (Conference paper)
, Waseda University, Tokyo, Japan, 2024.03.15-2024.03.18
Authors: Zheyuan, Tian; Guan, Luo; Bo, Wang; Bing, Li; Weiming, Hu
Adobe PDF(975Kb) | Views/Downloads: 32/6 | Submitted: 2024/05/31
PCEN: Potential Correlation-Enhanced Network for Multimodal Named Entity Recognition (Conference paper)
, Charlotte, NC, USA, 02-03 October 2023
Authors: Jiakai Geng; Chenyang Zhang; Linjing Li; Qing Yang; Daniel Zeng
Adobe PDF(4985Kb) | Views/Downloads: 19/4 | Submitted: 2024/05/31
named entity recognition  multimodal learning  vision-language pre-trained model  inconsistency loss  
Prototype Calibration with Synthesized Samples for Zero-Shot Chinese Character Recognition (Conference paper)
, Seoul, Korea, 14-19 April 2024
Authors: Ao, Xiang; Li, Xiao-Hui; Zhang, Xu-Yao; Liu, Cheng-Lin
Adobe PDF(1434Kb) | Views/Downloads: 21/8 | Submitted: 2024/05/30
Cross-modal Prototype Learning for Zero-shot Handwriting Recognition (Conference paper)
, Sydney, Australia, 20-25 September 2019
Authors: Ao, Xiang; Zhang, Xu-Yao; Yang, Hong-Ming; Yin, Fei; Liu, Cheng-Lin
Adobe PDF(226Kb) | Views/Downloads: 20/9 | Submitted: 2024/05/30
printed character  handwritten character  cross-modal  prototype learning  zero-shot  
BEVBert: Multimodal Map Pre-training for Language-guided Navigation (Conference paper)
Proceedings of the IEEE International Conference on Computer Vision, Paris, France, 2023-10-2
Authors: Dong An; Yuankai Qi; Yangguang Li; Yan Huang; Liang Wang; Tieniu Tan; Jing Shao
Adobe PDF(1722Kb) | Views/Downloads: 22/5 | Submitted: 2024/05/28
Neighbor-view Enhanced Model for Vision and Language Navigation (Conference paper)
Proceedings of the ACM International Conference on Multimedia, Chengdu, China, 2021-10-20
Authors: Dong An; Yuankai Qi; Yan Huang; Qi Wu; Liang Wang; Tieniu Tan
Adobe PDF(2412Kb) | Views/Downloads: 9/3 | Submitted: 2024/05/28
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (Conference paper)
, New Orleans, Louisiana & Online, 2022-11-28
Authors: Chen, Zhiyang; Zhu, Yousong; Li, Zhaowen; Yang, Fan; Li, Wei; Wang, Haixin; Zhao, Chaoyang; Wu, Liwei; Zhao, Rui; Wang, Jinqiao; Tang, Ming
Adobe PDF(1289Kb) | Views/Downloads: 14/3 | Submitted: 2024/05/28
transformer  general visual framework  sequence prediction  multi-task  
Identifying Topic and Cause for Sarcasm: An Unsupervised Knowledge-enhanced Prompt Method (Conference paper)
WWW’23 Companion, Austin, TX, USA, 2023-4
Authors: Minjie, Yuan; Qiudan, Li; Xue, Mao; Daniel Dajun, Zeng
Adobe PDF(447Kb) | Views/Downloads: 17/6 | Submitted: 2024/05/28