CASIA OpenIR

Browse/Search results: 2 items in total, showing items 1-2

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges (journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 5, Pages: 605-613
Authors: Haotong Qin; Ge-Peng Ji; Salman Khan; Deng-Ping Fan; Fahad Shahbaz Khan; Luc Van Gool
Adobe PDF (10373Kb)  |  Views/Downloads: 2/0  |  Submitted: 2024/04/23
Google Bard, multi-modal understanding, visual comprehension, large language models, conversational AI, chatbot  
Visuals to Text: A Comprehensive Review on Automatic Image Captioning (journal article)
IEEE/CAA Journal of Automatica Sinica, 2022, Volume: 9, Issue: 8, Pages: 1339-1365
Authors: Yue Ming; Nannan Hu; Chunxiao Fan; Fan Feng; Jiangwan Zhou; Hui Yu
Adobe PDF (56128Kb)  |  Views/Downloads: 150/21  |  Submitted: 2022/08/01
Artificial intelligence, attention mechanism, encoder-decoder framework, image captioning, multi-modal understanding, training strategies