CASIA OpenIR

浏览/检索结果: 共30条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Comprehensive Relation Modelling for Image Paragraph Generation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 369-382
作者:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:14/7  |  提交时间:2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 605-613
作者:  Haotong Qin;   Ge-Peng Ji;  Salman Khan;  Deng-Ping Fan;  Fahad Shahbaz Khan;  Luc Van Gool
Adobe PDF(10373Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/04/23
Google Bard, multi-modal understanding, visual comprehension, large language models, conversational AI, chatbot  
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:187/69  |  提交时间:2023/07/06
基于平行学习的艺术绘画图像描述算法研究 学位论文
, 2023
作者:  鲁越
Adobe PDF(15730Kb)  |  收藏  |  浏览/下载:117/3  |  提交时间:2023/06/25
平行学习  艺术绘画  图像描述  内容描述  情感描述  
Semi-supervised cross-modal image generation with generative adversarial networks 期刊论文
Pattern Recognition, 2020, 卷号: 100, 页码: 107085
作者:  Li D(李丹);  Du CD(杜长德);  He HG(何晖光)
Adobe PDF(4031Kb)  |  收藏  |  浏览/下载:111/33  |  提交时间:2023/05/05
ArtCap: A Dataset for Image Captioning of Fine Art Paintings 期刊论文
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 页码: 12
作者:  Lu, Yue;  Guo, Chao;  Dai, Xingyuan;  Wang, Fei-Yue
Adobe PDF(5137Kb)  |  收藏  |  浏览/下载:245/44  |  提交时间:2023/02/22
Dataset construction  image captioning  painting captioning  
Networked Knowledge and Complex Networks: An Engineering View 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 8, 页码: 1366-1383
作者:  Jinhu Lü;  Guanghui Wen;  Ruqian Lu;  Yong Wang;  Songmao Zhang
Adobe PDF(1876Kb)  |  收藏  |  浏览/下载:148/21  |  提交时间:2022/08/01
Complex network  knowledge graph  networked knowledge  neural network  
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:258/48  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
从视频到语言:视频描述和标题生成方法研究 学位论文
, 中国科学院自动化研究所: 中国科学院自动化研究所, 2022
作者:  张子琦
Adobe PDF(19170Kb)  |  收藏  |  浏览/下载:1131/15  |  提交时间:2022/06/16
视觉与语言  视频内容描述  视频标题生成  外部语言模型  开卷视频描述  中文短视频-文本基准  大规模多模态预训练  
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:331/124  |  提交时间:2022/06/14