CASIA OpenIR

浏览/检索结果: 共375条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Comprehensive Relation Modelling for Image Paragraph Generation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 369-382
作者:  Xianglu Zhu;  Zhang Zhang;  Wei Wang;  Zilei Wang
Adobe PDF(1963Kb)  |  收藏  |  浏览/下载:14/7  |  提交时间:2024/04/23
Image paragraph generation, visual relationship, scene graph, graph convolutional network (GCN), long short-term memory  
A Comprehensive Overview of CFN From a Commonsense Perspective 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 239-256
作者:  Ru Li;  Yunxiao Zhao;  Zhiqiang Wang;  Xuefeng Su;  Shaoru Guo;  Yong Guan;  Xiaoqi Han;  Hongyan Zhao
Adobe PDF(2392Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Chinese FrameNet (CFN), commonsense, scenario commonsense, frame, knowledge  
Deep Industrial Image Anomaly Detection: A Survey 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 104-135
作者:  Jiaqi Liu;  Guoyang Xie;  Jinbao Wang;  Shangnian Li;  Chengjie Wang;  Feng Zheng;  Yaochu Jin
Adobe PDF(3376Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/04/23
Image anomaly detection, defect detection, industrial manufacturing, deep learning, computer vision  
AI for Supporting the Freedom of Drawing 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 63-88
作者:  Xiaohua Sun;  Juexiao Qin
Adobe PDF(14055Kb)  |  收藏  |  浏览/下载:11/6  |  提交时间:2024/04/23
Intention understanding, drawing support, drawing, art, human-artificial intelligence (AI) collaboration  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:18/1  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 605-613
作者:  Haotong Qin;   Ge-Peng Ji;  Salman Khan;  Deng-Ping Fan;  Fahad Shahbaz Khan;  Luc Van Gool
Adobe PDF(10373Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/04/23
Google Bard, multi-modal understanding, visual comprehension, large language models, conversational AI, chatbot  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:13/5  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 483-513
作者:  Wei-Chien Wang;  Euijoon Ahn;  Dagan Feng;  Jinman Kim
Adobe PDF(2691Kb)  |  收藏  |  浏览/下载:13/5  |  提交时间:2024/04/23
Self-supervised learning (SSL), contrastive learning, deep learning, medical image analysis, computer vision  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:16/3  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Vision Enhanced Generative Pre-trained Language Model for Multimodal Sentence Summarization 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 289-298
作者:  Liqiang Jing;  Yiren Li;  Junhao Xu;  Yongcan Yu;  Pei Shen;  Xuemeng Song
Adobe PDF(2389Kb)  |  收藏  |  浏览/下载:8/3  |  提交时间:2024/04/23
Multimodal sentence summarization (MMSS)  generative pre-trained language model (GPLM)  natural language generation  deep learning  artificial intelligence