CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Mask Distillation Network for Conjunctival Hyperemia Severity Classification 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 909-922
作者:  Mingchao Li;  Kun Huang;  Xiao Ma;  Yuexuan Wang;  Wen Fan;  Qiang Chen
Adobe PDF(2827Kb)  |  收藏  |  浏览/下载:51/17  |  提交时间:2024/04/23
Mask distillation (MD), conjunctiva hyperemia, attention mechanism, severity classification, deep learning  
MVContrast: Unsupervised Pretraining for Multi-view 3D Object Recognition 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 872-883
作者:  Luequan Wang;  Hongbin Xu;  Wenxiong Kang
Adobe PDF(1954Kb)  |  收藏  |  浏览/下载:44/8  |  提交时间:2024/04/23
Multi view, unsupervised pretraining, contrastive learning, 3D vision, shape recognition  
Transmission Line Insulator Defect Detection Based on Swin Transformer and Context 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 729-740
作者:  Yu Xi;  Ke Zhou;  Ling-Wen Meng;  Bo Chen;  Hao-Min Chen;  Jing-Yi Zhang
Adobe PDF(18337Kb)  |  收藏  |  浏览/下载:53/13  |  提交时间:2024/04/23
Insulator defect, object detection, Swin transformer, data augmentation, context information  
How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 605-613
作者:  Haotong Qin;   Ge-Peng Ji;  Salman Khan;  Deng-Ping Fan;  Fahad Shahbaz Khan;  Luc Van Gool
Adobe PDF(10373Kb)  |  收藏  |  浏览/下载:45/9  |  提交时间:2024/04/23
Google Bard, multi-modal understanding, visual comprehension, large language models, conversational AI, chatbot  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:54/15  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:71/16  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
AI in Human-computer Gaming: Techniques, Challenges and Opportunities 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 299-317
作者:  Qi-Yue Yin;  Jun Yang;  Kai-Qi Huang;  Mei-Jing Zhao;  Wan-Cheng Ni;  Bin Liang;  Yan Huang;  Shu Wu;  Liang Wang
Adobe PDF(2608Kb)  |  收藏  |  浏览/下载:70/18  |  提交时间:2024/04/23
Human-computer gaming, AI, intelligent decision making, deep reinforcement learning, self-play  
Deep Gradient Learning for Efficient Camouflaged Object Detection 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 92-108
作者:  Ge-Peng Ji;  Deng-Ping Fan;  Yu-Cheng Chou;  Dengxin Dai;  Alexander Liniger;  Luc Van Gool
Adobe PDF(5723Kb)  |  收藏  |  浏览/下载:66/19  |  提交时间:2024/04/23
Camouflaged object detection (COD)  object gradient  soft grouping  efficient model  image segmentation  
Visual Superordinate Abstraction for Robust Concept Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 79-91
作者:  Qi Zheng;  Chao-Yue Wang;  Dadong Wang;  a-Cheng Tao
Adobe PDF(2703Kb)  |  收藏  |  浏览/下载:47/14  |  提交时间:2024/04/23
Concept learning  visual question answering  weakly-supervised learning  multi-modal learning  curriculum learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:56/17  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning