CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Federated Learning on Multimodal Data: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 539-553
作者:  Yi-Ming Lin;   Yuan Gao;  Mao-Guo Gong;  Si-Jia Zhang;  Yuan-Qiao Zhang;  Zhi-Yuan Li
Adobe PDF(1253Kb)  |  收藏  |  浏览/下载:11/5  |  提交时间:2024/04/23
Federated learning, multimodal learning, heterogeneous data, edge computing, collaborative learning  
Transformer: A General Framework from Machine Translation to Others 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 514-538
作者:  Yang Zhao;  Jiajun Zhang;  Chengqing Zong
Adobe PDF(1415Kb)  |  收藏  |  浏览/下载:13/5  |  提交时间:2024/04/23
Neural machine translation, Transformer, document neural machine translation (NMT), multimodal NMT, low-resource NMT  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:14/3  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Masked Vision-language Transformer in Fashion 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 421-434
作者:  Ge-Peng Ji;  Mingchen Zhuge;  Dehong Gao;  Deng-Ping Fan;  Christos Sakaridis;  Luc Van Gool
Adobe PDF(2779Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 249-262
作者:  Guyue Hu;  Bin He;  Hanwang Zhang
Adobe PDF(2167Kb)  |  收藏  |  浏览/下载:17/9  |  提交时间:2024/04/23
Prompt learning  video-language pretrained models  instructional videos  procedure understanding  knowledge distilling  
Multimodal Pretraining from Monolingual to Multilingual 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 220-232
作者:  Liang Zhang;  Ludan Ruan;  Anwen Hu;  Qin Jin
Adobe PDF(3024Kb)  |  收藏  |  浏览/下载:12/4  |  提交时间:2024/04/23
Multilingual pretraining  multimodal pretraining  cross-lingual transfer  multilingual generation  cross-modal retrieval  
Editorial for Special Issue on Large-scale Pre-training: Data, Models, and Fine-tuning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 145-146
作者:  Ji-Rong Wen;  Zi Huang;  Hanwang Zhang
Adobe PDF(513Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/04/23
Visual Superordinate Abstraction for Robust Concept Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 79-91
作者:  Qi Zheng;  Chao-Yue Wang;  Dadong Wang;  a-Cheng Tao
Adobe PDF(2703Kb)  |  收藏  |  浏览/下载:9/2  |  提交时间:2024/04/23
Concept learning  visual question answering  weakly-supervised learning  multi-modal learning  curriculum learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:9/3  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning