CASIA OpenIR

Browse/Search Results:  1-2 of 2 Help

Selected(0)Clear Items/Page:    Sort:
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
Authors:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  Favorite  |  View/Download:35/7  |  Submit date:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 249-262
Authors:  Guyue Hu;  Bin He;  Hanwang Zhang
Adobe PDF(2167Kb)  |  Favorite  |  View/Download:40/17  |  Submit date:2024/04/23
Prompt learning  video-language pretrained models  instructional videos  procedure understanding  knowledge distilling