CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:15/1  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:14/3  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Deep Learning-based Moving Object Segmentation: Recent Progress and Research Prospects 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 335-369
作者:  Rui Jiang;  Ruixiang Zhu;  Hu Su;  Yinlin Li;  Yuan Xie;  Wei Zou
Adobe PDF(9061Kb)  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/04/23
Moving object segmentation (MOS), change detection, background subtraction, deep learning (DL), video understanding  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:9/3  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 5, 页码: 366-411
作者:  Yang Wu;  Ding-Heng Wang;  Xiao-Tong Lu;  Fan Yang;  Man Yao;  Wei-Sheng Dong;  Jian-Bo Shi;  Guo-Qi Li
Adobe PDF(6780Kb)  |  收藏  |  浏览/下载:13/3  |  提交时间:2024/04/23
Visual recognition  deep neural networks (DNNS)  brain-inspired methodologies  network compression  dynamic inference  survey  
TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 3, 页码: 227-246
作者:  Xiao-Yu Zhang;  Hai-Chao Shi;  Chang-Sheng Li;  Li-Xin Duan
Adobe PDF(3616Kb)  |  收藏  |  浏览/下载:8/2  |  提交时间:2024/04/23
Knowledge transfer  weakly supervised learning  self-attention mechanism  representation learning  action localization  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:138/28  |  提交时间:2023/06/21