CASIA OpenIR

Browse/Search results: 13 items in total, showing items 1-10

TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 704-717
Authors: Yukun Zhai; Xiaoqiang Zhang; Xiameng Qin; Sanyuan Zhao; Xingping Dong; Jianbing Shen
Adobe PDF (2312Kb) | Views/Downloads: 14/5 | Submitted: 2024/07/18
End-to-end text spotting, arbitrarily-shaped texts, transformer, mixed supervision, multitask modeling
Autonomy Evaluation of Unmanned Systems Based on Task Models (Journal article)
Machine Intelligence Research, 2024, Pages: 1-16
Authors: Yi Zou; Zehao Ni; Xun Lei; Chi Zhang
Adobe PDF (1801Kb) | Views/Downloads: 39/11 | Submitted: 2024/06/27
Parsing Objects at a Finer Granularity: A Survey (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 431-451
Authors: Yifan Zhao; Jia Li; Yonghong Tian
Adobe PDF (1743Kb) | Views/Downloads: 40/16 | Submitted: 2024/05/23
Finer granularity, visual parsing, part segmentation, fine-grained object recognition, part relationship
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 4, Pages: 483-513
Authors: Wei-Chien Wang; Euijoon Ahn; Dagan Feng; Jinman Kim
Adobe PDF (2691Kb) | Views/Downloads: 63/19 | Submitted: 2024/04/23
Self-supervised learning (SSL), contrastive learning, deep learning, medical image analysis, computer vision
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 4, Pages: 447-482
Authors: Xiao Wang; Guangyao Chen; Guangwu Qian; Pengcheng Gao; Xiao-Yong Wei; Yaowei Wang; Yonghong Tian; Wen Gao
Adobe PDF (3540Kb) | Views/Downloads: 70/16 | Submitted: 2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 249-262
Authors: Guyue Hu; Bin He; Hanwang Zhang
Adobe PDF (2167Kb) | Views/Downloads: 69/30 | Submitted: 2024/04/23
Prompt learning, video-language pretrained models, instructional videos, procedure understanding, knowledge distilling
VLP: A Survey on Vision-language Pre-training (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 38-56
Authors: Fei-Long Chen; Du-Zhen Zhang; Ming-Lun Han; Xiu-Yi Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF (1427Kb) | Views/Downloads: 55/17 | Submitted: 2024/04/23
Vision and language, pre-training, transformers, multimodal learning, representation learning
Causal Reasoning Meets Visual Representation Learning: A Prospective Study (Journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 6, Pages: 485-511
Authors: Yang Liu; Yu-Shen Wei; Hong Yan; Guan-Bin Li; Liang Lin
Adobe PDF (3224Kb) | Views/Downloads: 51/6 | Submitted: 2024/04/23
Causal reasoning, visual representation learning, reliable artificial intelligence, spatial-temporal data, multi-modal analysis
Towards a New Paradigm for Brain-inspired Computer Vision (Journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 5, Pages: 412-424
Authors: Xiao-Long Zou; Tie-Jun Huang; Si Wu
Adobe PDF (1615Kb) | Views/Downloads: 43/8 | Submitted: 2024/04/23
Brain-inspired computer vision, spatio-temporal patterns, object detection, object tracking, object recognition
Efficient Visual Recognition: A Survey on Recent Advances and Brain-inspired Methodologies (Journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 5, Pages: 366-411
Authors: Yang Wu; Ding-Heng Wang; Xiao-Tong Lu; Fan Yang; Man Yao; Wei-Sheng Dong; Jian-Bo Shi; Guo-Qi Li
Adobe PDF (6780Kb) | Views/Downloads: 52/8 | Submitted: 2024/04/23
Visual recognition, deep neural networks (DNNs), brain-inspired methodologies, network compression, dynamic inference, survey