CASIA OpenIR

Browse/Search Results:  1-10 of 103 Help

Selected(0)Clear Items/Page:    Sort:
基于一致性分析的伪造图像鉴别方法研究 学位论文
, 2024
Authors:  白炜铭
Adobe PDF(17362Kb)  |  Favorite  |  View/Download:3/0  |  Submit date:2024/05/27
伪造图像鉴别  计算机生成图像鉴别  伪造人脸鉴别  一致性线索  
推理机制启发的视觉语言导航 学位论文
, 2024
Authors:  安东
Adobe PDF(10930Kb)  |  Favorite  |  View/Download:18/2  |  Submit date:2024/05/27
视觉语言导航  模块化推理  认知地图  子目标导航  
面向特征学习的图像开集识别方法研究 学位论文
, 2024
Authors:  孙珈因
Adobe PDF(8220Kb)  |  Favorite  |  View/Download:30/2  |  Submit date:2024/05/23
开集识别  分布建模  层级注意力  频域滤波  反事实去混淆  直推式框架  
Parsing Objects at a Finer Granularity: A Survey 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 431-451
Authors:  Yifan Zhao;  Jia Li;  Yonghong Tian
Adobe PDF(1743Kb)  |  Favorite  |  View/Download:5/4  |  Submit date:2024/05/23
Finer granularity, visual parsing, part segmentation, fine-grained object recognition, part relationship  
从视频到语言:视频标题生成与描述研究综述 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 2, 页码: 375-397
Authors:  汤鹏杰;  王瀚漓
Adobe PDF(8546Kb)  |  Favorite  |  View/Download:3/1  |  Submit date:2024/05/20
视频描述  卷积神经网络  循环神经网络  语段生成  情感表达  逻辑语义  
A Review of Predictive and Contrastive Self-supervised Learning for Medical Images 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 483-513
Authors:  Wei-Chien Wang;  Euijoon Ahn;  Dagan Feng;  Jinman Kim
Adobe PDF(2691Kb)  |  Favorite  |  View/Download:17/5  |  Submit date:2024/04/23
Self-supervised learning (SSL), contrastive learning, deep learning, medical image analysis, computer vision  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
Authors:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  Favorite  |  View/Download:20/4  |  Submit date:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 249-262
Authors:  Guyue Hu;  Bin He;  Hanwang Zhang
Adobe PDF(2167Kb)  |  Favorite  |  View/Download:21/11  |  Submit date:2024/04/23
Prompt learning  video-language pretrained models  instructional videos  procedure understanding  knowledge distilling  
Causal Reasoning Meets Visual Representation Learning: A Prospective Study 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 6, 页码: 485-511
Authors:  Yang Liu;  Yu-Shen Wei;  Hong Yan;  Guan-Bin Li;  Liang Lin
Adobe PDF(3224Kb)  |  Favorite  |  View/Download:14/2  |  Submit date:2024/04/23
Causal reasoning  visual representation learning  reliable artificial intelligence  spatial-temporal data  multi-modal analysis  
TwinNet: Twin Structured Knowledge Transfer Network for Weakly Supervised Action Localization 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 3, 页码: 227-246
Authors:  Xiao-Yu Zhang;  Hai-Chao Shi;  Chang-Sheng Li;  Li-Xin Duan
Adobe PDF(3616Kb)  |  Favorite  |  View/Download:10/2  |  Submit date:2024/04/23
Knowledge transfer  weakly supervised learning  self-attention mechanism  representation learning  action localization