CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
A Deep Model for Partial Multi-label Image Classification with Curriculum-based Disambiguation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 801-814
作者:  Feng Sun;  Ming-Kun Xie;  Sheng-Jun Huang
Adobe PDF(1337Kb)  |  收藏  |  浏览/下载:23/7  |  提交时间:2024/07/18
Partial multi-label image classification  curriculum-based disambiguation  consistency regularization  label difficulty  candidate label set.  
ReChoreoNet: Repertoire-based Dance Re-choreography with Music-conditioned Temporal and Style Clues 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 771-781
作者:  Ho Yin Au;  Jie Chen;  Junkun Jiang;  Yike Guo
Adobe PDF(2161Kb)  |  收藏  |  浏览/下载:10/3  |  提交时间:2024/07/18
Generative model  cross-modality learning  normalizing flow  tempo synchronization  style transfer  
TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 704-717
作者:  Yukun Zhai;   Xiaoqiang Zhang;   Xiameng Qin;   Sanyuan Zhao;  Xingping Dong;   Jianbing Shen
Adobe PDF(2312Kb)  |  收藏  |  浏览/下载:14/5  |  提交时间:2024/07/18
End-to-end text spotting  arbitrarily-shaped texts  transformer  mixed supervision  multitask modeling  
Adaptively Enhancing Facial Expression Crucial Regions via a Local Non-local Joint Network 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 331-348
作者:  Guanghui Shi;  Shasha Mao;  Shuiping Gou;  Dandan Yan;  Licheng Jiao;  Lin Xiong
Adobe PDF(3926Kb)  |  收藏  |  浏览/下载:56/18  |  提交时间:2024/04/23
Facial expression recognition, deep neural network, multiple network ensemble, attention network, facial crucial regions  
Boosting Multi-modal Ocular Recognition via Spatial Feature Reconstruction and Unsupervised Image Quality Estimation 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 197-214
作者:  Zihui Yan;  Yunlong Wang;  Kunbo Zhang;  Zhenan Sun;  Lingxiao He
Adobe PDF(3457Kb)  |  收藏  |  浏览/下载:52/16  |  提交时间:2024/04/23
Iris recognition, periocular recognition, spatial feature reconstruction, fully convolutional network, flexible matching, unsupervised iris quality assessment, adaptive weight fusion  
Deep Industrial Image Anomaly Detection: A Survey 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 104-135
作者:  Jiaqi Liu;  Guoyang Xie;  Jinbao Wang;  Shangnian Li;  Chengjie Wang;  Feng Zheng;  Yaochu Jin
Adobe PDF(3376Kb)  |  收藏  |  浏览/下载:58/10  |  提交时间:2024/04/23
Image anomaly detection, defect detection, industrial manufacturing, deep learning, computer vision  
Cogeneration of Innovative Audio-visual Content: A New Challenge for Computing Art 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 4-28
作者:  Mengting Liu;  Ying Zhou;  Yuwei Wu;  Feng Gao
Adobe PDF(14438Kb)  |  收藏  |  浏览/下载:65/9  |  提交时间:2024/04/23
Artificial intelligence (AI) art, audio-visual, artificial intelligence generated content (AIGC), multimodal, artistic evaluation  
Effective Model Compression via Stage-wise Pruning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 937-951
作者:  Ming-Yang Zhang;  Xin-Yi Yu;  Lin-Lin Ou
Adobe PDF(2394Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/04/23
Automated machine learning (AutoML), channel pruning, model compression, distillation, convolutional neural networks (CNN)  
Rolling Shutter Camera: Modeling, Optimization and Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 783-798
作者:  Bin Fan;  Yuchao Dai;  Mingyi He
Adobe PDF(2943Kb)  |  收藏  |  浏览/下载:42/14  |  提交时间:2024/04/23
Rolling shutter, motion modeling, image correction, temporal super-resolution, deep learning  
Compositional Prompting Video-language Models to Understand Procedure in Instructional Videos 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 249-262
作者:  Guyue Hu;  Bin He;  Hanwang Zhang
Adobe PDF(2167Kb)  |  收藏  |  浏览/下载:69/30  |  提交时间:2024/04/23
Prompt learning  video-language pretrained models  instructional videos  procedure understanding  knowledge distilling