CASIA OpenIR

浏览/检索结果: 共635条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:95/12  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Visually Guided Sound Source Separation With Audio-Visual Predictive Coding 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15
作者:  Song, Zengjie;  Zhang, Zhaoxiang
收藏  |  浏览/下载:39/0  |  提交时间:2023/11/17
Feature fusion  multimodal learning  predictive coding (PC)  self-supervised learning  sound source separation  
Dual feature enhanced video super-resolution network based on low-light scenarios 期刊论文
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 卷号: 115, 页码: 8
作者:  Zhang, Huan;  Cao, Yihao;  Cai, Jianghui;  Cai, Xingjuan;  Zhang, Wensheng
收藏  |  浏览/下载:70/0  |  提交时间:2023/11/17
Video super-resolution (VSR)  Feature enhancement  Information re-fusion  Attention mechanism  
RSDet++: Point-based modulated loss for more accurate rotated object detection 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 32, 期号: 11, 页码: 7869-7879
作者:  Wen Qian;  Xue Yang;  Silong Peng;  Xiujuan Zhang;  Junchi Yan
Adobe PDF(6998Kb)  |  收藏  |  浏览/下载:169/62  |  提交时间:2023/06/07
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文
, Washington D.C., USA, 2023-2-9
作者:  Qingyu Wang;  Tielin Zhang;  Minglun Han;  Yi Wang;  Duzhen Zhang;  Bo Xu
Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:137/44  |  提交时间:2023/06/20
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:118/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Patch Loss: A generic multi-scale perceptual loss for single image super-resolution 期刊论文
Pattern Recognition, 2023, 卷号: 139, 页码: 109510
作者:  An T(安泰);  Mao BJ(毛彬杰);  Xue B(薛斌);  Huo CL(霍春雷);  Xiang SM(向世明);  Pan CH(潘春洪)
Adobe PDF(5876Kb)  |  收藏  |  浏览/下载:92/13  |  提交时间:2024/01/17
Single-image super-resolution  Multi-scale loss functions  Image visual perception  Perceptual metrics  
Learning to Adapt Across Dual Discrepancy for Cross-Domain Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 2, 页码: 1963-1980
作者:  Luo, Chuanchen;  Song, Chunfeng;  Zhang, Zhaoxiang
Adobe PDF(2539Kb)  |  收藏  |  浏览/下载:204/61  |  提交时间:2023/03/20
Person re-identification  domain adaptation  cross-domain mixup  camera-aware learning  self-paced learning  
Temporal sparse adversarial attack on sequence-based gait recognition 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 133, 页码: 11
作者:  He, Ziwen;  Wang, Wei;  Dong, Jing;  Tan, Tieniu
Adobe PDF(1435Kb)  |  收藏  |  浏览/下载:289/53  |  提交时间:2022/11/21
Adversarial attack  Gait recognition  Temporal sparsity