CASIA OpenIR

浏览/检索结果: 共548条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:96/9  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Temporal dendritic heterogeneity incorporated with spiking neural networks for learning multi-timescale dynamics 期刊论文
NATURE COMMUNICATIONS, 2024, 卷号: 15, 期号: 1, 页码: 20
作者:  Zheng, Hanle;  Zheng, Zhong;  Hu, Rui;  Xiao, Bo;  Wu, Yujie;  Yu, Fangwen;  Liu, Xue;  Li, Guoqi;  Deng, Lei
收藏  |  浏览/下载:36/0  |  提交时间:2024/02/21
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163
作者:  Xu, Jiaming;  Cui, Jian;  Hao, Yunzhe;  Xu, Bo
收藏  |  浏览/下载:37/0  |  提交时间:2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:34/10  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Enhancing Dimensional Emotion Recognition from Speech through Modulation-Filtered Cochleagram and Parallel Attention Recurrent Network 期刊论文
ELECTRONICS, 2023, 卷号: 12, 期号: 22, 页码: 15
作者:  Peng, Zhichao;  Zeng, Hua;  Li, Yongwei;  Du, Yegang;  Dang, Jianwu
收藏  |  浏览/下载:45/0  |  提交时间:2024/02/22
modulation-filtered cochleagram  parallel attention recurrent neural network  dimensional emotion recognition  auditory signal processing  noise-robust  
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:5/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
TENET: Beyond Pseudo-Labeling for Semi-supervised Few-shot Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 0
作者:  Ma CC(马成丞);  Dong WM(董未名);  Xu CS(徐常胜)
Adobe PDF(741Kb)  |  收藏  |  浏览/下载:95/22  |  提交时间:2024/01/29
Semi-supervised few-shot learning  few-shot learning  pseudo-labeling  linear regression  low-rank reconstruction  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition 期刊论文
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
作者:  Fan, Cunhang;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:25/0  |  提交时间:2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 3, 页码: 2415-2429
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
收藏  |  浏览/下载:76/0  |  提交时间:2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
Streaming Stroke Classification of Online Handwriting 会议论文
, Rhodes Island, Greece, 2023-6-9
作者:  Jing-Yu, Liu;  Yan-Ming, Zhang;  Fei Yin;  Cheng-Lin Liu
Adobe PDF(813Kb)  |  收藏  |  浏览/下载:228/109  |  提交时间:2023/06/28