CASIA OpenIR

Browse/Search results: 10 items in total, showing items 1-10

Train from scratch: Single-stage joint training of speech separation and recognition [Journal article]
COMPUTER SPEECH AND LANGUAGE, 2022, Volume: 76, Pages: 15
Authors: Shi, Jing; Chang, Xuankai; Watanabe, Shinji; Xu, Bo
Views/Downloads: 217/0  |  Submitted: 2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
Online Audio-Visual Speech Separation with Generative Adversarial Training [Conference paper]
0, online conference, 2021-4-23
Authors: Zhang Peng; Xu Jiaming; Hao Yunzhe; Xu Bo
Adobe PDF(532Kb)  |  Views/Downloads: 217/50  |  Submitted: 2021/06/21
audio-visual speech separation  online processing  generative adversarial training  causal temporal convolutional network  
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training [Conference paper]
0, online conference, 2021-7-18
Authors: Zhang Peng; Xu Jiaming; Shi Jing; Hao Yunzhe; Qin Lei; Xu Bo
Adobe PDF(1900Kb)  |  Views/Downloads: 223/57  |  Submitted: 2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach  
Monaural speech separation based on MAXVQ and CASA for robust speech recognition [Journal article]
COMPUTER SPEECH AND LANGUAGE, 2010, Volume: 24, Issue: 1, Pages: 30-44
Authors: Li, Peng; Guan, Yong; Wang, Shijin; Xu, Bo; Liu, Wenju
Views/Downloads: 60/0  |  Submitted: 2020/10/27
Monaural Speech Separation  Computational Auditory Scene Analysis (CASA)  Factorial-max Vector Quantization (MAXVQ)  Automatic Speech Recognition (ASR)  
Monaural voiced speech segregation based on elaborate harmonic grouping strategies [Journal article]
SCIENCE CHINA-INFORMATION SCIENCES, 2011, Volume: 54, Issue: 12, Pages: 2471-2480
Authors: Liu WenJu; Zhang XueLiang; Jiang Wei; Li Peng; Xu Bo
Views/Downloads: 67/0  |  Submitted: 2020/10/27
Computational Auditory Scene Analysis  Voiced Speech Separation  Harmonistic Principle  Minimum Amplitude Principle  Elaborate Harmonic Grouping Strategies  
Deep Learning Based Speech Separation via NMF-style Reconstructions [Journal article]
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, Volume: 26, Issue: 11, Pages: 2043-2055
Authors: Shuai Nie; Shan Liang; Wenju Liu; Xueliang Zhang; Jianhua Tao
Adobe PDF(2922Kb)  |  Views/Downloads: 210/80  |  Submitted: 2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, Volume: 28, Issue: 28, Pages: 1303-1314
Authors: Fan, Cunhang; Tao, Jianhua; Liu, Bin; Yi, Jiangyan; Wen, Zhengqi; Liu, Xuefei
Adobe PDF(1344Kb)  |  Views/Downloads: 300/65  |  Submitted: 2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Two-Stage Multi-Target Joint Learning for Monaural Speech Separation [Conference paper]
Annual Conference of the International Speech Communication Association (INTERSPEECH), Dresden, Germany, 2015
Authors: Shuai, Nie; Shan, Liang; Wei, Xue; XueLiang, Zhang; WenJu, Liu; Like Dong; Hong Yang
Adobe PDF(185Kb)  |  Views/Downloads: 309/86  |  Submitted: 2016/04/12
Speech Separation  Multi-target Learning  Computational Auditory Scene Analysis (CASA)  
EXPLOITING SPECTRO-TEMPORAL STRUCTURES USING NMF FOR DNN-BASED SUPERVISED SPEECH SEPARATION [Conference paper]
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 2016
Authors: Shuai, Nie; Shan, Liang; Hao, Li; XueLiang, Zhang; ZhanLei, Yang; WenJu, Liu; LiKe, Dong
Adobe PDF(408Kb)  |  Views/Downloads: 484/133  |  Submitted: 2016/04/12
Speech Separation  Deep Neural Network  Nonnegative Matrix Factorization  Spectro-temporal Structures  
Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech [Journal article]
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, Volume: 14, Issue: 6, Pages: 2014-2023
Authors: Li, Peng; Guan, Yong; Xu, Bo; Liu, Wenju
Adobe PDF(673Kb)  |  Views/Downloads: 222/62  |  Submitted: 2015/11/07
Computational Auditory Scene Analysis (CASA) Grouping  Monaural Speech Separation  Objective Quality Assessment of Speech (OQAS)  Segmentation