CASIA OpenIR

浏览/检索结果: 共16条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:  Shi, Jing;  Chang, Xuankai;  Watanabe, Shinji;  Xu, Bo
收藏  |  浏览/下载:206/0  |  提交时间:2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文
0, 线上会议, 2021-4-23
作者:  Zhang Peng;  Xu Jiaming;  Hao Yunzhe;  Xu Bo
Adobe PDF(532Kb)  |  收藏  |  浏览/下载:200/47  |  提交时间:2021/06/21
audio-visual speech separation  online processing  generative adversarial training  causal temporal convolutional network  
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:205/51  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach  
Monaural speech separation based on MAXVQ and CASA for robust speech recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2010, 卷号: 24, 期号: 1, 页码: 30-44
作者:  Li, Peng;  Guan, Yong;  Wang, Shijin;  Xu, Bo;  Liu, Wenju
收藏  |  浏览/下载:54/0  |  提交时间:2020/10/27
Monaural Speech Separation  Computational Auditory Scene Analysis (Casa)  Factorial-max Vector Quantization (Maxvq)  Automatic Speech Recognition (Asr)  
Monaural voiced speech segregation based on elaborate harmonic grouping strategies 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2011, 卷号: 54, 期号: 12, 页码: 2471-2480
作者:  Liu WenJu;  Zhang XueLiang;  Jiang Wei;  Li Peng;  Xu Bo
收藏  |  浏览/下载:63/0  |  提交时间:2020/10/27
Computational Auditory Scene Analysis  Voiced Speech Separation  Harmonistic Principle  Minimum Amplitude Principle  Elaborate Harmonic Grouping Strategies  
Deep Learning Based Speech Separation via NMF-style Reconstructions 期刊论文
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, 卷号: 26, 期号: 11, 页码: 2043-2055
作者:  Shuai Nie;  Shan Liang;  Wenju Liu;  Xueliang Zhang;  Jianhua Tao
浏览  |  Adobe PDF(2922Kb)  |  收藏  |  浏览/下载:198/76  |  提交时间:2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:275/58  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Cross-domain cooperative deep stacking network for speech separation 会议论文
EI, Brisbane, Australia, April 19-24, 2015
作者:  Wei Jiang;  Shan Liang;  Like Dong;  Hong Yang;  Wenju Liu;  Yunji Wang
浏览  |  Adobe PDF(118Kb)  |  收藏  |  浏览/下载:308/113  |  提交时间:2016/10/25
Speech Separation  Cross-domain Cooperative Structure  Deep Stacking Network  Deep Neural Network  
Two-Stage Multi-Target Joint Learning for Monaural Speech Separation 会议论文
Annual Conference of the International Speech Communication Association (INTERSPEECH), Dresden Germany, 2015
作者:  Shuai, Nie;  Shan, Liang;  Wei, Xue;  XueLiang, Zhang;  WenJu, Liu;  Like Dong;  Hong Yang
浏览  |  Adobe PDF(185Kb)  |  收藏  |  浏览/下载:296/81  |  提交时间:2016/04/12
Speech Separation  Multi-target Learning  Computational Auditory Scene Analysis (Casa)  
EXPLOITING SPECTRO-TEMPORAL STRUCTURES USING NMF FOR DNN-BASED SUPERVISED SPEECH SEPARATION 会议论文
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shang Hai, China, 2016
作者:  Shuai, Nie;  Shan, Liang;  Hao, Li;  XueLiang, Zhang;  ZhanLei, Yang;  WenJu, Liu;  LiKe, Dong
浏览  |  Adobe PDF(408Kb)  |  收藏  |  浏览/下载:473/131  |  提交时间:2016/04/12
Speech Separation  Deep Neural Network  Nonnegative Matrix Factorization  Spectro-temporal Structures