CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
End-to-End Paired Ambisonic-Binaural Audio Rendering 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 502-513
作者:  Yin Zhu;  Qiuqiang Kong;  Junjie Shi;  Shilei Liu;  Xuzhou Ye;  Ju-Chiang Wang;  Hongming Shan;  Junping Zhang
Adobe PDF(9612Kb)  |  收藏  |  浏览/下载:39/12  |  提交时间:2024/01/23
Ambisonic  attention  binaural rendering  neural network  
Compressing Speaker Extraction Model with Ultra-low Precision Quantization and Knowledge Distillation 期刊论文
Neural Networks, 2022, 卷号: 154, 页码: 13-21
作者:  Yating Huang;  Yunzhe Hao;  Jiaming Xu;  Bo Xu
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:185/50  |  提交时间:2022/09/17
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:373/48  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:276/58  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
鸡尾酒会问题与相关听觉模型的研究现状与展望 期刊论文
自动化学报, 2019, 卷号: 45, 期号: 2, 页码: 234-251
作者:  黄雅婷;  石晶;  许家铭;  徐波
Adobe PDF(3009Kb)  |  收藏  |  浏览/下载:170/50  |  提交时间:2022/09/17
Deep Learning Based Speech Separation via NMF-style Reconstructions 期刊论文
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, 卷号: 26, 期号: 11, 页码: 2043-2055
作者:  Shuai Nie;  Shan Liang;  Wenju Liu;  Xueliang Zhang;  Jianhua Tao
浏览  |  Adobe PDF(2922Kb)  |  收藏  |  浏览/下载:198/76  |  提交时间:2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
Multi-task learning for dangerous object detection in autonomous driving 期刊论文
INFORMATION SCIENCES, 2018, 卷号: 432, 期号: *, 页码: 559-571
作者:  Chen, Yaran;  Zhao, Dongbin;  Lv, Le;  Zhang, Qichao
浏览  |  Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:844/408  |  提交时间:2017/12/28
Dangerous Object Detection  Autonomous Driving  Multi-task Learning  Convolutional Neural Network  
基于深度学习语音分离技术的研究现状与进展 期刊论文
自动化学报, 2016, 卷号: 42, 期号: 6, 页码: 819-833
作者:  刘文举;  聂帅;  梁 山;  张学良
浏览  |  Adobe PDF(2275Kb)  |  收藏  |  浏览/下载:156/54  |  提交时间:2020/10/22
神经网络  语音分离  计算听觉场景分析  机器学习  
The analysis of the simplification from the ideal ratio to binary mask in signal-to-noise ratio sense 期刊论文
SPEECH COMMUNICATION, 2014, 卷号: 59, 页码: 22-30
作者:  Liang, Shan;  Liu, WenJu;  Jiang, Wei;  Xue, Wei
浏览  |  Adobe PDF(909Kb)  |  收藏  |  浏览/下载:288/118  |  提交时间:2015/08/12
Ideal Binary Mask  Ideal Ratio Mask  W-disjoint Orthogonality  
The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio 期刊论文
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 卷号: 134, 期号: 5, 页码: EL452-EL458
作者:  Liang, Shan;  Liu, Wenju;  Jiang, Wei;  Xue, Wei
浏览  |  Adobe PDF(109Kb)  |  收藏  |  浏览/下载:284/84  |  提交时间:2015/08/12