CASIA OpenIR  > 模式识别国家重点实验室  > 语音交互
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement
Liu, Bin1; Tao, Jianhua1; Wen, Zhengqi1; Mo, Fuyuan2; Bin Liu
Source PublicationJOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY
2016-02-01
Volume82Issue:2Pages:141-150
SubtypeArticle
AbstractThis paper presents a speech enhancement approach based on analysis-synthesis framework. An improved multi-band summary correlogram (MBSC) algorithm is proposed for pitch estimation and voiced/unvoiced (V/UV) detection. The proposed pitch detection algorithm achieves a lower pitch detection error compared with the reference algorithm. The denoising autoencoder (DAE) is applied to enhance the line spectrum frequencies (LSFs). The reconstruction loss could be decreased compare with the swallow model. The proposed approach is evaluated using the perceptual evaluation of speech quality (PESQ) and the experimental results show that the proposed approach improves the performance of speech enhancement compared with the conventional speech enhancement approach. In addition, it could be applied to parametric speech coding even at low bit rate and low signal-noise ratio (SNR) environments.
KeywordAnalysis-synthesis Framework Multi-band Summary Correlogram Denoising Autoencoder Speech Enhancement Speech Coding
WOS HeadingsScience & Technology ; Technology
DOI10.1007/s11265-015-1025-1
WOS KeywordSPECTRAL AMPLITUDE ESTIMATOR ; ERROR ; NOISE ; ALGORITHM ; PRIORS
Indexed BySCI
Language英语
Funding OrganizationNational High-Tech Research and Development Program of China(863 Program)(2015AA016305) ; National Natural Science Foundation of China (NSFC)(61425017 ; Major Program for the National Social Science Fund of China(13ZD 189) ; Integration and application of basic science data in Chinese information processing field(XXH12504-1-11) ; 61403386 ; 61305003 ; 61332017 ; 61375027 ; 61273288 ; 61233009 ; 61203258)
WOS Research AreaComputer Science ; Engineering
WOS SubjectComputer Science, Information Systems ; Engineering, Electrical & Electronic
WOS IDWOS:000371299700002
Citation statistics
Document Type期刊论文
Identifierhttp://ir.ia.ac.cn/handle/173211/11357
Collection模式识别国家重点实验室_语音交互
Corresponding AuthorBin Liu
Affiliation1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Chinese Acad Sci, Inst Acoust, Beijing 100190, Peoples R China
Recommended Citation
GB/T 7714
Liu, Bin,Tao, Jianhua,Wen, Zhengqi,et al. Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,2016,82(2):141-150.
APA Liu, Bin,Tao, Jianhua,Wen, Zhengqi,Mo, Fuyuan,&Bin Liu.(2016).Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,82(2),141-150.
MLA Liu, Bin,et al."Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement".JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 82.2(2016):141-150.
Files in This Item: Download All
File Name/Size DocType Version Access License
Speech Enhancement B(695KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Liu, Bin]'s Articles
[Tao, Jianhua]'s Articles
[Wen, Zhengqi]'s Articles
Baidu academic
Similar articles in Baidu academic
[Liu, Bin]'s Articles
[Tao, Jianhua]'s Articles
[Wen, Zhengqi]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Liu, Bin]'s Articles
[Tao, Jianhua]'s Articles
[Wen, Zhengqi]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.