Recently speech synthesis system is ruled by concatenative synthesis us-ing speech waveforms corpus, based on which various concatenative algorithms,prosody models and establishment of speech database technology are widely stud-ied. Many good synthesizers have been into commercial practice. However thistechnology need a large speech database, which limits its application into PDAand mobile et.al. Traditional parametric synthesis methods such as formant-basedsynthesis can change the parameters on spectrum,meantime the °exibility ofmodifying speech parameters makes formant synthesis need a smaller parametersdatabase to synthesis different speaker style. This paper is oriented to formantsynthesis to do some research work.The main contributions of this thesis include following issues:1° We compared two vocal source parametric models,in order to achieve acredible and exact algorithm,we chose KLGLOTT88 model as a sourcemodel. The model parameters are acquired by minimizing the error betweenreal vocal source and model source.By it the estimation problem is formu-lated as a convex optimization problem.The merit of this method is compu-tational e–ciency and global optimality. Synthesis experiment demonstratethe algorithm is valid and support trusting source excitation for formantsynthesizer.2° Formant synthesis o?ers so many degrees of freedom makes it di–cult toset all of those parameters in a way that yields natural sounding speech forexperiments. A tool is designed to help deal with the problems inherent inthis large dataset of parameters in a way that is optimal for experimentalcontrol.3° Formant synthesis’s merits and °aws are in detail discussed, especially manytypes of sound such as sonorant are di–cultly synthesized by formant syn-thesizer. A method that integrates formant synthesis and waveform con-catenation is offered. listening tests validate that it can produce a good result.In a word, in this thesis, we have made a lot of fruitful attempts and signi?cantprogresses to extract source model parameters on parametric speech synthesis.
修改评论