Knowledge Commons of Institute of Automation, CAS
Title | Supervisory Data Alignment for Text-Independent Voice Conversion |
Authors | Tao, Jianhua; Zhang, Meng; Nurminen, Jani; Tian, Jilei; Wang, Xia |
Journal | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING |
Publication date | 2010-07-01 |
Volume | 18; Issue: 5; Pages: 932-943 |
Article type | Article |
Abstract | We propose new supervisory data alignment methods for text-independent voice conversion which do not need parallel training corpora. Phonetic information is used as a restriction during alignment for mapping the data from the source speaker onto the parameter space of a target speaker. Both linear and nonlinear methods are derived by considering alignment accuracy and topology preservation. For the linear alignment, we consider common phoneme clusters of the source and target space as benchmarks and adapt the source data vector to the target space while maintaining the relative phonetic positions among neighborhood clusters. In order to preserve the topological structure of the source parameter space and improve the stability of conversion and the accuracy of the phonetic mapping, a supervised self-organizing learning algorithm considering phonetic restriction is proposed for iteratively improving the alignment outcome of the previous step. Both the linear and nonlinear methods can also be applied in the cross-lingual case. Evaluation results show that the proposed methods improve the performance of alignment in terms of both alignment accuracy and stability for text-independent voice conversion in intra-lingual and cross-lingual cases. |
Keywords | Data Alignment Self-organized Learning Supervisory Phonetic Restriction Text-independent Voice Conversion |
WOS headings | Science & Technology ; Technology |
Keywords [WOS] | TRANSFORMATION ; NETWORKS |
Indexed by | SCI |
Language | English |
WOS research area | Acoustics ; Engineering |
WOS subject category | Acoustics ; Engineering, Electrical & Electronic |
WOS accession number | WOS:000278814600004 |
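Document type | Journal article |
Item identifier | http://ir.ia.ac.cn/handle/173211/40952 |
Collection | State Key Laboratory of Multimodal Artificial Intelligence Systems_Intelligent Interaction |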
Recommended citation, GB/T 7714 | Tao, Jianhua, Zhang, Meng, Nurminen, Jani, et al. Supervisory Data Alignment for Text-Independent Voice Conversion[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18(5): 932-943. |
APA | Tao, Jianhua, Zhang, Meng, Nurminen, Jani, Tian, Jilei, & Wang, Xia. (2010). Supervisory Data Alignment for Text-Independent Voice Conversion. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 18(5), 932-943. |
MLA | Tao, Jianhua, et al. "Supervisory Data Alignment for Text-Independent Voice Conversion." IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 18.5 (2010): 932-943. |