Supervisory Data Alignment for Text-Independent Voice Conversion
Tao, Jianhua; Zhang, Meng; Nurminen, Jani; Tian, Jilei; Wang, Xia
Journal: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Publication date: 2010-07-01
Volume: 18; Issue: 5; Pages: 932-943
Article type: Article
Abstract: We propose new supervisory data alignment methods for text-independent voice conversion which do not need parallel training corpora. Phonetic information is used as a restriction during alignment for mapping the data from the source speaker onto the parameter space of a target speaker. Both linear and nonlinear methods are derived by considering alignment accuracy and topology preservation. For the linear alignment, we consider common phoneme clusters of the source and target space as benchmarks and adapt the source data vector to the target space while maintaining the relative phonetic positions among neighborhood clusters. In order to preserve the topological structure of the source parameter space and improve the stability of conversion and the accuracy of the phonetic mapping, a supervised self-organizing learning algorithm considering phonetic restriction is proposed for iteratively improving the alignment outcome of the previous step. Both the linear and nonlinear methods can also be applied in the cross-lingual case. Evaluation results show that the proposed methods improve the performance of alignment in terms of both alignment accuracy and stability for text-independent voice conversion in intra-lingual and cross-lingual cases.
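The abstract gives no formulas, so the following is only a rough illustration of the linear alignment idea, not the authors' algorithm. It assumes each phoneme cluster is summarized by its mean feature vector, and it shifts a source frame by an inverse-distance-weighted average of the source-to-target centroid offsets of its nearest shared clusters. The function names `centroid` and `align_frame`, the parameter `k`, and the weighting rule are all hypothetical choices made for this sketch.

```python
import math


def centroid(frames):
    """Mean vector of a list of equal-length feature vectors."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]


def align_frame(x, src_centroids, tgt_centroids, k=2, eps=1e-9):
    """Shift source frame x toward the target speaker's parameter space.

    Assumed rule (not taken from the paper): add an inverse-distance-weighted
    average of the centroid offsets (target minus source) of the k nearest
    phoneme clusters that both speakers share.
    """
    common = sorted(set(src_centroids) & set(tgt_centroids))
    if not common:
        raise ValueError("no common phoneme clusters")
    # Rank the shared clusters by distance from x in the source space.
    nearest = sorted(common, key=lambda p: math.dist(x, src_centroids[p]))[:k]
    # Closer clusters get larger weights; eps avoids division by zero.
    weights = [1.0 / (math.dist(x, src_centroids[p]) + eps) for p in nearest]
    total = sum(weights)
    aligned = list(x)
    for p, w in zip(nearest, weights):
        for d in range(len(x)):
            offset = tgt_centroids[p][d] - src_centroids[p][d]
            aligned[d] += (w / total) * offset
    return aligned
```

In use, the centroid dictionaries would be keyed by phoneme label and built from labeled frames of each speaker, for example `align_frame([1.0, 0.5], {"a": [0.0, 0.0], "i": [2.0, 0.0]}, {"a": [1.0, 1.0], "i": [3.0, 1.0]})`. Because the shift is interpolated from neighbouring clusters, a frame lying between two source clusters lands between the corresponding target clusters, which is one simple way to keep relative phonetic positions. The supervised self-organizing refinement described in the abstract is not sketched here.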
Keywords: Data Alignment; Self-organized Learning; Supervisory Phonetic Restriction; Text-independent Voice Conversion
WOS headings: Science & Technology ; Technology
Keywords [WOS]: TRANSFORMATION ; NETWORKS
Indexed by: SCI
Language: English
WOS research area: Acoustics ; Engineering
WOS category: Acoustics ; Engineering, Electrical & Electronic
WOS accession number: WOS:000278814600004
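Citation statistics
Times cited: 18 [WOS]
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/40952
Collection: State Key Laboratory of Multimodal Artificial Intelligence Systems_Intelligent Interaction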
Recommended citation
GB/T 7714: Tao, Jianhua, Zhang, Meng, Nurminen, Jani, et al. Supervisory Data Alignment for Text-Independent Voice Conversion[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18(5): 932-943.
APA: Tao, Jianhua, Zhang, Meng, Nurminen, Jani, Tian, Jilei, & Wang, Xia. (2010). Supervisory Data Alignment for Text-Independent Voice Conversion. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 18(5), 932-943.
MLA: Tao, Jianhua, et al. "Supervisory Data Alignment for Text-Independent Voice Conversion". IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 18.5 (2010): 932-943.