CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings 会议论文
, Brighton, UK, 2019.05.12-2019.05.15
作者:  Jiangyan Yi;  Jianhua Tao
浏览  |  Adobe PDF(273Kb)  |  收藏  |  浏览/下载:46/19  |  提交时间:2020/10/22
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition 会议论文
, Brighton, UK, 2019.05.12-2019.05.18
作者:  Jiangyan Yi;  Jianhua Tao;  Ye Bai
浏览  |  Adobe PDF(295Kb)  |  收藏  |  浏览/下载:122/53  |  提交时间:2020/10/22
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation 会议论文
, Brighton,UK, MAY 12-17,2019
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
浏览  |  Adobe PDF(429Kb)  |  收藏  |  浏览/下载:265/93  |  提交时间:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring 会议论文
, 奥地利, 2019.9.15-2019.9.19
作者:  Zou, Yuxiang;  Dong, Linhao;  Xu, Bo
浏览  |  Adobe PDF(637Kb)  |  收藏  |  浏览/下载:300/113  |  提交时间:2020/06/10
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 3, 页码: 621-630
作者:  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Bai, Ye
浏览  |  Adobe PDF(907Kb)  |  收藏  |  浏览/下载:457/97  |  提交时间:2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 卷号: 31, 期号: 5, 页码: 996-1009
作者:  Li, Haoran;  Zhu, Junnan;  Ma, Cong;  Zhang, Jiajun;  Zong, Chengqing
浏览  |  Adobe PDF(2826Kb)  |  收藏  |  浏览/下载:442/109  |  提交时间:2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision