CASIA OpenIR

Browse/Search results: 19 items in total, showing items 1-10

SPEAKER-AWARE SPEECH-TRANSFORMER  Conference paper
, Singapore, 2019-12-14
Authors:  Fan ZY (范志赟);  Li J (李杰);  Zhou SY (周世玉);  Xu B (徐波)
Adobe PDF (361Kb)  |  Views/Downloads: 160/53  |  Submitted: 2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector  
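鸡尾酒会问题与相关听觉模型的研究现状与展望 (Research Advances and Perspectives on the Cocktail Party Problem and Related Auditory Models)  Journal article
Acta Automatica Sinica (自动化学报), 2019, Volume: 45, Issue: 2, Pages: 234-251
Authors:  Huang Yating (黄雅婷);  Shi Jing (石晶);  Xu Jiaming (许家铭);  Xu Bo (徐波)
Adobe PDF (3009Kb)  |  Views/Downloads: 187/59  |  Submitted: 2022/09/17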
Self-Attention Transducers for End-to-End Speech Recognition  Conference paper
, Graz, Austria, September 15–19, 2019
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Ye Bai;  Zhengqi Wen
Adobe PDF (278Kb)  |  Views/Downloads: 100/38  |  Submitted: 2022/06/14
Conversational Emotion Analysis via Attention Mechanisms  Conference paper
, Graz, Austria, 15-19 September, 2019
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang
Adobe PDF (317Kb)  |  Views/Downloads: 151/51  |  Submitted: 2021/06/16
Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition  Conference paper
, Graz, Austria, 15-19 September, 2019
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang
Adobe PDF (373Kb)  |  Views/Downloads: 87/28  |  Submitted: 2021/06/16
Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings  Conference paper
, Brighton, UK, 2019.05.12-2019.05.15
Authors:  Jiangyan Yi;  Jianhua Tao
Adobe PDF (273Kb)  |  Views/Downloads: 42/16  |  Submitted: 2020/10/22
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition  Conference paper
, Brighton, UK, 2019.05.12-2019.05.18
Authors:  Jiangyan Yi;  Jianhua Tao;  Ye Bai
Adobe PDF (295Kb)  |  Views/Downloads: 86/40  |  Submitted: 2020/10/22
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation  Conference paper
, Brighton, UK, May 12-17, 2019
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
Adobe PDF (429Kb)  |  Views/Downloads: 234/82  |  Submitted: 2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Efficient Modeling of Long Temporal Contexts for Continuous Emotion Recognition  Conference paper
, Cambridge, United Kingdom, 2019.9.3-2019.9.6
Authors:  Huang, Jian;  Tao, Jianhua;  Liu, Bin;  Lian, Zhen;  Niu, Mingyue
Adobe PDF (420Kb)  |  Views/Downloads: 227/65  |  Submitted: 2020/06/20
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR using Self-attention Network and Chunk-hopping  Conference paper
, Brighton, United Kingdom, 2019-05
Authors:  Dong, Linhao;  Wang, Feng;  Xu, Bo
Adobe PDF (930Kb)  |  Views/Downloads: 240/42  |  Submitted: 2020/06/13
speech recognition  self-attention network  encoder-decoder  end-to-end  latency-control