CASIA OpenIR

Browse/search results: 16 items in total, showing items 1-10

SPEAKER-AWARE SPEECH-TRANSFORMER (conference paper)
Singapore, 2019-12-14
Authors: Fan ZY (范志赟); Li J (李杰); Zhou SY (周世玉); Xu B (徐波)
Adobe PDF (361Kb) | Views/Downloads: 146/48 | Submitted: 2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector  
Self-Attention Transducers for End-to-End Speech Recognition (conference paper)
Graz, Austria, September 15–19, 2019
Authors: Zhengkun Tian; Jiangyan Yi; Jianhua Tao; Ye Bai; Zhengqi Wen
Adobe PDF (278Kb) | Views/Downloads: 88/36 | Submitted: 2022/06/14
End-to-End Speech Translation with Knowledge Distillation (conference paper)
Graz, Austria, Sep. 15-19, 2019
Authors: Yuchen Liu; Hao Xiong; Jiajun Zhang; Zhongjun He; Hua Wu; Haifeng Wang; Chengqing Zong
Adobe PDF (700Kb) | Views/Downloads: 142/52 | Submitted: 2021/06/01
Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition (conference paper)
Graz, Austria, 2019-9-15
Authors: Liu, Bin; Nie, Shuai; Liang, Shan; Liu, Wenju; Yu, Meng; Chen, Lianwu; Peng, Shouye; Li, Changliang
Adobe PDF (350Kb) | Views/Downloads: 232/98 | Submitted: 2020/05/15
End-to-end Speech Recognition  Robust Speech Recognition  Speech Enhancement  Generative Adversarial Networks  
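Research on Automatic Optimization Methods for Multilingual Speech Databases (多语言语音数据库自动优化方法研究) (conference paper)
Xining, Qinghai, 2019-8
Authors: Fu Ruibo (傅睿博); Tao Jianhua (陶建华); Wen Zhengqi (温正棋); Yi Jiangyan (易江燕); Wang Shiming (王诗明); Qiang Chunyu (强春雨)
Adobe PDF (542Kb) | Views/Downloads: 344/113 | Submitted: 2020/06/24
speech database optimization  speech synthesis  multilingual  data-pair matching degree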
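Deep Context Modeling for User Behavior Sequences (面向用户行为序列的深度上下文建模) (thesis)
Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2019
Author: Cui Qiang (崔强)
Adobe PDF (12117Kb) | Views/Downloads: 292/16 | Submitted: 2019/06/18
context information  deep learning  user behavior sequence  recurrent neural network  attention mechanism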
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR using Self-attention Network and Chunk-hopping (conference paper)
Brighton, United Kingdom, 2019-05
Authors: Dong, Linhao; Wang, Feng; Xu, Bo
Adobe PDF (930Kb) | Views/Downloads: 215/39 | Submitted: 2020/06/13
speech recognition  self-attention network  encoder-decoder  end-to-end  latency-control  
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video (journal article)
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, Volume: 31, Issue: 5, Pages: 996-1009
Authors: Li, Haoran; Zhu, Junnan; Ma, Cong; Zhang, Jiajun; Zong, Chengqing
Adobe PDF (2826Kb) | Views/Downloads: 365/97 | Submitted: 2019/07/12
Summarization  multimedia  multi-modal  cross-modal  natural language processing  computer vision  
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, Volume: 27, Issue: 3, Pages: 621-630
Authors: Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi; Bai, Ye
Adobe PDF (907Kb) | Views/Downloads: 373/83 | Submitted: 2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
Adapting translation models for transcript disfluency detection (conference paper)
Hawaii, 2019-2
Authors: Dong QQ (董倩倩); Feng Wang (王峰); Zhen Yang (杨振); Wei Chen (陈炜); Shuang Xu (徐爽); Bo Xu (徐波)
Adobe PDF (287Kb) | Views/Downloads: 132/16 | Submitted: 2021/06/24