CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition 会议论文
, Toronto, Canada, 2021-06-06
作者:  Minglun Han;  Linhao Dong;  Shiyu Zhou;  Bo Xu
Adobe PDF(469Kb)  |  收藏  |  浏览/下载:124/37  |  提交时间:2023/05/29
Contextual Speech Recognition  Automatic Speech Recognition  Context Biasing  
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文
Signal Processing Letters, 2022, 页码: 1551-1554
作者:  Fan ZY(范志赟);  Dong LH(董林昊);  Cai M(蔡猛);  Ma ZJ(马泽君);  Xu B(徐波)
Adobe PDF(404Kb)  |  收藏  |  浏览/下载:160/38  |  提交时间:2022/09/17
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Yunzhe Hao;  Jiaming Xu;  Jing Shi;  Peng Zhang;  Lei Qin;  Bo Xu
Adobe PDF(399Kb)  |  收藏  |  浏览/下载:205/52  |  提交时间:2022/06/23
WASE: LEARNING WHEN TO ATTEND FOR SPEAKER EXTRACTION IN COCKTAIL PARTY ENVIRONMENTS 会议论文
, Toronto, June 6-11, 2021
作者:  Yunzhe Hao;  Jiaming Xu;  Peng Zhang;  Bo Xu
Adobe PDF(2034Kb)  |  收藏  |  浏览/下载:211/32  |  提交时间:2022/06/23
Consecutive decoding for speech-to-text translation 会议论文
, Virtual, 2021-2
作者:  Dong QQ(董倩倩);  Mingxuan Wang(王明轩);  Hao Zhou(周浩);  Shuang Xu(徐爽);  Bo Xu(徐波);  Lei Li(李磊)
Adobe PDF(586Kb)  |  收藏  |  浏览/下载:203/64  |  提交时间:2021/06/24
Listen, understand and translate: triple supervision decouples end-to-endspeech-to-text translation 会议论文
, Virtual, 2021-2
作者:  Dong QQ(董倩倩);  Rong Ye(叶蓉);  Mingxuan Wang(王明轩);  Hao Zhou(周浩);  Shuang Xu(徐爽);  Bo Xu(徐波);  Lei Li(李磊)
Adobe PDF(991Kb)  |  收藏  |  浏览/下载:184/37  |  提交时间:2021/06/24
Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文
0, 线上会议, 2021-4-23
作者:  Zhang Peng;  Xu Jiaming;  Hao Yunzhe;  Xu Bo
Adobe PDF(532Kb)  |  收藏  |  浏览/下载:203/47  |  提交时间:2021/06/21
audio-visual speech separation  online processing  generative adversarial training  causal temporal convolutional network  
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:207/52  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach  
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition 会议论文
, 在线会议, 2020-05
作者:  Dong, Linhao;  Xu, Bo
浏览  |  Adobe PDF(641Kb)  |  收藏  |  浏览/下载:313/68  |  提交时间:2020/06/13
continuous integrate-and-fire  end-to-end model  soft and monotonic alignment  online speech recognition  acoustic boundary positioning  
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR using Self-attention Network and Chunk-hopping 会议论文
, Brighton, United Kingdom, 2019-05
作者:  Dong, Linhao;  Wang, Feng;  Xu, Bo
浏览  |  Adobe PDF(930Kb)  |  收藏  |  浏览/下载:230/42  |  提交时间:2020/06/13
speech recognition  self-attention network  encoder-decoder  end-to-end  latency-control