CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共5条,第1-5条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文
, 新加坡, 2019-12-14
作者:  Fan ZY(范志赟);  Li J(李杰);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(361Kb)  |  收藏  |  浏览/下载:157/52  |  提交时间:2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector  
A Unified Multi-output Semi-supervised Network for 3D Face Reconstruction 会议论文
, Budapest, 2019-07
作者:  Wang, Pengrui;  Tian, Yi;  Che, Wujun;  Xu, Bo
浏览  |  Adobe PDF(1744Kb)  |  收藏  |  浏览/下载:267/73  |  提交时间:2020/09/11
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR using Self-attention Network and Chunk-hopping 会议论文
, Brighton, United Kingdom, 2019-05
作者:  Dong, Linhao;  Wang, Feng;  Xu, Bo
浏览  |  Adobe PDF(930Kb)  |  收藏  |  浏览/下载:237/42  |  提交时间:2020/06/13
speech recognition  self-attention network  encoder-decoder  end-to-end  latency-control  
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition 会议论文
, Sydney, Australia, 2019-9-20 ~ 2019-9-25
作者:  Sheng, Fenfen;  Chen, Zhineng;  Xu, Bo
浏览  |  Adobe PDF(455Kb)  |  收藏  |  浏览/下载:228/65  |  提交时间:2020/06/12
Pyrboxes: An efficient multi-scale scene text detector with feature pyramids 期刊论文
PATTERN RECOGNITION LETTERS, 2019, 卷号: 125, 期号: 2019, 页码: 228-234
作者:  Sheng, Fenfen;  Chen, Zhineng;  Zhang, Wei;  Xu, Bo
Adobe PDF(1558Kb)  |  收藏  |  浏览/下载:320/48  |  提交时间:2019/12/16
Scene text detection  Multi-scale text detection  Grouped pyramid module  Efficient and effective