CASIA OpenIR

Browse/Search Results:  1-10 of 35 Help

Selected(0)Clear Items/Page:    Sort:
拍照票据图像识别方法与系统 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2019
Authors:  王淼
Adobe PDF(4089Kb)  |  Favorite  |  View/Download:106/3  |  Submit date:2019/06/13
图像质量评估  文字检测  文字识别  卷积神经网络  
低资源语言的多语言语音识别建模方法研究 学位论文
, 北京: 中国科学院研究生院, 2018
Authors:  周世玉
Adobe PDF(2353Kb)  |  Favorite  |  View/Download:213/2  |  Submit date:2018/12/20
语音识别  多语言  低资源  跨语言  端到端  多语言语音识别  中 英混合语音识别  Asr  Multilingual  Low-resource  Cross-language  Sequence-to-sequence  Multilingual Speech Recognition  English-mandarin Bilingual Speech Recognition  
Max Margin Cosine Loss for Speaker Identification on Short Utterances 会议论文
, 中国,台湾, 2018-11
Authors:  Ji RF(吉瑞芳);  Cao JH(曹俊华);  Cai XY(蔡新元);  Xu B(徐波)
View  |  Adobe PDF(946Kb)  |  Favorite  |  View/Download:95/49  |  Submit date:2019/04/30
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 985-997
Authors:  Yi, Jiangyan;  Wen, Zhengqi;  Tao, Jianhua;  Ni, Hao;  Liu, Bin;  Wen ZQ(温正棋)
View  |  Adobe PDF(1416Kb)  |  Favorite  |  View/Download:220/86  |  Submit date:2018/01/04
Multi-accent  Mandarin Speech Recognition  Lstm-rnn-ctc  Model Adaptation  Ctc Regularization  
无权访问的条目 学位论文
Authors:  易江燕
Adobe PDF(2091Kb)  |  Favorite  |  View/Download:13/2  |  Submit date:2018/05/31
语音合成声学建模技术研究 学位论文
, 北京: 中国科学院研究生院, 2018
Authors:  王文富
Adobe PDF(4177Kb)  |  Favorite  |  View/Download:162/6  |  Submit date:2018/06/07
语音合成  声学建模  门控循环混合密度网络  卷积输出层  对抗学习  端到端  
Drawing and Recognizing Chinese Characters with Recurrent Neural Network 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 卷号: 40, 期号: 4, 页码: 849-862
Authors:  Zhang, Xu-Yao;  Yin, Fei;  Zhang, Yan-Ming;  Liu, Cheng-Lin;  Bengio, Yoshua
View  |  Adobe PDF(824Kb)  |  Favorite  |  View/Download:245/128  |  Submit date:2017/09/16
Recurrent Neural Network  Lstm  Gru  Discriminative Model  Generative Model  Handwriting  
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese 会议论文
Interspeech, 印度的海德拉巴, 2018
Authors:  Shiyu Zhou;  Linhao Dong;  Shuang Xu;  Bo Xu
View  |  Adobe PDF(416Kb)  |  Favorite  |  View/Download:94/23  |  Submit date:2018/12/20
Asr  Multi-head Attention  Syllable Based Acoustic Modeling  Sequence-to-sequence  
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese 会议论文
ICONIP, Siem Reap, Cambodia, 2018
Authors:  Shiyu Zhou;  Linhao Dong;  Shuang Xu;  Bo Xu
View  |  Adobe PDF(335Kb)  |  Favorite  |  View/Download:80/18  |  Submit date:2018/12/20
Asr  Multi-head Attention  Modeling Units  Sequence-to-sequence  Transformer  
RGB-D-based Human Motion Recognition with Deep Learning: A Survey 期刊论文
Computer Vision and Image Understanding, 2018, 卷号: PP, 期号: 1, 页码: 1-22
Authors:  Pichao Wang;  Wanqing Li;  Philip Ogunbona;  Jun Wan;  Sergio Escalera
View  |  Adobe PDF(5390Kb)  |  Favorite  |  View/Download:74/4  |  Submit date:2018/10/04
Human Motion Recognition  Rgb-d Data  Deep Learning  Survey