CASIA OpenIR

Browse/Search results: 44 items in total, showing items 1-10

A dual-mode real-time lip-sync system for a bionic dinosaur robot | Conference paper
Xi'an, China, November 30 - December 2, 2018
Authors:  Yan, Shuaizheng;  Hao, Jiasheng;  Wu, Zhengxing
Adobe PDF(700Kb)  |  Views/Downloads: 54/13  |  Submitted: 2023/06/12
Dinosaurs  Robots  Time-domain analysis  Frequency-domain analysis  Feature extraction  
Norm-based Noisy Corpora Filtering and Refurbishing in Neural Machine Translation | Conference paper
Online, 2022-12
Authors:  Yu, Lu;  Jiajun, Zhang
Adobe PDF(983Kb)  |  Views/Downloads: 87/29  |  Submitted: 2023/05/31
Neural machine translation  
Research on Voiceprint Recognition of Camouflage Voice Based on Deep Belief Network | Journal article
International Journal of Automation and Computing, 2021, Volume: 18, Issue: 6, Pages: 947-962
Authors:  Nan Jiang;  Ting Liu
Adobe PDF(1905Kb)  |  Views/Downloads: 206/46  |  Submitted: 2021/11/26
Disguised voice recognition  deep belief network  feature extraction  Gammatone frequency cepstrum coefficients (GFCC)  dropout  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT | Journal article
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, Issue: 29, Pages: 1897-1911
Authors:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  Views/Downloads: 189/56  |  Submitted: 2021/06/25
End-to-end speech recognition  transfer learning  knowledge distillation  teacher-student learning  BERT  non-autoregressive speech recognition  
DECN: Dialogical Emotion Correction Network for Conversational Emotion Recognition | Journal article
Neurocomputing, 2021, Issue: 0, Pages: 0
Authors:  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2238Kb)  |  Views/Downloads: 152/29  |  Submitted: 2021/06/16
Emotion recognition in conversations (ERC)  Context-sensitive modeling  Dialogical Emotion Correction Network (DECN)  Interaction modeling  
CTNet: Conversational Transformer Network for Emotion Recognition | Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 985-1000
Authors:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  Views/Downloads: 356/59  |  Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition | Journal article
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 985-997
Authors:  Jiangyan Yi;  Zhengqi Wen;  Jianhua Tao;  Hao Ni;  Bin Liu
Adobe PDF(1416Kb)  |  Views/Downloads: 144/56  |  Submitted: 2020/10/22
multi-accent, Mandarin speech recognition,LSTM-RNN-CTC, model adaptation, CTC regularization  
Deep Learning Based Speech Separation via NMF-style Reconstructions | Journal article
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, Volume: 26, Issue: 11, Pages: 2043-2055
Authors:  Shuai Nie;  Shan Liang;  Wenju Liu;  Xueliang Zhang;  Jianhua Tao
Adobe PDF(2922Kb)  |  Views/Downloads: 210/80  |  Submitted: 2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis | Conference paper
Hyderabad, India, 2018-9
Authors:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Adobe PDF(340Kb)  |  Views/Downloads: 246/51  |  Submitted: 2020/06/27
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features | Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, Volume: 28, Issue: 28, Pages: 1303-1314
Authors:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  Views/Downloads: 298/64  |  Submitted: 2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training