CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共20条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization 会议论文
, Brno, Czechia, 30 August – 3 September
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(839Kb)  |  收藏  |  浏览/下载:180/41  |  提交时间:2022/06/14
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:163/44  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  收藏  |  浏览/下载:178/53  |  提交时间:2021/06/01
Synchronous Transformers for end-to-end Speech Recognition 会议论文
, Barcelona, Spain, 2020.05.04-2020.05.08
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
浏览  |  Adobe PDF(496Kb)  |  收藏  |  浏览/下载:149/47  |  提交时间:2020/10/22
Expression Analysis Based on Face Regions in Real-world Conditions 期刊论文
International Journal of Automation and Computing, 2020, 卷号: 17, 期号: 1, 页码: 96-107
作者:  Zheng Lian;  Ya Li;  Jian-Hua Tao;  Jian Huang;  Ming-Yue Niu
浏览  |  Adobe PDF(1364Kb)  |  收藏  |  浏览/下载:208/42  |  提交时间:2021/02/22
Facial emotion analysis  face areas  class activation map  confusion matrix  concerned area.  
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features 会议论文
, Graz, Austria, September 15–19, 2019
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(320Kb)  |  收藏  |  浏览/下载:115/39  |  提交时间:2021/06/01
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation 会议论文
, Brighton,UK, MAY 12-17,2019
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
浏览  |  Adobe PDF(429Kb)  |  收藏  |  浏览/下载:212/74  |  提交时间:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Semi-supervised Ladder Networks for Speech Emotion Recognition 期刊论文
International Journal of Automation and Computing, 2019, 卷号: 16, 期号: 4, 页码: 437-448
作者:  Tao, Jianhua;  Huang, Jian;  Li, Ya;  Lian, Zheng;  Niu, Mingyue
浏览  |  Adobe PDF(1025Kb)  |  收藏  |  浏览/下载:260/55  |  提交时间:2020/06/20
Speech emotion recognition  the ladder network  semi-supervised learning  autoencoder  regularization  
Deep Learning Based Speech Separation via NMF-style Reconstructions 期刊论文
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, 卷号: 26, 期号: 11, 页码: 2043-2055
作者:  Shuai Nie;  Shan Liang;  Wenju Liu;  Xueliang Zhang;  Jianhua Tao
浏览  |  Adobe PDF(2922Kb)  |  收藏  |  浏览/下载:190/75  |  提交时间:2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer 会议论文
, 印度海得拉巴, 2018-9
作者:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
浏览  |  Adobe PDF(323Kb)  |  收藏  |  浏览/下载:253/57  |  提交时间:2020/06/27
speech synthesis  unit-selection  target cost  deep metric learning