CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共33条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Continual Learning for Fake Audio Detection 会议论文
, 线上(捷克), 2021-9
作者:  Ma Haoxin;  Yi Jiangyan;  Tao Jianhua;  Bai Ye;  Tian Zhengkun;  Wang Chenglong
Adobe PDF(2113Kb)  |  收藏  |  浏览/下载:224/58  |  提交时间:2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:163/44  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
浏览  |  Adobe PDF(154Kb)  |  收藏  |  浏览/下载:329/84  |  提交时间:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
作者:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
浏览  |  Adobe PDF(432Kb)  |  收藏  |  浏览/下载:598/105  |  提交时间:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition 会议论文
, shanghai, 2020
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:109/28  |  提交时间:2021/06/25
A Public Chinese Dataset for Language Model Adaptation 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 页码: 13
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
收藏  |  浏览/下载:265/0  |  提交时间:2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM  
普通话语音识别中的神经网络语言模型的比较研究 会议论文
, 青海西宁, 2019-8
作者:  马浩鑫;  白烨;  易江燕;  陶建华
Adobe PDF(621Kb)  |  收藏  |  浏览/下载:184/58  |  提交时间:2022/06/20
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation 会议论文
, Brighton,UK, MAY 12-17,2019
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
浏览  |  Adobe PDF(429Kb)  |  收藏  |  浏览/下载:212/74  |  提交时间:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 3, 页码: 621-630
作者:  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Bai, Ye
浏览  |  Adobe PDF(907Kb)  |  收藏  |  浏览/下载:388/84  |  提交时间:2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
Semi-supervised Ladder Networks for Speech Emotion Recognition 期刊论文
International Journal of Automation and Computing, 2019, 卷号: 16, 期号: 4, 页码: 437-448
作者:  Tao, Jianhua;  Huang, Jian;  Li, Ya;  Lian, Zheng;  Niu, Mingyue
Adobe PDF(1025Kb)  |  收藏  |  浏览/下载:260/55  |  提交时间:2020/06/20
Speech emotion recognition  the ladder network  semi-supervised learning  autoencoder  regularization