CASIA OpenIR

Browse/Search Results:  1-10 of 195 Help

  Show only claimed items
Selected(0)Clear Items/Page:    Sort:
Multimodal Transformer Learning for Continuous Emotion Recognition 会议论文
, Barcelona, Spain, 2020.5.4-2020.5.8
Authors:  Huang, Jian;  Tao, Jianhua;  Liu, Bin;  Lian, Zheng;  Niu, Mingyue
View  |  Adobe PDF(334Kb)  |  Favorite  |  View/Download:117/42  |  Submit date:2020/06/20
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
View  |  Adobe PDF(154Kb)  |  Favorite  |  View/Download:105/49  |  Submit date:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
Authors:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
View  |  Adobe PDF(432Kb)  |  Favorite  |  View/Download:122/42  |  Submit date:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 页码: 1303-1314
Authors:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Favorite  |  View/Download:22/0  |  Submit date:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Focal Loss for Punctuation Prediction 会议论文
, 北京,中国, 2020.10.25-2020.10.29
Authors:  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Ye Bai;  Cunhang Fan
View  |  Adobe PDF(247Kb)  |  Favorite  |  View/Download:15/3  |  Submit date:2020/10/22
Synchronous Transformers for end-to-end Speech Recognition 会议论文
, Barcelona, Spain, 2020.05.04-2020.05.08
Authors:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
View  |  Adobe PDF(496Kb)  |  Favorite  |  View/Download:11/0  |  Submit date:2020/10/22
Expression Analysis Based on Face Regions in Real-world Conditions 期刊论文
International Journal of Automation and Computing, 2020, 卷号: 17, 期号: 1, 页码: 96-107
Authors:  Zheng Lian;  Ya Li;  Jian-Hua Tao;  Jian Huang;  Ming-Yue Niu
View  |  Adobe PDF(1364Kb)  |  Favorite  |  View/Download:1/0  |  Submit date:2021/02/22
Facial emotion analysis  face areas  class activation map  confusion matrix  concerned area.  
Forward-Backward Decoding Sequence for Regularizing End-to-End TTS 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 12, 页码: 2067-2079
Authors:  Zheng, Yibin;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan
Favorite  |  View/Download:53/0  |  Submit date:2020/03/30
Decoding  Training  Speech processing  Linguistics  Acoustics  Speech recognition  Forward-backward  regularization  encoder-decoder with attention  end-to-end  joint-training  TTS  
A Public Chinese Dataset for Language Model Adaptation 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 页码: 13
Authors:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
Favorite  |  View/Download:47/0  |  Submit date:2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM  
Deep imitator: handwriting calligraphy imitation via deep attention networks 期刊论文
Pattern Recogniton, 2019, 期号: 已接收, 页码: 已接收
Authors:  Zhao, Bocheng;  Tao, Jianhua;  Yang, Minghao;  Tian, Zhengkun;  Fan, Cunhang;  Bai, Ye
View  |  Adobe PDF(2498Kb)  |  Favorite  |  View/Download:137/53  |  Submit date:2020/01/05
calligraphy imitation, attention, mata-style matrix, condition Gated Recurrent Unit