CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共28条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2022, 页码: 762-766
作者:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:249/75  |  提交时间:2022/06/14
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition 会议论文
, Tokyo, Japan, 14-17 December 2021
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:158/39  |  提交时间:2022/06/14
MULTI-SCALE AND MULTI-REGION FACIAL DISCRIMINATIVE REPRESENTATION FOR AUTOMATIC DEPRESSION LEVEL PREDICTION 会议论文
, 加拿大多伦多, 2021-6
作者:  MIngyue Niu;  Jianhua Tao;  Bin Liu
Adobe PDF(1629Kb)  |  收藏  |  浏览/下载:161/48  |  提交时间:2021/06/01
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
作者:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
浏览  |  Adobe PDF(432Kb)  |  收藏  |  浏览/下载:579/104  |  提交时间:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
Synchronous Transformers for end-to-end Speech Recognition 会议论文
, Barcelona, Spain, 2020.05.04-2020.05.08
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
浏览  |  Adobe PDF(496Kb)  |  收藏  |  浏览/下载:143/47  |  提交时间:2020/10/22
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:260/57  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Atention Mechanism 会议论文
, Seattle, United States, 12-16 October, 2020
作者:  Licai Sun;  Zheng Lian;  Jianhua Tao;  Bin Liu;  Mingyue Niu
Adobe PDF(1041Kb)  |  收藏  |  浏览/下载:152/48  |  提交时间:2021/06/16
Multimodal Spatiotemporal Representation for Automatic Depression Level Detection 期刊论文
IEEE Transactions on Affective Computing, 2020, 期号: 0, 页码: 0
作者:  Mingyue Niu;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zheng Lian
Adobe PDF(2831Kb)  |  收藏  |  浏览/下载:166/49  |  提交时间:2021/06/01
Multimodal depression detection  Spatio-Temporal Attention  Audio/Video Segment-Level Feature  Eigen Evolution Pooling  Audio/Video Level Feature  Multimodal Attention Feature Fusion  
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features 会议论文
, Graz, Austria, September 15–19, 2019
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(320Kb)  |  收藏  |  浏览/下载:113/39  |  提交时间:2021/06/01
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation 会议论文
, Brighton,UK, MAY 12-17,2019
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
Adobe PDF(429Kb)  |  收藏  |  浏览/下载:211/73  |  提交时间:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation