CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                                    
已选(0)清除 条数/页:   排序方式:
F-0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 3375-3383
作者:  Li, Yongwei;  Tao, Jianhua;  Erickson, Donna;  Liu, Bin;  Akagi, Masato
收藏  |  浏览/下载:152/0  |  提交时间:2021/12/28
Speech recognition  Iterative methods  Production  Estimation  Brain modeling  Shape  Low-frequency noise  Glottal source  vocal tract  source-filter model  ARX-LF model  
Intelligent Signal Processing for Affective Computing [From the Guest Editors] 期刊论文
IEEE SIGNAL PROCESSING MAGAZINE, 2021, 卷号: 38, 期号: 6, 页码: 9-11
作者:  Schuller, Bjoern W.;  Picard, Rosalind;  Andre, Elisabeth;  Gratch, Jonathan;  Tao, Jianhua
收藏  |  浏览/下载:131/0  |  提交时间:2021/12/28
Deep Learning for Mobile Mental Health: Challenges and recent advances 期刊论文
IEEE SIGNAL PROCESSING MAGAZINE, 2021, 卷号: 38, 期号: 6, 页码: 96-105
作者:  Han, Jing;  Zhang, Zixing;  Mascolo, Cecilia;  Andre, Elisabeth;  Tao, Jianhua;  Zhao, Ziping;  Schuller, Bjoern W.
收藏  |  浏览/下载:153/0  |  提交时间:2021/12/28
Multi-aspect self-supervised learning for heterogeneous information network 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2021, 卷号: 233, 页码: 14
作者:  Che, Feihu;  Tao, Jianhua;  Yang, Guohua;  Liu, Tong;  Zhang, Dawei
Adobe PDF(2661Kb)  |  收藏  |  浏览/下载:251/51  |  提交时间:2021/12/28
Heterogeneous information network  Self-supervised  Contrastive learning  Graph neural network  
Design and Analysis of a Human-Machine Interaction System for Researching Human's Dynamic Emotion 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 卷号: 51, 期号: 10, 页码: 6111-6121
作者:  Sun, Xiao;  Pei, Zhengmeng;  Zhang, Chen;  Li, Guoqiang;  Tao, Jianhua
收藏  |  浏览/下载:212/0  |  提交时间:2021/11/04
Heuristic algorithms  Hidden Markov models  Robots  Sun  Vehicle dynamics  Task analysis  Deep learning  Emotional guidance  emotional transfer  human-machine interaction  Markov chain Monte Carlo  
Self-supervised graph representation learning via bootstrapping 期刊论文
NEUROCOMPUTING, 2021, 卷号: 456, 页码: 88-96
作者:  Che, Feihu;  Yang, Guohua;  Zhang, Dawei;  Tao, Jianhua;  Liu, Tong
Adobe PDF(1379Kb)  |  收藏  |  浏览/下载:396/68  |  提交时间:2021/11/03
Graph representation learning  Self-supervised  Bootstrapping  Graph neural network  
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 1340-1351
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Tian, Zhengkun;  Zhang, Shuai
收藏  |  浏览/下载:196/0  |  提交时间:2021/06/07
End-to-End  language modeling  speech recognition  teacher-student learning  transfer learning  
Exploiting the directional coherence function for multichannel source extraction 期刊论文
SPEECH COMMUNICATION, 2021, 卷号: 128, 页码: 1-14
作者:  Liang, Shan;  Li, Guanjun;  Nie, Shuai;  Yang, ZhanLei;  Liu, WenJu;  Tao, Jianhua
收藏  |  浏览/下载:233/0  |  提交时间:2021/05/06
Directional coherence function  Coherent-to-Diffuse Ratio  General sidelobe canceller  Desired Speech Presence Probability  
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:392/62  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:430/56  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer