CASIA OpenIR

Browse/Search Results: 69 items in total, showing items 1-10

PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 12
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2013 KB)  |  Views/Downloads: 349/0  |  Submitted: 2022/09/19
Emotion recognition  Iterative methods  Context modeling  Psychology  Oral communication  Logic gates  Learning systems  Contextual information  emotion recognition in conversation (ERC)  iterative method  Personality-enhanced Iterative Refinement Network (PIRNet)  personality influence  
Continual Learning for Fake Audio Detection (Conference paper)
Online (Czech Republic), 2021-9
Authors: Ma Haoxin; Yi Jiangyan; Tao Jianhua; Bai Ye; Tian Zhengkun; Wang Chenglong
Adobe PDF (2113 KB)  |  Views/Downloads: 249/63  |  Submitted: 2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition (Conference paper)
Toronto, ON, Canada, 2021-6-11
Author: Shuai Zhang
Adobe PDF (1462 KB)  |  Views/Downloads: 119/34  |  Submitted: 2022/06/17
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors: Wang, Tao; Fu, Ruibo; Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 262/0  |  Submitted: 2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
F0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 3375-3383
Authors: Li, Yongwei; Tao, Jianhua; Erickson, Donna; Liu, Bin; Akagi, Masato
Views/Downloads: 130/0  |  Submitted: 2021/12/28
Speech recognition  Iterative methods  Production  Estimation  Brain modeling  Shape  Low-frequency noise  Glottal source  vocal tract  source-filter model  ARX-LF model  
Self-supervised graph representation learning via bootstrapping (Journal article)
NEUROCOMPUTING, 2021, Volume: 456, Pages: 88-96
Authors: Che, Feihu; Yang, Guohua; Zhang, Dawei; Tao, Jianhua; Liu, Tong
Adobe PDF (1379 KB)  |  Views/Downloads: 366/60  |  Submitted: 2021/11/03
Graph representation learning  Self-supervised  Bootstrapping  Graph neural network  
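Research on Emotion Recognition in Interactive Scenarios (面向交互场景的情感识别研究) (Thesis)
Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2021
Author: Lian Zheng (连政)
Adobe PDF (4140 KB)  |  Views/Downloads: 196/15  |  Submitted: 2021/06/16
Interactive scenarios  Emotion recognition  Emotional feature extraction  Multimodal fusion  Individual information modeling  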
DECN: Dialogical Emotion Correction Network for Conversational Emotion Recognition (Journal article)
Neurocomputing, 2021, Issue: 0, Pages: 0
Authors: Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2238 KB)  |  Views/Downloads: 153/29  |  Submitted: 2021/06/16
Emotion recognition in conversations (ERC)  Context-sensitive modeling  Dialogical Emotion Correction Network (DECN)  Interaction modeling  
A time-frequency channel attention and vectorization network for automatic depression level prediction (Journal article)
Neurocomputing, 2021, Issue: 450, Pages: 208-218
Authors: Mingyue Niu; Bin Liu; Jianhua Tao; Qifei Li
Adobe PDF (2001 KB)  |  Views/Downloads: 183/52  |  Submitted: 2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Multimodal Spatiotemporal Representation for Automatic Depression Level Detection (Journal article)
IEEE Transactions on Affective Computing, 2020, Issue: 0, Pages: 0
Authors: Mingyue Niu; Jianhua Tao; Bin Liu; Jian Huang; Zheng Lian
Adobe PDF (2831 KB)  |  Views/Downloads: 191/51  |  Submitted: 2021/06/01
Multimodal depression detection  Spatio-Temporal Attention  Audio/Video Segment-Level Feature  Eigen Evolution Pooling  Audio/Video Level Feature  Multimodal Attention Feature Fusion