CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共10条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:66/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:53/12  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:246/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:210/59  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech 期刊论文
SPEECH COMMUNICATION, 2015, 卷号: 72, 页码: 59-73
作者:  Li, Ya;  Tao, Jianhua;  Hirose, Keikichi;  Xu, Xiaoying;  Lai, Wei
收藏  |  浏览/下载:96/0  |  提交时间:2020/10/27
Prosody  Stress  Hierarchical Modeling  Fujisaki Model  Speech Synthesis  
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1039-1052
作者:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua;  Jianhua Tao
收藏  |  浏览/下载:126/0  |  提交时间:2020/10/27
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech  
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 3, 页码: 621-630
作者:  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Bai, Ye
浏览  |  Adobe PDF(907Kb)  |  收藏  |  浏览/下载:451/95  |  提交时间:2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
Features importance analysis for emotional speech classification 期刊论文
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 卷号: 3784, 期号: 0, 页码: 449-457
作者:  Tao, JH;  Kang, YG;  Tao, J;  Picard, RW
浏览  |  Adobe PDF(188Kb)  |  收藏  |  浏览/下载:243/90  |  提交时间:2015/11/06
Emotionspeech  
Affective computing: A review 期刊论文
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 卷号: 3784, 期号: 0, 页码: 981-995
作者:  Tao, JH;  Tan, TN;  Tao, J;  Picard, RW
浏览  |  Adobe PDF(214Kb)  |  收藏  |  浏览/下载:827/489  |  提交时间:2015/11/06
Acreview  
A hybrid GMM and codebook mapping method for spectral conversion 期刊论文
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 卷号: 3784, 期号: 0, 页码: 303-310
作者:  Kang, YG;  Shuang, ZW;  Tao, JH;  Zhang, W;  Xu, B;  Tao, J;  Picard, RW
浏览  |  Adobe PDF(169Kb)  |  收藏  |  浏览/下载:256/70  |  提交时间:2015/11/06
Codebook Mapping Methods