CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 23 items in total, showing items 1-10

Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (journal article)
IEEE Transactions on Affective Computing, 2023, Vol. 15, No. 1, pp. 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2371 KB) | Views/Downloads: 51/17 | Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition (journal article)
Information Fusion, 2024, pp. 1-12
Authors: Zheng Lian; Licai Sun; Haiyang Sun; Kang Chen; Zhuofan Wen; Hao Gu; Bin Liu; Jianhua Tao
Adobe PDF (6888 KB) | Views/Downloads: 56/8 | Submitted: 2024/05/31
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition (journal article)
IEEE Signal Processing Letters, 2022, pp. 762-766
Authors: Zhengkun Tian; Jiangyan Yi; Jianhua Tao; Shuai Zhang; Zhengqi Wen
Adobe PDF (934 KB) | Views/Downloads: 291/79 | Submitted: 2022/06/14
DECN: Dialogical Emotion Correction Network for Conversational Emotion Recognition (journal article)
Neurocomputing, 2021, No. 0, pp. 0
Authors: Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2238 KB) | Views/Downloads: 176/35 | Submitted: 2021/06/16
Emotion recognition in conversations (ERC)  Context-sensitive modeling  Dialogical Emotion Correction Network (DECN)  Interaction modeling  
CTNet: Conversational Transformer Network for Emotion Recognition (journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, No. 29, pp. 985-1000
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2230 KB) | Views/Downloads: 392/62 | Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
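Research on Very-Low-Rate Speech Coding Algorithms for Narrowband Communication (面向窄带通信的极低速率语音编码算法研究) (journal article)
信号处理 (Journal of Signal Processing), 2013, No. 9, pp. 1134-1138
Authors: Liu Bin (刘斌); Tao Jianhua (陶建华); Mo Fuyuan (莫福源)
Views/Downloads: 156/0 | Submitted: 2020/10/27
Joint vector quantization  Nonlinear quantization  Prediction residual  Auditory perception  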
Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis (journal article)
Journal of Signal Processing Systems for Signal, Image, and Video Technology, 2014, Vol. 74, No. 3, pp. 423-435
Authors: Wen, Zhengqi; Tao, Jianhua; Pan, Shifeng; Wang, Yang
Views/Downloads: 28/0 | Submitted: 2020/10/27
Speech Synthesis  Hmm-based Speech Synthesis  Parametric Representation Of Speech  Excitation Model  Pitch-scaled Spectrum  
Hierarchical stress modeling and generation in Mandarin for expressive Text-to-Speech (journal article)
Speech Communication, 2015, Vol. 72, pp. 59-73
Authors: Li, Ya; Tao, Jianhua; Hirose, Keikichi; Xu, Xiaoying; Lai, Wei
Views/Downloads: 92/0 | Submitted: 2020/10/27
Prosody  Stress  Hierarchical Modeling  Fujisaki Model  Speech Synthesis  
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement (journal article)
Journal of Signal Processing Systems for Signal, Image, and Video Technology, 2016, Vol. 82, No. 2, pp. 141-150
Authors: Liu, Bin; Tao, Jianhua; Wen, Zhengqi; Mo, Fuyuan
Views/Downloads: 69/0 | Submitted: 2020/10/27
Analysis-synthesis Framework  Multi-band Summary Correlogram  Denoising Autoencoder  Speech Enhancement  Speech Coding  
Emotional head motion predicting from prosodic and linguistic features (journal article)
Multimedia Tools and Applications, 2016, Vol. 75, No. 9, pp. 5125-5146
Authors: Yang, Minghao; Jiang, Jinlin; Tao, Jianhua; Mu, Kaihui; Li, Hao
Adobe PDF (804 KB) | Views/Downloads: 79/4 | Submitted: 2020/10/27
Visual Prosody  Head Gesture  Prosody Clustering