CASIA OpenIR
(This search is based on works claimed by users)

Browse/search results: 12 items in total, showing items 1-10

GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios (conference paper)
, Rhodes, Greece, June 2023
Authors: Li GJ (李冠君); Liu WJ (刘文举); Yi JY (易江燕); Tao JH (陶建华)
Adobe PDF (3463Kb) | Views/Downloads: 34/12 | Submitted: 2024/06/06
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors: Licai Sun; Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2371Kb) | Views/Downloads: 60/18 | Submitted: 2024/05/31
Transformers; Robustness; Semantics; Data models; Computational modeling; Videos; Training; Multimodal sentiment analysis; unaligned and incomplete data; efficient multimodal Transformer; dual-level feature restoration; robustness
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition (conference paper)
, Shanghai, 2020
Authors: Ye Bai; Jiangyan Yi; Jianhua Tao; Zhengkun Tian; Zhengqi Wen; Shuai Zhang
Adobe PDF (801Kb) | Views/Downloads: 141/38 | Submitted: 2021/06/25
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT (journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, Issue: 29, Pages: 1897-1911
Authors: Ye Bai; Jiangyan Yi; Jianhua Tao; Zhengkun Tian; Zhengqi Wen; Shuai Zhang
Adobe PDF (1163Kb) | Views/Downloads: 211/66 | Submitted: 2021/06/25
End-to-end speech recognition; transfer learning; knowledge distillation; teacher-student learning; BERT; non-autoregressive speech recognition
Conversational Emotion Analysis via Attention Mechanisms (conference paper)
, Graz, Austria, 15-19 September, 2019
Authors: Zheng Lian; Jianhua Tao; Bin Liu; Jian Huang
Adobe PDF (317Kb) | Views/Downloads: 182/61 | Submitted: 2021/06/16
DECN: Dialogical Emotion Correction Network for Conversational Emotion Recognition (journal article)
Neurocomputing, 2021, Issue: 0, Pages: 0
Authors: Zheng Lian; Bin Liu; Jianhua Tao
Adobe PDF (2238Kb) | Views/Downloads: 177/35 | Submitted: 2021/06/16
Emotion recognition in conversations (ERC); Context-sensitive modeling; Dialogical Emotion Correction Network (DECN); Interaction modeling
CTNet: Conversational Transformer Network for Emotion Recognition (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 985-1000
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2230Kb) | Views/Downloads: 398/63 | Submitted: 2021/05/06
Emotion recognition; Context modeling; Feature extraction; Fuses; Speech processing; Data models; Bidirectional control; Context-sensitive modeling; conversational transformer network (CTNet); conversational emotion recognition; multimodal fusion; speaker-sensitive modeling
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer (conference paper)
, Beijing, 2018-8
Authors: Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Adobe PDF (1188Kb) | Views/Downloads: 240/71 | Submitted: 2020/06/27
speech synthesis; progressive neural networks; unit-selection; target cost
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer (conference paper)
, Hyderabad, India, 2018-9
Authors: Fu, Ruibo; Tao, Jianhua; Zheng, Yibin; Wen, Zhengqi
Adobe PDF (323Kb) | Views/Downloads: 296/69 | Submitted: 2020/06/27
speech synthesis; unit-selection; target cost; deep metric learning
Efficient Modeling of Long Temporal Contexts for Continuous Emotion Recognition (conference paper)
, Cambridge, United Kingdom, 2019.9.3-2019.9.6
Authors: Huang, Jian; Tao, Jianhua; Liu, Bin; Lian, Zhen; Niu, Mingyue
Adobe PDF (420Kb) | Views/Downloads: 250/67 | Submitted: 2020/06/20