CASIA OpenIR
(This search is based on users' claimed works)

Browse/search results: 12 items in total, showing items 1-10

GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios (Conference paper)
Rhodes Island, Greece, June 2023
Authors:  Li GJ (李冠君);  Liu WJ (刘文举);  Yi JY (易江燕);  Tao JH (陶建华)
Adobe PDF (3463Kb)  |  Views/Downloads: 39/13  |  Submitted: 2024/06/06
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (Journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF (2371Kb)  |  Views/Downloads: 69/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 7, Pages: 8419-8432
Authors:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF (3959Kb)  |  Views/Downloads: 186/10  |  Submitted: 2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
Conversational Emotion Analysis via Attention Mechanisms (Conference paper)
Graz, Austria, 15-19 September, 2019
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang
Adobe PDF (317Kb)  |  Views/Downloads: 184/61  |  Submitted: 2021/06/16
Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition (Conference paper)
Shanghai, China, 25-29 October, 2020
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zhanlei Yang;  Rongjun Li
Adobe PDF (348Kb)  |  Views/Downloads: 198/59  |  Submitted: 2021/06/16
Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks (Conference paper)
Shanghai, China, 25-29 October, 2020
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zhanlei Yang;  Rongjun Li
Adobe PDF (509Kb)  |  Views/Downloads: 167/52  |  Submitted: 2021/06/16
CTNet: Conversational Transformer Network for Emotion Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 985-1000
Authors:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF (2230Kb)  |  Views/Downloads: 404/63  |  Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Emotional head motion predicting from prosodic and linguistic features (Journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, Volume: 75, Issue: 9, Pages: 5125-5146
Authors:  Yang, Minghao;  Jiang, Jinlin;  Tao, Jianhua;  Mu, Kaihui;  Li, Hao
Adobe PDF (804Kb)  |  Views/Downloads: 86/7  |  Submitted: 2020/10/27
Visual Prosody  Head Gesture  Prosody Clustering  
Adversarial Multilingual Training for Low-Resource Speech Recognition (Conference paper)
Calgary, AB, Canada, 2018.04.15-2018.04.20
Authors:  Jiangyan Yi;  Jianhua Tao;  Zhengqi Wen;  Ye Bai
Adobe PDF (1343Kb)  |  Views/Downloads: 35/18  |  Submitted: 2020/10/22
Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings (Conference paper)
Brighton, UK, 2019.05.12-2019.05.15
Authors:  Jiangyan Yi;  Jianhua Tao
Adobe PDF (273Kb)  |  Views/Downloads: 45/18  |  Submitted: 2020/10/22