CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共35条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
SceneFake: An initial dataset and benchmarks for scene fake audio detection 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 152, 页码: 12
作者:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
收藏  |  浏览/下载:2/0  |  提交时间:2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dateset  
WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 卷号: 15, 期号: 1, 页码: 285-296
作者:  Niu, Mingyue;  Tao, Jianhua;  Li, Yongwei;  Qin, Yong;  Li, Ya
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
Assessment block  depression level prediction  representation block  speech signals  WavDepressionNet  
Emotion selectable end-to-end text-based speech editing 期刊论文
ARTIFICIAL INTELLIGENCE, 2024, 卷号: 329, 页码: 16
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
收藏  |  浏览/下载:1/0  |  提交时间:2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:48/15  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:41/9  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8419-8432
作者:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  收藏  |  浏览/下载:169/5  |  提交时间:2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 3, 页码: 2415-2429
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2103Kb)  |  收藏  |  浏览/下载:146/8  |  提交时间:2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 12
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2013Kb)  |  收藏  |  浏览/下载:412/7  |  提交时间:2022/09/19
Emotion recognition  Iterative methods  Context modeling  Psychology  Oral communication  Logic gates  Learning systems  Contextual information  emotion recognition in conversation (ERC)  iterative method  Personality-enhanced Iterative Refinement Network (PIRNet)  personality influence  
SpecMNet: Spectrum mend network for monaural speech enhancement 期刊论文
APPLIED ACOUSTICS, 2022, 卷号: 194, 页码: 9
作者:  Fan, Cunhang;  Zhang, Hongmei;  Yi, Jiangyan;  Lv, Zhao;  Tao, Jianhua;  Li, Taihao;  Pei, Guanxiong;  Wu, Xiaopei;  Li, Sheng
收藏  |  浏览/下载:258/0  |  提交时间:2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM  
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2022, 页码: 762-766
作者:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:288/78  |  提交时间:2022/06/14