CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

  只显示已认领条目
已选(0)清除 条数/页:   排序方式:
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:51/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
ICaps-ResLSTM: Improved capsule network and residual LSTM for EEG emotion recognition 期刊论文
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 卷号: 87, 页码: 9
作者:  Fan, Cunhang;  Xie, Heng;  Tao, Jianhua;  Li, Yongwei;  Pei, Guanxiong;  Li, Taihao;  Lv, Zhao
收藏  |  浏览/下载:91/0  |  提交时间:2023/11/15
Electroencephalogram  Emotion recognition  Capsule network  Residual Long-Short Term Memory  
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:5/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition 期刊论文
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
作者:  Fan, Cunhang;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:25/0  |  提交时间:2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
Dynamic Domain Adaptation for Class-Aware Cross-Subject and Cross-Session EEG Emotion Recognition 期刊论文
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 卷号: 26, 期号: 12, 页码: 5964-5973
作者:  Li, Zhunan;  Zhu, Enwei;  Jin, Ming;  Fan, Cunhang;  He, Huiguang;  Cai, Ting;  Li, Jinpeng
收藏  |  浏览/下载:205/0  |  提交时间:2023/02/22
Brain-computer interface  emotion recognition  transfer learning  domain adaptation  
SpecMNet: Spectrum mend network for monaural speech enhancement 期刊论文
APPLIED ACOUSTICS, 2022, 卷号: 194, 页码: 9
作者:  Fan, Cunhang;  Zhang, Hongmei;  Yi, Jiangyan;  Lv, Zhao;  Tao, Jianhua;  Li, Taihao;  Pei, Guanxiong;  Wu, Xiaopei;  Li, Sheng
收藏  |  浏览/下载:209/0  |  提交时间:2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM  
基于多域听觉特征建模的说话人无关语音分离方法研究 学位论文
, 北京市: 中国科学院自动化研究所, 2021
作者:  范存航
Adobe PDF(3377Kb)  |  收藏  |  浏览/下载:244/12  |  提交时间:2021/06/01
说话人无关语音分离  听觉特征建模  深度嵌入式特征  深度注意力融合特征  门控递归融合  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:361/47  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning 会议论文
, Hong Kong, 24-27 Jan. 2021
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi;  Song, Leichao
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:205/48  |  提交时间:2021/06/01
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  收藏  |  浏览/下载:179/53  |  提交时间:2021/06/01