CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共44条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:42/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:339/46  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning 会议论文
, Hong Kong, 24-27 Jan. 2021
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi;  Song, Leichao
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:191/46  |  提交时间:2021/06/01
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-channel Speech Separation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(260Kb)  |  收藏  |  浏览/下载:157/43  |  提交时间:2021/06/01
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  收藏  |  浏览/下载:170/51  |  提交时间:2021/06/01
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:251/55  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network 会议论文
, Lanzhou, China, 18-21 Nov. 2019
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(837Kb)  |  收藏  |  浏览/下载:119/40  |  提交时间:2021/06/01
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features 会议论文
, Graz, Austria, September 15–19, 2019
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(320Kb)  |  收藏  |  浏览/下载:111/38  |  提交时间:2021/06/01
在噪声环境下基于注意力机制的单通道语音去混响方法 会议论文
, 青海西宁, 2019年8月
作者:  范存航;  刘斌;  陶建华;  易江燕;  温正棋
Adobe PDF(740Kb)  |  收藏  |  浏览/下载:192/53  |  提交时间:2021/06/01
一种基于卷积神经网络的端到端语音分离方法 期刊论文
信号处理, 2019, 卷号: 35, 期号: 4, 页码: 542-548
作者:  范存航;  刘斌;  陶建华;  温正棋;  易江燕
Adobe PDF(1621Kb)  |  收藏  |  浏览/下载:155/51  |  提交时间:2021/06/01
说话人独立语音分离  鸡尾酒会问题  端到端  卷积编解码器