CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8419-8432
作者:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  收藏  |  浏览/下载:152/3  |  提交时间:2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
基于随机时频掩蔽的 DNN-HMM 声学模型数据扩增 会议论文
, 青海西宁, 2019
作者:  白烨;  易江燕;  陶建华;  温正棋
Adobe PDF(444Kb)  |  收藏  |  浏览/下载:170/37  |  提交时间:2021/06/25
在噪声环境下基于注意力机制的单通道语音去混响方法 会议论文
, 青海西宁, 2019年8月
作者:  范存航;  刘斌;  陶建华;  易江燕;  温正棋
Adobe PDF(740Kb)  |  收藏  |  浏览/下载:233/61  |  提交时间:2021/06/01
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning 会议论文
, Hong Kong, 24-27 Jan. 2021
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi;  Song, Leichao
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:238/57  |  提交时间:2021/06/01
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  收藏  |  浏览/下载:211/60  |  提交时间:2021/06/01
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-channel Speech Separation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(260Kb)  |  收藏  |  浏览/下载:191/54  |  提交时间:2021/06/01
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features 会议论文
, Graz, Austria, September 15–19, 2019
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(320Kb)  |  收藏  |  浏览/下载:137/44  |  提交时间:2021/06/01
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:376/61  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:410/52  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement 会议论文
, Hyderabad, India, 2018-9-2~2018-9-6
作者:  Nie S(聂帅);  Shan Liang;  Bin Liu;  Yaping Zhang;  Wenju Liu;  Jianhua Tao
Adobe PDF(925Kb)  |  收藏  |  浏览/下载:95/32  |  提交时间:2020/10/22