CASIA OpenIR
(This search is based on users' claimed works)

Browse/search results: 14 items in total, showing items 1-10

Multi-Scale Permutation Entropy for Audio Deepfake Detection (Conference paper)
, Seoul, South Korea, 2024-4-14
Authors:  Chenglong Wang;  He JY(何佳毅);  Jiangyan Yi;  Jianhua Tao;  Chu Yuan Zhang;  Xiaohui Zhang
Adobe PDF(997Kb)  |  Views/Downloads: 68/22  |  Submitted: 2024/06/13
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios (Conference paper)
, Rhodes Island, Greece, June 2023
Authors:  Li GJ(李冠君);  Liu WJ(刘文举);  Yi JY(易江燕);  Tao JH(陶建华)
Adobe PDF(3463Kb)  |  Views/Downloads: 45/15  |  Submitted: 2024/06/06
CLMAD: A Chinese Language Model Adaptation Dataset (Conference paper)
, Taipei, 2018
Authors:  Ye Bai;  Jianhua Tao;  Jiangyan Yi;  Zhengqi Wen;  Cunhang Fan
Adobe PDF(157Kb)  |  Views/Downloads: 120/33  |  Submitted: 2021/06/25
Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Attention Mechanism (Conference paper)
, Seattle, United States, 12-16 October, 2020
Authors:  Licai Sun;  Zheng Lian;  Jianhua Tao;  Bin Liu;  Mingyue Niu
Adobe PDF(1041Kb)  |  Views/Downloads: 205/63  |  Submitted: 2021/06/16
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations (Conference paper)
, Shanghai, China, October 25–29, 2020
Authors:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  Views/Downloads: 240/69  |  Submitted: 2021/06/01
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-channel Speech Separation with Deep Embedding Representations (Conference paper)
, Shanghai, China, October 25–29, 2020
Authors:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(260Kb)  |  Views/Downloads: 224/68  |  Submitted: 2021/06/01
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features (Conference paper)
, Graz, Austria, September 15–19, 2019
Authors:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(320Kb)  |  Views/Downloads: 159/48  |  Submitted: 2021/06/01
CTNet: Conversational Transformer Network for Emotion Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 985-1000
Authors:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  Views/Downloads: 408/64  |  Submitted: 2021/05/06
Keywords: Emotion recognition; Context modeling; Feature extraction; Fuses; Speech processing; Data models; Bidirectional control; Context-sensitive modeling; conversational transformer network (CTNet); conversational emotion recognition; multimodal fusion; speaker-sensitive modeling
Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement (Conference paper)
, Hyderabad, India, 2018-9-2~2018-9-6
Authors:  Nie S(聂帅);  Shan Liang;  Bin Liu;  Yaping Zhang;  Wenju Liu;  Jianhua Tao
Adobe PDF(925Kb)  |  Views/Downloads: 125/45  |  Submitted: 2020/10/22
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis (Conference paper)
, Hyderabad, India, 2018-9
Authors:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Adobe PDF(340Kb)  |  Views/Downloads: 277/56  |  Submitted: 2020/06/27