CASIA OpenIR

Browse/Search Results:  1-10 of 280 Help

  Show only claimed items
Selected(0)Clear Items/Page:    Sort:
Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection 期刊论文
NEURAL NETWORKS, 2024, 卷号: 175, 页码: 11
Authors:  Fan, Cunhang;  Xue, Jun;  Tao, Jianhua;  Yi, Jiangyan;  Wang, Chenglong;  Zheng, Chengshi;  Lv, Zhao
Favorite  |  View/Download:23/0  |  Submit date:2024/07/04
ASVspoof  Fake speech detection  Fundamental frequency  Res2Net  
SceneFake: An initial dataset and benchmarks for scene fake audio detection 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 152, 页码: 12
Authors:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
Favorite  |  View/Download:18/0  |  Submit date:2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dateset  
WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 卷号: 15, 期号: 1, 页码: 285-296
Authors:  Niu, Mingyue;  Tao, Jianhua;  Li, Yongwei;  Qin, Yong;  Li, Ya
Favorite  |  View/Download:7/0  |  Submit date:2024/07/03
Assessment block  depression level prediction  representation block  speech signals  WavDepressionNet  
Emotion selectable end-to-end text-based speech editing 期刊论文
ARTIFICIAL INTELLIGENCE, 2024, 卷号: 329, 页码: 16
Authors:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
Favorite  |  View/Download:12/0  |  Submit date:2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms 会议论文
, Taiyuan, Shanxi, China, 2024-07-27
Authors:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF(2254Kb)  |  Favorite  |  View/Download:26/12  |  Submit date:2024/06/26
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文
, Beijing, China, 2018-8-2
Authors:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF(540Kb)  |  Favorite  |  View/Download:51/13  |  Submit date:2024/06/24
Multi-Scale Permutation Entropy for Audio Deepfake Detection 会议论文
, 韩国首尔, 2024-4-14
Authors:  Chenglong Wang;  He JY(何佳毅);  Jiangyan Yi;  Jianhua Tao;  Chu Yuan Zhang;  Xiaohui Zhang
Adobe PDF(997Kb)  |  Favorite  |  View/Download:51/17  |  Submit date:2024/06/13
End-to-End Network Based on Transformer for Automatic Detection of Covid-19 会议论文
, Singapore, 22-27 May 2022
Authors:  Cong Cai;  Bin Liu;  Jianhua Tao;  Zhengkun Tian;  Jiahao Lu;  Kexin Wang
Adobe PDF(1210Kb)  |  Favorite  |  View/Download:51/14  |  Submit date:2024/06/11
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios 会议论文
, 希腊罗得岛, 2023年6月
Authors:  Li GJ(李冠君);  Liu WJ(刘文举);  Yi JY(易江燕);  Tao JH(陶建华)
Adobe PDF(3463Kb)  |  Favorite  |  View/Download:37/12  |  Submit date:2024/06/06
MULTIMODAL CROSS- AND SELF-ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION 会议论文
, Toronto, Canada, 6-12 June 2021
Authors:  Licai Sun;  Bin Liu;  Jianhua Tao;  Zheng Lian
Adobe PDF(1078Kb)  |  Favorite  |  View/Download:40/11  |  Submit date:2024/06/03