CASIA OpenIR

浏览/检索结果: 共25条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
SceneFake: An initial dataset and benchmarks for scene fake audio detection 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 152, 页码: 12
作者:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
收藏  |  浏览/下载:18/0  |  提交时间:2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dateset  
Emotion selectable end-to-end text-based speech editing 期刊论文
ARTIFICIAL INTELLIGENCE, 2024, 卷号: 329, 页码: 16
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
收藏  |  浏览/下载:12/0  |  提交时间:2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
Text Difficulty Study: Do Machines Behave the Same as Humans Regarding Text Difficulty? 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 283-293
作者:  Bowen Chen;  Xiao Ding;  Yi Zhao;  Bo Fu;  Tingmao Lin;  Bing Qin;  Ting Liu
Adobe PDF(1796Kb)  |  收藏  |  浏览/下载:54/11  |  提交时间:2024/04/23
Cognition inspired natural language processing, psycholinguistics, explainability, text difficulty, curriculum learning  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:80/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Online Optimization in Power Systems With High Penetration of Renewable Generation: Advances and Prospects 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 4, 页码: 839-858
作者:  Zhaojian Wang;  Wei Wei;  John Zhen Fu Pang;  Feng Liu;  Bo Yang;  Xinping Guan;  Shengwei Mei
Adobe PDF(2336Kb)  |  收藏  |  浏览/下载:198/41  |  提交时间:2023/03/22
Feedback optimization  Lyapunov optimization  online convex optimization  online optimization  optimization-guided control  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:246/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Push-Sum Based Algorithm for Constrained Convex Optimization Problem and Its Potential Application in Smart Grid 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 10, 页码: 1889-1891
作者:  Qian Xu;  Zao Fu;  Bo Zou;  Hongzhe Liu;  Lei Wang
Adobe PDF(430Kb)  |  收藏  |  浏览/下载:177/55  |  提交时间:2022/09/08
An Extended Convex Combination Approach for Quadratic ${{\bm{{\cal{{{L}}}}}}}_{{\bm{2}}}$ Performance Analysis of Switched Uncertain Linear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 9, 页码: 1706-1709
作者:  Yufang Chang;  Guisheng Zhai;  Lianglin Xiong;  Bo Fu
Adobe PDF(412Kb)  |  收藏  |  浏览/下载:174/61  |  提交时间:2022/08/19
DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 卷号: 44, 期号: 6, 页码: 2938-2952
作者:  Fu, Chaoyou;  Wu, Xiang;  Hu, Yibo;  Huang, Huaibo;  He, Ran
Adobe PDF(2900Kb)  |  收藏  |  浏览/下载:276/100  |  提交时间:2022/06/14
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:289/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control