CASIA OpenIR

浏览/检索结果: 共97条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection 期刊论文
NEURAL NETWORKS, 2024, 卷号: 175, 页码: 11
作者:  Fan, Cunhang;  Xue, Jun;  Tao, Jianhua;  Yi, Jiangyan;  Wang, Chenglong;  Zheng, Chengshi;  Lv, Zhao
收藏  |  浏览/下载:31/0  |  提交时间:2024/07/04
ASVspoof  Fake speech detection  Fundamental frequency  Res2Net  
SceneFake: An initial dataset and benchmarks for scene fake audio detection 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 152, 页码: 12
作者:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
收藏  |  浏览/下载:27/0  |  提交时间:2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dateset  
Auxiliary Network Enhanced Hierarchical Graph Reinforcement Learning for Vehicle Repositioning 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 13
作者:  Xi, Jinhao;  Zhu, Fenghua;  Ye, Peijun;  Lv, Yisheng;  Xiong, Gang;  Wang, Fei-Yue
收藏  |  浏览/下载:10/0  |  提交时间:2024/07/03
Mobility-on-demand system  vehicle repositioning  hierarchical graph reinforcement learning  auxiliary graph reinforcement learning  
Emotion selectable end-to-end text-based speech editing 期刊论文
ARTIFICIAL INTELLIGENCE, 2024, 卷号: 329, 页码: 16
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
收藏  |  浏览/下载:20/0  |  提交时间:2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
不确定性环境下维纳模型的随机变分贝叶斯学习 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 6, 页码: 1185-1198
作者:  刘切;  李俊豪;  王浩;  曾建学;  柴毅
Adobe PDF(2009Kb)  |  收藏  |  浏览/下载:38/19  |  提交时间:2024/07/02
非线性系统辨识  随机优化  变分贝叶斯  维纳模型  
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms 会议论文
, Taiyuan, Shanxi, China, 2024-07-27
作者:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF(2254Kb)  |  收藏  |  浏览/下载:38/14  |  提交时间:2024/06/26
Multi-Scale Permutation Entropy for Audio Deepfake Detection 会议论文
, 韩国首尔, 2024-4-14
作者:  Chenglong Wang;  He JY(何佳毅);  Jiangyan Yi;  Jianhua Tao;  Chu Yuan Zhang;  Xiaohui Zhang
Adobe PDF(997Kb)  |  收藏  |  浏览/下载:68/22  |  提交时间:2024/06/13
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios 会议论文
, 希腊罗得岛, 2023年6月
作者:  Li GJ(李冠君);  Liu WJ(刘文举);  Yi JY(易江燕);  Tao JH(陶建华)
Adobe PDF(3463Kb)  |  收藏  |  浏览/下载:45/15  |  提交时间:2024/06/06
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning 会议论文
, Ottawa, ON, Canada, October 29-November 3, 2023
作者:  Zheng Lian;  Haiyang Sun;  Licai Sun;  Kang Chen;  Mingyu Xu;  Kexin Wang;  Ke Xu;  Yu He;  Ying Li;  Jinming Zhao;  Ye Liu;  Bin Liu;  Jiangyan Yi;  Meng Wang;  Erik Cambria;  Guoying Zhao;  Björn W. Schuller;  Jianhua Tao
Adobe PDF(993Kb)  |  收藏  |  浏览/下载:62/21  |  提交时间:2024/05/31
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:117/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion