CASIA OpenIR

Browse/search results: 16 items in total, showing items 1-10

CID-SIMS: Complex indoor dataset with semantic information and multi-sensor data from a ground wheeled robot viewpoint (Journal article)
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, Pages: 19
Authors: Zhang, Yidi; An, Ning; Shi, Chenhui; Wang, Shuo; Wei, Hao; Zhang, Pengju; Meng, Xinrui; Sun, Zengpeng; Wang, Jinke; Liang, Wenliang; Tang, Fulin; Wu, Yihong
Views/Downloads: 105/0 | Submitted: 2024/02/22
Dataset  ground wheeled robots  semantic segmentation  multi-sensor data  simultaneous localization and mapping  3D reconstruction  
Deep learning models of ultrasonography significantly improved the differential diagnosis performance for superficial soft-tissue masses: a retrospective multicenter study (Journal article)
BMC MEDICINE, 2023, Volume: 21, Issue: 1, Pages: 11
Authors: Long, Bin; Zhang, Haoyan; Zhang, Han; Chen, Wen; Sun, Yang; Tang, Rui; Lin, Yuxuan; Fu, Qiang; Yang, Xin; Cui, Ligang; Wang, Kun
Views/Downloads: 81/0 | Submitted: 2023/12/21
Superficial soft-tissue masses  Deep learning model  Ultrasound  Diagnosis  Computer-assisted diagnosis  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors: Yi, Jiangyan; Tao, Jianhua; Fu, Ruibo; Wang, Tao; Zhang, Chu Yuan; Wang, Chenglong
Views/Downloads: 62/0 | Submitted: 2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 228/0 | Submitted: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors: Wang, Tao; Fu, Ruibo; Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 273/0 | Submitted: 2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer (Conference paper)
Beijing, 2018-8
Authors: Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Adobe PDF (1188Kb) | Views/Downloads: 225/67 | Submitted: 2020/06/27
speech synthesis  progressive neural networks  unit-selection  target cost  
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis (Conference paper)
Hyderabad, India, 2018-9
Authors: Fu, Ruibo; Tao, Jianhua; Zheng, Yibin; Wen, Zhengqi
Adobe PDF (340Kb) | Views/Downloads: 258/52 | Submitted: 2020/06/27
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer (Conference paper)
Hyderabad, India, 2018-9
Authors: Fu, Ruibo; Tao, Jianhua; Zheng, Yibin; Wen, Zhengqi
Adobe PDF (323Kb) | Views/Downloads: 280/62 | Submitted: 2020/06/27
speech synthesis  unit-selection  target cost  deep metric learning  
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS (Conference paper)
Online virtual conference, 2020-5
Authors: Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi; Yi, Jiangyan; Wang, Tao
Adobe PDF (154Kb) | Views/Downloads: 353/86 | Submitted: 2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation (Conference paper)
Brighton, UK, May 12-17, 2019
Authors: Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi; Zheng, Yibin
Adobe PDF (429Kb) | Views/Downloads: 244/86 | Submitted: 2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation