CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
CID-SIMS: Complex indoor dataset with semantic information and multi-sensor data from a ground wheeled robot viewpoint 期刊论文
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, 页码: 19
作者:  Zhang, Yidi;  An, Ning;  Shi, Chenhui;  Wang, Shuo;  Wei, Hao;  Zhang, Pengju;  Meng, Xinrui;  Sun, Zengpeng;  Wang, Jinke;  Liang, Wenliang;  Tang, Fulin;  Wu, Yihong
收藏  |  浏览/下载:67/0  |  提交时间:2024/02/22
Dataset  ground wheeled robots  semantic segmentation  multi-sensor data  simultaneous localization and mapping  3D reconstruction  
Deep learning models of ultrasonography significantly improved the differential diagnosis performance for superficial soft-tissue masses: a retrospective multicenter study 期刊论文
BMC MEDICINE, 2023, 卷号: 21, 期号: 1, 页码: 11
作者:  Long, Bin;  Zhang, Haoyan;  Zhang, Han;  Chen, Wen;  Sun, Yang;  Tang, Rui;  Lin, Yuxuan;  Fu, Qiang;  Yang, Xin;  Cui, Ligang;  Wang, Kun
收藏  |  浏览/下载:58/0  |  提交时间:2023/12/21
Superficial soft-tissue masses  Deep learning model  Ultrasound  Diagnosis  Computer-assisted diagnosis  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:42/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:197/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:238/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
The NLPR Speech Synthesis entry for Blizzard Challenge 2017 会议论文
, Stockholm, Sweden, 2017.8.25
作者:  Jianhua Tao;  Ruibo Fu;  Yibin Zheng;  Zhengqi Wen;  Ya Li;  Biu Liu
收藏  |  浏览/下载:68/0  |  提交时间:2020/10/27
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
作者:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
浏览  |  Adobe PDF(432Kb)  |  收藏  |  浏览/下载:604/105  |  提交时间:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
基于内容和声学特征层级融合的自动韵律边界标注 期刊论文
中国语音学报, 2018, 期号: 10, 页码: 103-110
作者:  傅睿博;  陶建华;  温正棋
Adobe PDF(1209Kb)  |  收藏  |  浏览/下载:265/88  |  提交时间:2020/06/27
韵律边界标注  特征层级融合  语料库构建  语音合成  
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer 会议论文
, 北京, 2018-8
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
浏览  |  Adobe PDF(1188Kb)  |  收藏  |  浏览/下载:197/61  |  提交时间:2020/06/27
speech synthesis  progressive neural networks  unit-selection  target cost  
基于静音时长和文本特征融合的韵律边界自动标注 会议论文
, 江苏连云港, 2017-10
作者:  傅睿博;  李雅;  温正棋;  陶建华
Adobe PDF(877Kb)  |  收藏  |  浏览/下载:223/80  |  提交时间:2020/06/27