CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CID-SIMS: Complex indoor dataset with semantic information and multi-sensor data from a ground wheeled robot viewpoint 期刊论文
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2023, 页码: 19
作者:  Zhang, Yidi;  An, Ning;  Shi, Chenhui;  Wang, Shuo;  Wei, Hao;  Zhang, Pengju;  Meng, Xinrui;  Sun, Zengpeng;  Wang, Jinke;  Liang, Wenliang;  Tang, Fulin;  Wu, Yihong
收藏  |  浏览/下载:55/0  |  提交时间:2024/02/22
Dataset  ground wheeled robots  semantic segmentation  multi-sensor data  simultaneous localization and mapping  3D reconstruction  
Deep learning models of ultrasonography significantly improved the differential diagnosis performance for superficial soft-tissue masses: a retrospective multicenter study 期刊论文
BMC MEDICINE, 2023, 卷号: 21, 期号: 1, 页码: 11
作者:  Long, Bin;  Zhang, Haoyan;  Zhang, Han;  Chen, Wen;  Sun, Yang;  Tang, Rui;  Lin, Yuxuan;  Fu, Qiang;  Yang, Xin;  Cui, Ligang;  Wang, Kun
收藏  |  浏览/下载:48/0  |  提交时间:2023/12/21
Superficial soft-tissue masses  Deep learning model  Ultrasound  Diagnosis  Computer-assisted diagnosis  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:38/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:225/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:182/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
作者:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
浏览  |  Adobe PDF(432Kb)  |  收藏  |  浏览/下载:579/104  |  提交时间:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
DeepTrend 2.0: A light-weighted multi-scale traffic prediction model using detrending 期刊论文
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2019, 卷号: 103, 页码: 142-157
作者:  Dai, Xingyuan;  Fu, Rui;  Zhao, Enmin;  Zhang, Zuo;  Lin, Yilun;  Wang, Fei-Yue;  Li, Li
Adobe PDF(5109Kb)  |  收藏  |  浏览/下载:295/26  |  提交时间:2019/09/30
Traffic prediction  Deep learning  Detrending  Multi-scale traffic prediction  
基于内容和声学特征层级融合的自动韵律边界标注 期刊论文
中国语音学报, 2018, 期号: 10, 页码: 103-110
作者:  傅睿博;  陶建华;  温正棋
浏览  |  Adobe PDF(1209Kb)  |  收藏  |  浏览/下载:260/87  |  提交时间:2020/06/27
韵律边界标注  特征层级融合  语料库构建  语音合成  
基于静音时长和文本特征融合的韵律边界自动标注 期刊论文
清华大学学报(自然科学版), 2018, 卷号: 58, 期号: 1, 页码: 61-66,74
作者:  傅睿博;  陶建华;  李雅;  温正棋
浏览  |  Adobe PDF(1160Kb)  |  收藏  |  浏览/下载:274/103  |  提交时间:2020/06/21
韵律边界标注  决策融合  静音时长  语料库构建  语音合成  
ORGM: Occlusion Relational Graphical Model for Human Pose Estimation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 卷号: 26, 期号: 2, 页码: 927-941
作者:  Fu, Lianrui;  Zhang, Junge;  Huang, Kaiqi
浏览  |  Adobe PDF(3253Kb)  |  收藏  |  浏览/下载:303/87  |  提交时间:2017/09/12
Occlusion  Pose Estimation  Spacial Relationship  Mixture  Graphical Model