CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:45/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Reflection Removal via Realistic Training Data Generation 会议论文
, 线上, 2020-08
作者:  Pang YX(庞有鑫);  Yuan MK(袁梦轲);  Fu Q(付强);  Yan DM(严冬明)
Adobe PDF(1636Kb)  |  收藏  |  浏览/下载:80/22  |  提交时间:2023/06/26
Clinical-Coder: Assigning Interpretable ICD-10 Codes to Chinese Clinical Notes 会议论文
, Online, July 5 - 10, 2020
作者:  Pengfei Cao;  Chenwei Yan;  Xiangling Fu;  Yubo Chen;  Kang Liu;  Jun Zhao;  Shengping Liu;  Weifeng Chong
Adobe PDF(1388Kb)  |  收藏  |  浏览/下载:65/22  |  提交时间:2023/06/26
Diffractive lensless imaging with optimized Voronoi-Fresnel phase 期刊论文
OPTICS EXPRESS, 2022, 卷号: 30, 期号: 25, 页码: 45807-45823
作者:  Fu, Qiang;  Yan, Dong-ming;  Heidrich, Wolfgang
收藏  |  浏览/下载:127/0  |  提交时间:2023/03/20
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:207/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:250/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
Progressive polarization based reflection removal via realistic training data generation 期刊论文
PATTERN RECOGNITION, 2022, 卷号: 124, 页码: 13
作者:  Pang, Youxin;  Yuan, Mengke;  Fu, Qiang;  Ren, Peiran;  Yan, Dong-Ming
Adobe PDF(4985Kb)  |  收藏  |  浏览/下载:295/36  |  提交时间:2022/02/16
Deep learning  Reflection removal  Polarization  Progressive network  Convolutional neural networks  
Micro-Expression Recognition Using Color Spaces 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 卷号: 24, 期号: 12, 页码: 6034-6047
作者:  Wang, Su-Jing;  Yan, Wen-Jing;  Li, Xiaobai;  Zhao, Guoying;  Zhou, Chun-Guang;  Fu, Xiaolan;  Yang, Minghao;  Tao, Jianhua
收藏  |  浏览/下载:109/0  |  提交时间:2020/10/27
Micro-expression Recognition  Color Spaces  Tensor Analysis  Local Binary Patterns  Facial Action Coding System  
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
作者:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
浏览  |  Adobe PDF(432Kb)  |  收藏  |  浏览/下载:614/106  |  提交时间:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
浏览  |  Adobe PDF(154Kb)  |  收藏  |  浏览/下载:338/86  |  提交时间:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis