CASIA OpenIR

Browse/search results: 31 items in total, showing items 1-10

Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, Volume: 32, Pages: 151-163
Authors: Xu, Jiaming; Cui, Jian; Hao, Yunzhe; Xu, Bo
Views/Downloads: 37/0 | Submitted: 2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
Towards Unified Multi-Domain Machine Translation With Mixture of Domain Experts [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 3488-3498
Authors: Lu, Jinliang; Zhang, Jiajun
Views/Downloads: 80/0 | Submitted: 2023/12/21
Training  Adaptation models  Transformers  Task analysis  Speech processing  Machine translation  Switches  Machine Translation  Multi-domain  Mixture-of-expert  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors: Yi, Jiangyan; Tao, Jianhua; Fu, Ruibo; Wang, Tao; Zhang, Chu Yuan; Wang, Chenglong
Views/Downloads: 42/0 | Submitted: 2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Music Theory-Inspired Acoustic Representation for Speech Emotion Recognition [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2534-2547
Authors: Li, Xingfeng; Shi, Xiaohan; Hu, Desheng; Li, Yongwei; Zhang, Qingchen; Wang, Zhengxia; Unoki, Masashi; Akagi, Masato
Views/Downloads: 55/0 | Submitted: 2023/11/17
Affective computing  speech emotion recognition  acoustic representation  music theory and speech analysis  
Topic-Oriented Dialogue Summarization [Journal article]
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, Volume: 31, Pages: 1797-1810
Authors: Lin, Haitao; Zhu, Junnan; Xiang, Lu; Zhai, Feifei; Zhou, Yu; Zhang, Jiajun; Zong, Chengqing
Adobe PDF(3037Kb) | Views/Downloads: 192/77 | Submitted: 2023/06/13
dialogue summarization  abstractive summarization  controllable text generation  natural language processing  
Attention Analysis and Calibration for Transformer in Natural Language Generation [Journal article]
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, Pages: 1927-1938
Authors: Yu, Lu; Jiajun, Zhang; Jiali, Zeng; Shuangzhi, Wu; Chengqing, Zong
Adobe PDF(1978Kb) | Views/Downloads: 111/33 | Submitted: 2023/05/31
Neural machine translation  
Synchronous Inference for Multilingual Neural Machine Translation [Journal article]
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2022, Issue: 30, Pages: 1827
Authors: Wang, Qian; Zhang, Jiajun; Zong, Chengqing
Adobe PDF(1738Kb) | Views/Downloads: 146/47 | Submitted: 2022/12/19
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 196/0 | Submitted: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Fact-Driven Abstractive Summarization by Utilizing Multi-Granular Multi-Relational Knowledge [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 1665-1678
Authors: Mao, Qianren; Li, Jianxin; Peng, Hao; He, Shizhu; Wang, Lihong; Yu, Philip S.; Wang, Zheng
Views/Downloads: 125/0 | Submitted: 2022/07/25
Fact consistency  graph neural network  language model  pointer network  text summarization  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation [Journal article]
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors: Wang, Tao; Fu, Ruibo; Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 238/0 | Submitted: 2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control