CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 21 items in total, showing items 1-10

Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, Volume: 32, Pages: 151-163
Authors: Xu, Jiaming; Cui, Jian; Hao, Yunzhe; Xu, Bo
Views/Downloads: 38/0  |  Submitted: 2024/02/22
Cocktail party problem  target speaker separation  multi-cue guided separation  semi-supervised learning  
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 1, Pages: 153-168
Authors: Zefa Hu; Ziyi Ni; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF(1525Kb)  |  Views/Downloads: 4/2  |  Submitted: 2024/04/23
Medical dialogue understanding, information extraction, text generation, knowledge-enhanced prompt, low-resource setting, data augmentation  
VLP: A Survey on Vision-language Pre-training (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 38-56
Authors: Feilong Chen; Duzhen Zhang; Minglun Han; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF(969Kb)  |  Views/Downloads: 130/28  |  Submitted: 2023/06/21
VLP: A Survey on Vision-language Pre-training (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 38-56
Authors: Fei-Long Chen; Du-Zhen Zhang; Ming-Lun Han; Xiu-Yi Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF(1427Kb)  |  Views/Downloads: 2/1  |  Submitted: 2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Train from scratch: Single-stage joint training of speech separation and recognition (Journal article)
COMPUTER SPEECH AND LANGUAGE, 2022, Volume: 76, Pages: 15
Authors: Shi, Jing; Chang, Xuankai; Watanabe, Shinji; Xu, Bo
Views/Downloads: 205/0  |  Submitted: 2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
Compressing Speaker Extraction Model with Ultra-low Precision Quantization and Knowledge Distillation (Journal article)
Neural Networks, 2022, Volume: 154, Pages: 13-21
Authors: Yating Huang; Yunzhe Hao; Jiaming Xu; Bo Xu
Adobe PDF(801Kb)  |  Views/Downloads: 182/48  |  Submitted: 2022/09/17
Towards Modeling Auditory Restoration in Noisy Environments (Conference paper)
Online conference, Jul 18, 2021
Authors: Yating Huang; Yunzhe Hao; Jiaming Xu; Bo Xu
Adobe PDF(628Kb)  |  Views/Downloads: 173/36  |  Submitted: 2022/09/17
WASE: LEARNING WHEN TO ATTEND FOR SPEAKER EXTRACTION IN COCKTAIL PARTY ENVIRONMENTS (Conference paper)
Toronto, June 6-11, 2021
Authors: Yunzhe Hao; Jiaming Xu; Peng Zhang; Bo Xu
Adobe PDF(2034Kb)  |  Views/Downloads: 205/31  |  Submitted: 2022/06/23
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training (Conference paper)
Online conference, 2021-7-18
Authors: Zhang Peng; Xu Jiaming; Shi Jing; Hao Yunzhe; Qin Lei; Xu Bo
Adobe PDF(1900Kb)  |  Views/Downloads: 203/50  |  Submitted: 2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach  
Online Audio-Visual Speech Separation with Generative Adversarial Training (Conference paper)
Online conference, 2021-4-23
Authors: Zhang Peng; Xu Jiaming; Hao Yunzhe; Xu Bo
Adobe PDF(532Kb)  |  Views/Downloads: 198/46  |  Submitted: 2021/06/21
audio-visual speech separation  online processing  generative adversarial training  causal temporal convolutional network