CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:177/34  |  提交时间:2023/06/21
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:197/73  |  提交时间:2023/06/20
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:242/102  |  提交时间:2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:285/157  |  提交时间:2023/06/05
鸡尾酒会问题与相关听觉模型的研究现状与展望 期刊论文
自动化学报, 2019, 卷号: 45, 期号: 2, 页码: 234-251
作者:  黄雅婷;  石晶;  许家铭;  徐波
Adobe PDF(3009Kb)  |  收藏  |  浏览/下载:233/82  |  提交时间:2022/09/17
Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:  Shi, Jing;  Chang, Xuankai;  Watanabe, Shinji;  Xu, Bo
收藏  |  浏览/下载:251/0  |  提交时间:2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
Improving Speech Separation with Adversarial Network and Reinforcement Learning 会议论文
, Rio de Janeiro, Brazil, 2018-07
作者:  Liu, Guangcan;  Shi, Jing;  Chen, Xiuyi;  Xu, Jiaming;  Xu, Bo
Adobe PDF(2787Kb)  |  收藏  |  浏览/下载:218/61  |  提交时间:2022/06/27
Distilled Binary Neural Network for Monaural Speech Separation 会议论文
, Rio de Janeiro, Brazil, 2018-07
作者:  Chen, Xiuyi;  Liu, Guangcan;  Shi, Jing;  Xu, Jiaming;  Xu, Bo
Adobe PDF(1770Kb)  |  收藏  |  浏览/下载:214/57  |  提交时间:2022/06/27
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Yunzhe Hao;  Jiaming Xu;  Jing Shi;  Peng Zhang;  Lei Qin;  Bo Xu
Adobe PDF(399Kb)  |  收藏  |  浏览/下载:255/64  |  提交时间:2022/06/23
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:258/69  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach