CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共14条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文
, Washington D.C., USA, 2023-2-9
作者:  Qingyu Wang;  Tielin Zhang;  Minglun Han;  Yi Wang;  Duzhen Zhang;  Bo Xu
Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:128/41  |  提交时间:2023/06/20
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:115/27  |  提交时间:2023/06/21
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:179/82  |  提交时间:2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:227/143  |  提交时间:2023/06/05
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network 会议论文
, Online, June 6–11, 2021
作者:  Wu HR(吴浩然);  Chen W(陈炜);  Xu S(徐爽);  Xu B(徐波)
Adobe PDF(1394Kb)  |  收藏  |  浏览/下载:135/51  |  提交时间:2023/06/26
Consecutive decoding for speech-to-text translation 会议论文
, Virtual, 2021-2
作者:  Dong QQ(董倩倩);  Mingxuan Wang(王明轩);  Hao Zhou(周浩);  Shuang Xu(徐爽);  Bo Xu(徐波);  Lei Li(李磊)
Adobe PDF(586Kb)  |  收藏  |  浏览/下载:187/62  |  提交时间:2021/06/24
Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文
0, 线上会议, 2021-4-23
作者:  Zhang Peng;  Xu Jiaming;  Hao Yunzhe;  Xu Bo
Adobe PDF(532Kb)  |  收藏  |  浏览/下载:187/45  |  提交时间:2021/06/21
audio-visual speech separation  online processing  generative adversarial training  causal temporal convolutional network  
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition 会议论文
, 在线会议, 2020-05
作者:  Dong, Linhao;  Xu, Bo
浏览  |  Adobe PDF(641Kb)  |  收藏  |  浏览/下载:292/66  |  提交时间:2020/06/13
continuous integrate-and-fire  end-to-end model  soft and monotonic alignment  online speech recognition  acoustic boundary positioning  
SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文
, 新加坡, 2019-12-14
作者:  Fan ZY(范志赟);  Li J(李杰);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(361Kb)  |  收藏  |  浏览/下载:146/48  |  提交时间:2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector  
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition 会议论文
, Sydney, Australia, 2019-9-20 ~ 2019-9-25
作者:  Sheng, Fenfen;  Chen, Zhineng;  Xu, Bo
浏览  |  Adobe PDF(455Kb)  |  收藏  |  浏览/下载:209/61  |  提交时间:2020/06/12