已选(0)清除
条数/页: 排序方式: |
| Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文 , Washington D.C., USA, 2023-2-9 作者: Qingyu Wang; Tielin Zhang; Minglun Han; Yi Wang; Duzhen Zhang; Bo Xu Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:128/41  |  提交时间:2023/06/20 |
| VLP: A Survey on Vision-language Pre-training 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56 作者: Feilong Chen; Duzhen Zhang; Minglun Han; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(969Kb)  |  收藏  |  浏览/下载:115/27  |  提交时间:2023/06/21 |
| IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文 , Singapore, 2022.5 作者: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:179/82  |  提交时间:2023/06/07 |
| Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文 , Lisboa, Portugal, October 10–14, 2022 作者: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shang Xu; Bo Xu Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:227/143  |  提交时间:2023/06/05 |
| Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network 会议论文 , Online, June 6–11, 2021 作者: Wu HR(吴浩然); Chen W(陈炜); Xu S(徐爽); Xu B(徐波) Adobe PDF(1394Kb)  |  收藏  |  浏览/下载:135/51  |  提交时间:2023/06/26 |
| Consecutive decoding for speech-to-text translation 会议论文 , Virtual, 2021-2 作者: Dong QQ(董倩倩); Mingxuan Wang(王明轩); Hao Zhou(周浩); Shuang Xu(徐爽); Bo Xu(徐波); Lei Li(李磊) Adobe PDF(586Kb)  |  收藏  |  浏览/下载:187/62  |  提交时间:2021/06/24 |
| Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文 0, 线上会议, 2021-4-23 作者: Zhang Peng; Xu Jiaming; Hao Yunzhe; Xu Bo Adobe PDF(532Kb)  |  收藏  |  浏览/下载:187/45  |  提交时间:2021/06/21 audio-visual speech separation online processing generative adversarial training causal temporal convolutional network |
| CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition 会议论文 , 在线会议, 2020-05 作者: Dong, Linhao; Xu, Bo 浏览  |  Adobe PDF(641Kb)  |  收藏  |  浏览/下载:292/66  |  提交时间:2020/06/13 continuous integrate-and-fire end-to-end model soft and monotonic alignment online speech recognition acoustic boundary positioning |
| SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文 , 新加坡, 2019-12-14 作者: Fan ZY(范志赟); Li J(李杰); Zhou SY(周世玉); Xu B(徐波) Adobe PDF(361Kb)  |  收藏  |  浏览/下载:146/48  |  提交时间:2022/09/17 Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector |
| NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition 会议论文 , Sydney, Australia, 2019-9-20 ~ 2019-9-25 作者: Sheng, Fenfen; Chen, Zhineng; Xu, Bo 浏览  |  Adobe PDF(455Kb)  |  收藏  |  浏览/下载:209/61  |  提交时间:2020/06/12 |