CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共7条,第1-7条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 1, 页码: 153-168
作者:  Zefa Hu;  Ziyi Ni;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1525Kb)  |  收藏  |  浏览/下载:18/5  |  提交时间:2024/04/23
Medical dialogue understanding, information extraction, text generation, knowledge-enhanced prompt, low-resource setting, data augmentation  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:15/4  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:145/29  |  提交时间:2023/06/21
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:212/91  |  提交时间:2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:243/147  |  提交时间:2023/06/05
鸡尾酒会问题与相关听觉模型的研究现状与展望 期刊论文
自动化学报, 2019, 卷号: 45, 期号: 2, 页码: 234-251
作者:  黄雅婷;  石晶;  许家铭;  徐波
Adobe PDF(3009Kb)  |  收藏  |  浏览/下载:188/59  |  提交时间:2022/09/17
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:225/58  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach