EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training
Yuxian Gu1,2; Jiaxin Wen1,2; Hao Sun1,2; Yi Song1,2; Pei Ke1,2; Chujie Zheng1,2; Zheng Zhang1,2; Jianzhu Yao2; Lei Liu3; Xiaoyan Zhu1,2; Minlie Huang1,2
Journal: Machine Intelligence Research
ISSN: 2731-538X
Year: 2023
Volume: 20; Issue: 2; Pages: 207-219
Abstract: Large-scale pre-training has shown remarkable performance in building open-domain dialogue systems. However, previous works mainly focus on showing and evaluating the conversational performance of the released dialogue model, ignoring the discussion of some key factors towards a powerful human-like chatbot, especially in Chinese scenarios. In this paper, we conduct extensive experiments to investigate these under-explored factors, including data quality control, model architecture designs, training approaches, and decoding strategies. We propose EVA2.0, a large-scale pre-trained open-domain Chinese dialogue model with 2.8 billion parameters, and will make our models and codes publicly available. Automatic and human evaluations show that EVA2.0 significantly outperforms other open-source counterparts. We also discuss the limitations of this work by presenting some failure cases and pose some future research directions on large-scale Chinese open-domain dialogue systems.
Keywords: Natural language processing; deep learning (DL); large-scale pre-training; dialogue systems; Chinese open-domain conversational model
DOI: 10.1007/s11633-022-1387-3
Citation statistics
Times cited: 3 [WOS]
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/55975
Collection: Academic Journals_Machine Intelligence Research
Author affiliations:
1. The Conversational AI Group, Tsinghua University, Beijing 100084, China
2. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
3. Department of Electrical Engineering and Computer Science, York University, Toronto M3J1P3, Canada
Recommended citation
GB/T 7714: Yuxian Gu, Jiaxin Wen, Hao Sun, et al. EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training[J]. Machine Intelligence Research, 2023, 20(2): 207-219.
APA: Yuxian Gu., Jiaxin Wen., Hao Sun., Yi Song., Pei Ke., ... & Minlie Huang. (2023). EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training. Machine Intelligence Research, 20(2), 207-219.
MLA: Yuxian Gu, et al. "EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training". Machine Intelligence Research 20.2 (2023): 207-219.
Files in this item
File name/size: MIR-2022-09-289.pdf (1846 KB)
Document type: Journal article
Version: Published version
Access type: Open access
License: CC BY-NC-SA