Knowledge Commons of Institute of Automation,CAS
NCLS: Neural Cross-Lingual Summarization | |
Zhu, Junnan1,2![]() ![]() ![]() ![]() ![]() ![]() ![]() | |
2019-11 | |
会议名称 | Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference onNatural Language Processing |
会议日期 | 2019.11.3-2019.11.7 |
会议地点 | Hong Kong, China |
会议录编者/会议主办者 | Association for Computational Linguistics |
出版者 | Association for Computational Linguistics |
摘要 | Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. Existing methods simply divide this task into two steps: summarization and translation, leading to the problem of error propagation. To handle that, we present an end-to-end CLS framework, which we refer to as Neural Cross-Lingual Summarization (NCLS), for the first time. Moreover, we propose to further improve NCLS by incorporating two related tasks, monolingual summarization and machine translation, into the training process of CLS under multi-task learning. Due to the lack of supervised CLS data, we propose a round-trip translation strategy to acquire two high-quality large-scale CLS datasets based on existing monolingual summarization datasets. Experimental results have shown that our NCLS achieves remarkable improvement over traditional pipeline methods on both English-to-Chinese and Chinese-to-English CLS human-corrected test sets. In addition, NCLS with multi-task learning can further significantly improve the quality of generated summaries. We make our dataset and code publicly available here: http://www.nlpr.ia.ac.cn/cip/dataset.htm. |
收录类别 | EI |
语种 | 英语 |
七大方向——子方向分类 | 自然语言处理 |
文献类型 | 会议论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/39083 |
专题 | 多模态人工智能系统全国重点实验室_自然语言处理 |
通讯作者 | Zhou, Yu |
作者单位 | 1.National Laboratory of Pattern Recognition, Institute of Automation, CAS 2.University of Chinese Academy of Sciences 3.CAS Center for Excellence in Brain Science and Intelligence Technology |
第一作者单位 | 模式识别国家重点实验室 |
通讯作者单位 | 模式识别国家重点实验室 |
推荐引用方式 GB/T 7714 | Zhu, Junnan,Wang, Qian,Wang, Yining,et al. NCLS: Neural Cross-Lingual Summarization[C]//Association for Computational Linguistics:Association for Computational Linguistics,2019. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
emnlp-ijcnlp-2019.pd(518KB) | 会议论文 | 开放获取 | CC BY-NC-SA | 浏览 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论