Knowledge Commons of Institute of Automation,CAS
A Class-specific Copy Network for Handling the Rare Word Problem in Neural Machine Translation | |
Feng Wang![]() ![]() ![]() ![]() ![]() | |
2017 | |
会议名称 | International Joint Conference on Neural Networks |
会议录名称 | IJCNN |
页码 | 2658-2664 |
会议日期 | 14-19 May 2017 |
会议地点 | Anchorage, AK, USA |
摘要 | Neural machine translation (NMT) has shown promising results and rapidly gained adoption in many large-scale settings. With the NMT model being widely used in empirical productions, its long-standing weakness in handling the rare and out of vocabulary words has been amplified a lot. In order to release the model from the stress of “understanding” the rare words, copy mechanism has been proposed to deal with the rare and unseen words for the neural network models using attention. However the negative side of the copy mechanism is that the model is only able to decide whether to copy or not. It is unable to detect which class should the rare word be copied to, such as person, location, and organization. This paper deeply investigates this limitation of the NMT model. As a result, we propose a new NMT model by novelly incorporating a class-specific copy network. With the network, the proposed NMT model is able to decide which class the words in the target belong to and which class in the source should be copied to. Experimental results on Chinese-English translation tasks show that the proposed model outperforms the traditional NMT model with a large margin especially for sentences containing the rare words. |
关键词 | Neural Machine Translation (Nmt) Copy Network Rare Word |
语种 | 英语 |
文献类型 | 会议论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/41055 |
专题 | 复杂系统认知与决策实验室_听觉模型与认知计算 数字内容技术与服务研究中心 |
通讯作者 | Wei Chen |
推荐引用方式 GB/T 7714 | Feng Wang,Wei Chen,Zhen yang,et al. A Class-specific Copy Network for Handling the Rare Word Problem in Neural Machine Translation[C],2017:2658-2664. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论