CASIA OpenIR  > 模式识别国家重点实验室  > 自然语言处理
Towards Machine Translation in Semantic Vector Space
Jiajun Zhang; Shujie Liu; Mu Li; Ming Zhou; Chengqing Zong
2015-03
发表期刊ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)
卷号14期号:2页码:26
摘要
Measuring the quality of the translation rules and their composition is an essential issue in the conventional statistical machine translation (SMT) framework. To express the translation quality, the previous lexical and phrasal probabilities are calculated only according to the co-occurrence statistics in the bilingual corpus and may be not reliable due to the data sparseness problem. To address this issue, we propose measuring the quality of the translation rules and their composition in the semantic vector embedding space (VES). We present a recursive neural network (RNN)-based translation framework, which includes two
submodels. One is the bilingually-constrained recursive auto-encoder, which is proposed to convert the lexical translation rules into compact real-valued vectors in the semantic VES. The other is a type-dependent recursive neural network, which is proposed to perform the decoding process by minimizing the semantic gap (meaning distance) between the source language string and its translation candidates at each state in a bottom-up structure. The RNN-based translation model is trained using a max-margin objective function that maximizes the margin between the reference translation and the n-best translations in forced decoding. In the experiments, we first show that the proposed vector representations for the translation rules are very reliable for application in translation modeling. We further show that the proposed type-dependent, RNN-based model can significantly improve the translation quality in the large-scale, end-to-end Chineseto-English translation evaluation.
关键词Statistical Machine Translation Recursive Neural Network Semantic Meaning Distance Vector Embedding Space Max-margin Training
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/19930
专题模式识别国家重点实验室_自然语言处理
推荐引用方式
GB/T 7714
Jiajun Zhang,Shujie Liu,Mu Li,et al. Towards Machine Translation in Semantic Vector Space[J]. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP),2015,14(2):26.
APA Jiajun Zhang,Shujie Liu,Mu Li,Ming Zhou,&Chengqing Zong.(2015).Towards Machine Translation in Semantic Vector Space.ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP),14(2),26.
MLA Jiajun Zhang,et al."Towards Machine Translation in Semantic Vector Space".ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) 14.2(2015):26.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
ACM TALLIP,No.2, Vol(1375KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Jiajun Zhang]的文章
[Shujie Liu]的文章
[Mu Li]的文章
百度学术
百度学术中相似的文章
[Jiajun Zhang]的文章
[Shujie Liu]的文章
[Mu Li]的文章
必应学术
必应学术中相似的文章
[Jiajun Zhang]的文章
[Shujie Liu]的文章
[Mu Li]的文章
相关权益政策
暂无数据
收藏/分享
文件名: ACM TALLIP,No.2, Vol.14(March 2015), 26 pages- J. Zhang.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。