Learning to Represent Review with Tensor Decomposition for Spam Detection
Wang Xuepeng1,2; Liu Kang1; He Shizhu1; Zhao Jun1,2
2016-11
Conference name: The 2016 Conference on Empirical Methods in Natural Language Processing
Proceedings title: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
Conference date: November 1-5, 2016
Conference location: Austin, Texas, USA
Abstract: Review spam detection is a key task in opinion mining. Previous work has focused mainly on representing fake and non-fake reviews with discriminative features that are discovered or elaborately designed by experts or developers. This paper proposes a novel review spam detection method that learns the representation of reviews automatically, in a data-driven manner, instead of relying heavily on experts’ knowledge. More specifically, according to 11 relations (generated automatically from two basic patterns) between reviewers and products, we employ tensor decomposition to learn the embeddings of the reviewers and products in a vector space. We collect relations between any two entities (reviewers and products), which yields rich, global information. We concatenate the review text with the embeddings of the reviewer and the reviewed product to form the representation of a review. Based on such representations, the classifier can identify opinion spam more precisely. Experimental results on an open Yelp dataset show that our method effectively improves spam detection accuracy compared with state-of-the-art methods.
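The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not say which tensor decomposition is used, so the code below assumes a RESCAL-style bilinear factorization, X[k] ≈ A R[k] Aᵀ, fitted with plain gradient descent. The function names, the two toy relations, the rank, and the placeholder text vector are all hypothetical and are not taken from the paper.

```python
import numpy as np

def factorize_relations(X, rank, steps=500, lr=0.01, seed=0):
    """Fit X[k] ~ A @ R[k] @ A.T by gradient descent on squared error.

    X has shape (n_relations, n_entities, n_entities); reviewers and
    products share one entity index space (an assumption of this sketch).
    """
    rng = np.random.default_rng(seed)
    n_rel, n_ent, _ = X.shape
    A = 0.1 * rng.standard_normal((n_ent, rank))        # one embedding row per entity
    R = 0.1 * rng.standard_normal((n_rel, rank, rank))  # one interaction matrix per relation
    for _ in range(steps):
        grad_A = np.zeros_like(A)
        grad_R = np.zeros_like(R)
        for k in range(n_rel):
            E = A @ R[k] @ A.T - X[k]                   # residual for relation k
            grad_A += E @ A @ R[k].T + E.T @ A @ R[k]
            grad_R[k] = A.T @ E @ A
        A -= lr * grad_A
        R -= lr * grad_R
    return A, R

def reconstruction_error(X, A, R):
    """Sum of squared residuals over all relation slices."""
    return sum(np.sum((A @ R[k] @ A.T - X[k]) ** 2) for k in range(X.shape[0]))

def review_representation(text_vec, A, reviewer_idx, product_idx):
    """Concatenate text features with reviewer and product embeddings."""
    return np.concatenate([text_vec, A[reviewer_idx], A[product_idx]])

# Toy data (hypothetical): entities 0-1 are reviewers, 2-3 are products.
X = np.zeros((2, 4, 4))
X[0, 0, 2] = X[0, 1, 2] = X[0, 1, 3] = 1.0  # hypothetical relation: reviewer reviewed product
X[1, 0, 1] = X[1, 1, 0] = 1.0               # hypothetical relation: reviewers share a product

A, R = factorize_relations(X, rank=2)
text_vec = np.array([0.2, 0.0, 0.7])        # placeholder for real text features
rep = review_representation(text_vec, A, reviewer_idx=0, product_idx=2)
```

Here `rep` is the vector a downstream classifier would consume. In a real setting the text features would come from the review itself and the tensor would hold one slice per relation.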
Keywords: Represent Learning; Tensor Decomposition; Spam Detection
Document type: Conference paper
Item identifier: http://ir.ia.ac.cn/handle/173211/14493
Collection: National Laboratory of Pattern Recognition_Natural Language Processing
Corresponding author: Liu Kang
Author affiliations: 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
2. University of Chinese Academy of Sciences
Recommended citation
GB/T 7714
Wang Xuepeng,Liu Kang,He Shizhu,et al. Learning to Represent Review with Tensor Decomposition for Spam Detection[C],2016.
Files in this item
File name/size: D16-1083Learning to (399KB)
Document type: Conference paper
Access type: Open access
License: CC BY-NC-SA
File name: D16-1083Learning to Represent Review with Tensor Decomposition for Spam Detection.pdf
Format: Adobe PDF