Word Semantic Similarity Based on CiLin and Word2vec | |
Yushang Mao1,2; Guixuan Zhang1; Shuwu Zhang1 | |
2020-10 | |
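Conference name | The 1st International Conference on Culture-oriented Science & Technology (ICCST) |
Conference date | October 30-31, 2020 |
Conference location | Beijing, China |
Host country | China |
Proceedings editor / conference organizer | Communication University of China; Institute of Automation, Chinese Academy of Sciences |
Ownership order | 1 |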
Abstract | This paper presents a method for calculating word semantic similarity using Tongyici CiLin and Word2vec. In the CiLin component, the semantic similarity of words is calculated using the distance between words as the main factor, with the number of branches and the distance between branches as fine-tuning parameters. In the Word2vec component, the paper constructs a special-purpose corpus from movie reviews and uses a Word2vec model to calculate the semantic similarity of Chinese words. The final semantic similarity is then obtained by fusing the CiLin and Word2vec scores with a dynamic weighting strategy. The method makes full use of the semantic information about words in both the knowledge base and the corpus. The experimental results show that the algorithm achieves better accuracy and is more robust to domain sensitivity. |
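Seven major directions, sub-direction category | Artificial intelligence + culture |
Document type | Conference paper |
Item identifier | http://ir.ia.ac.cn/handle/173211/47524 |
Collection | Research Center for Digital Content Technology and Services_Copyright Intelligence and Cultural Computing |
Corresponding author | Yushang Mao |
Author affiliations | 1. Institute of Automation, Chinese Academy of Sciences; 2. University of Chinese Academy of Sciences |
First author affiliation | Institute of Automation, Chinese Academy of Sciences |
Corresponding author affiliation | Institute of Automation, Chinese Academy of Sciences |
Recommended citation (GB/T 7714) | Yushang Mao, Guixuan Zhang, Shuwu Zhang. Word Semantic Similarity Based on CiLin and Word2vec[C]//Communication University of China, Institute of Automation, Chinese Academy of Sciences, 2020. |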
Files in this item |
File name/size | Document type | Version type | Access type | License |
MaoYushang_Word Sema(285KB) | Conference paper | | Open access | CC BY-NC-SA |
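
The fusion step summarized in the abstract can be illustrated with a minimal sketch. The record does not give the paper's formulas, so everything here is an assumption made for illustration: the function names `cosine_similarity` and `fuse_similarity`, the parameter `alpha`, and the fallback rule for words missing from one resource are hypothetical and are not taken from the paper. Cosine similarity is the usual way to compare Word2vec vectors; the weighting rule below is only one simple way a weight could adapt to which resources cover a word pair.

```python
import math
from typing import Optional, Sequence


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_u * norm_v)


def fuse_similarity(
    cilin_sim: Optional[float],
    w2v_sim: Optional[float],
    alpha: float = 0.5,
) -> float:
    """Blend a thesaurus-based score with an embedding-based score.

    Hypothetical weighting rule (NOT the paper's formula):
    - both scores available: alpha * cilin_sim + (1 - alpha) * w2v_sim
    - only one score available: return that score unchanged
    - neither available: return 0.0
    """
    if cilin_sim is None and w2v_sim is None:
        return 0.0
    if cilin_sim is None:
        return w2v_sim
    if w2v_sim is None:
        return cilin_sim
    return alpha * cilin_sim + (1.0 - alpha) * w2v_sim
```

A call such as `fuse_similarity(0.8, 0.4, alpha=0.5)` averages the two scores, while `fuse_similarity(None, 0.4)` falls back to the embedding score alone when a word pair is absent from the thesaurus.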