基于预训练语言模型的媒体传播事件分析方法研究

CASIA OpenIR > 毕业生 > 硕士学位论文

	基于预训练语言模型的媒体传播事件分析方法研究
	钱昊达
	2022-05
页数	100
学位类型	硕士
中文摘要	随着新媒体传播方式的变革创新，线上媒体平台成为事件信息传播的主要载体，提供蕴含事件关联的情绪、主题及机构等富有互动性且多样化的多维度传播内容^[1]，深入挖掘这类信息有助于管理部门及时感知事件传播过程中用户的情绪态势、了解事件话题分布、跟踪事件关联机构，为进一步评估事件传播影响力提供决策支持。本文旨在借鉴预训练模型、图神经网络、阅读理解等领域的研究成果，从情绪要素提取、话题分布挖掘、主体机构判别三个方面开展对事件要素进行深度分析的方法研究，主要工作内容总结如下： 1. 基于阅读理解框架的情绪要素提取方法。事件内容蕴含丰富的情绪要素，体现了事件背后的深层情感趋势。针对现有情绪要素提取方法建模深度不足，缺少目标要素信息融合的问题，本文提出一种基于阅读理解框架的情绪要素提取方法。该方法首先以情绪要素查询为先验知识，基于预训练语言模型融合目标信息和文本内容，生成目标要素导向的文本语义特征表示。然后，利用层级多任务学习框架优化答案选择结果并允许模型抽取多个情绪要素。两个公开数据集上的实验结果验证了引入查询与多任务学习机制的情绪要素提取模型的有效性。 2. 基于异质图网络的事件话题挖掘方法。话题是事件主旨内容的概括性表达，为解决话题分布挖掘过程中存在文档语义稀疏和主题语义重叠的关键性问题，本文提出基于异质图网络的事件话题挖掘方法。该方法基于文档和词语构建异质文本图网络，在表示学习过程中，采用双通道编码模块通过多层异质图卷积网络和自编码器分别学习文档的结构和语义信息。模型将自编码器和图卷积网络在每一层的输出的隐藏状态结合以获得更全面的文档表示。此外，模型通过双重监督机制统一指导两个通道的学习过程。在真实事件话题数据集上的实验结果表明深度融合预训练模型和异质图的方法提高了话题分布挖掘的性能。 3. 基于多轮问答框架的主题-主体机构判别方法。事件传播过程涉及若干机构单位，而机构与事件所属话题通常存在潜在关联关系。为了充分利用事件话题信息从而准确判别事件相关的主要机构，本文提出基于多轮问答框架的主题-主体机构识别方法。该方法利用查询与预训练模型编码得到任务导向的上下文表示。模型在第一轮分别以文本段提取和选择题形式获取机构实体与主题，然后基于这些答案在第二轮构造查询完成主体机构判别。实验表明，提出的主题-主体机构识别方法能够有效挖掘机构、主题和主体机构之间的深层关联，从而提升主体机构识别的性能。
英文摘要	With the revolutionary changes of new media communication, online media platforms have become the main carrier of event diffusion, providing interactive and diverse multi-dimensional communication content with event-related emotions, topics and institutions. Mining these information facilitates the management department to perceive emotional situation, understand the distribution of topics, and track event-related institutions, which can provide decision support by assessing the impact of the event diffusion. This thesis aims to take advantage of the research progress in pre-trained language models, graph neural networks and reading comprehension based framework to analyze events from three aspects including emotion element extraction, topic mining and key institution recognition. The major works of this thesis are summarized as follows. 1. Firstly, a machine reading comprehension based method is proposed for emotion element extraction. The event contains rich emotion elements which reveal the deep emotional trends behind the event. To alleviate the problem of insufficient modeling granularity and lack of target information fusion in existing emotion element extraction models, this thesis proposes a machine reading comprehension based extraction method. The method first takes emotion element-related query as prior knowledge, and fuses target information with text content based on a pre-trained language model to obtain element-oriented text representation. Then, a hierarchical multi-task learning mechanism is designed to enhance the answer selection process and enables the model to extract multiple emotion elements at the same time. The experimental results on two public datasets demonstrate the efficacy of our proposed model with task-oriented query and multi-task learning structure. 2. Secondly, a heterogeneous graph based method is proposed for event topic mining. Topic serves as a generalized expression of the main content of an event. To tackle the document semantic sparsity and topic overlapping issue, we propose a heterogeneous text graph based event topic mining method, which constructs a heterogeneous text graph based on documents and words. During representation learning, a two-channel encoding module learns both structural and semantic information via multi-layer heterogeneous graph convolution network and auto-encodrer, seperately. We combine the hidden states learned by the auto-encoder and graph convolution network in each layer to obtain a more comprehensive representation of the document. Furthermore, a dual-supervised mechanism is used to uniformly guide the learning process of these two channels. Experiments on a real-world event topic dataset show that the proposed method that deeply combines pre-trained model and heterogeneous convolution neural graph significantly improves the topic mining performance. 3. Thirdly, a multi-turn question answering based method is proposed for topic and key institution recognition. Several institutions are involved in the course of event diffusion, and there is usually an implicit relationship between instutisions and event topics. To fully exploit event topic information to boost the performance of event-related key institution recognition, this thesis proposes a multi-turn question answering based method that jointly performs topic mining and key institution recognition. The method utilizes queries and pre-trained language model to obtain task-oriented contextualized semantic representation. The model peforms span selection to extract all the entities and answer multi-choice selection to mine topics in the first turn, then, it constructs queries based on previous answers and performs judgment to discriminate key institutions in the second turn. Experiments show that the proposed topic and key institution recognition method can effectively mine the deep relation among institutions, topics and key institusions to improve the performance of key institution recognition.
关键词	请输入关键词
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/48858
专题	毕业生_硕士学位论文
推荐引用方式 GB/T 7714	钱昊达. 基于预训练语言模型的媒体传播事件分析方法研究[D]. 中国科学院自动化研究所. 中国科学院自动化研究所,2022.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
钱昊达毕业论文0522.pdf（1536KB）	学位论文		限制开放	CC BY-NC-SA