基于结构信息增强的图神经网络研究 | |
呼奋宇![]() | |
2023-05-18 | |
页数 | 122 |
学位类型 | 博士 |
中文摘要 | 图数据在现实世界中广泛存在,它可以自然地表示不同的对象及其之间的复杂关系。例如,在社交网络中,用户可以表示为节点,用户之间的好友关系可以表示为边。随着神经网络和深度学习技术的不断发展,近年来图神经网络在图数据分析中逐渐成为主流方法。虽然相关研究已经取得一些进展,但现有方法在利用图结构信息方面仍然存在较大提升空间。首先,对于节点级别的研究,由于图数据结构的千变万化,节点之间通常存在错综复杂的关系。为了学到节点之间的相似性和差异性,图神经网络需要拟合一个非常复杂的非线性函数,这对于图神经网络的非线性表达能力提出了挑战。其次,在图级别任务中,图数据通常包含丰富的结构信息和语义信息,如何自适应地提取图数据中的关键信息,提升图神经网络的判别能力,也是一大挑战。另外,在特定的标签分布下,图神经网络的性能通常会受到限制。如何更好地挖掘图的拓扑结构和标签分布之间的关系,也具有重要意义。本文以充分利用图数据的结构信息为切入点,深入挖掘结构信息和特定问题之间的内在联系。具体地,在节点级别、图级别和标签级别三种不同层次的图任务中,研究如何增强图神经网络的非线性表达能力、关键信息的提取能力、和特定标签分布下的判别能力。本文取得的研究成果主要包含以下四项:
2. 提出一种基于二阶全局注意力的方法,用于图级别关键信息学习。图数据包含复杂多样的结构信息和语义信息,这些信息对图级别判别任务的影响可能不同,因此神经网络需要自适应地提取重要信息。针对此问题,本文从两方面设计全局注意力机制:首先,在通道方面,设计通道注意力机制对图表达的不同维度进行加权,从而对某些语义进行增强。特别地,考虑到不同通道语义之间存在关联关系,本文进一步提出一种二阶注意力机制。其次,由于同一个图中不同节点的重要性也不尽相同,本文还设计了节点注意力,对不同节点的权重进行自适应调节。实验结果表明,通道注意力和节点注意力可以有效学习图数据中的关键信息,最终提升图分类和图回归任务的性能。
|
英文摘要 | Graphs are ubiquitous in the real world, which can naturally represent different objects and their complex relationships. For example, in social networks, users can be represented as nodes and friendships between users can be represented as edges. With the continuous development of neural networks and deep learning, graph neural networks have gradually become the mainstream method for graph analytical tasks in recent years. Although some progress has been made in related research, there is still considerable room for improvement, particularly in making full use of graph structure information. Firstly, for node-level research, due to the diverse structure of the graph, there are often complex relationships between nodes. In order to judge the similarities and differences between nodes, graph neural networks need to learn an extremely complex non-linear mapping function, which poses a challenge to the non-linear expressive ability of graph neural networks. Secondly, in graph-level tasks, graphs typically contain rich structural and semantic information. How to adaptively extract key information and improve the discriminative ability of graph neural networks is also a major challenge. In addition, under a specific label distribution, the performance of graph neural networks is usually limited. How to better mine the relationships between the graph structure and the label distribution is also of great significance. Starting from fully utilizing the structure information of graphs, this thesis aims to explore the intrinsic relationships between structure information and specific problems. Specifically, from three different levels of graph analytical tasks, including node-level, graph-level, and label-level, investigation is conducted on how to enhance the non-linear expressive ability, the key information extraction ability, and the discriminative ability under specific label distribution. The main contributions of this thesis are summarized as follows: 1.This thesis proposes a method based on neighborhood interaction to tackle the problem of insufficient non-linear expressive ability of existing graph neural networks. We first summarize existing graph neural networks and point out that the only way to model non-linear relationships between nodes is the non-linear activation function. Then from the theoretical perspective, we prove that the non-linear activation functions brings in the interaction between neighbors, albeit in a subtle way. In order to enhance the non-linear expressive ability of graph neural networks, we propose to explicitly model the interaction information between neighbors, and then the neighborhood aggregation and neighborhood interaction are integrated. Due to the fact that 2. This thesis proposes a method based on a second-order global attention mechanism for the learning of graph-level key information. Graphs contain complex and rich structural and semantic information, which may have different impacts on the discrimination task. Therefore, graph neural networks need to adaptively extract important information. To this end, this thesis designs the global attention mechanism from two perspectives: first, in the channel aspect, a channel attention mechanism is designed to weight different dimensions of the graph representation, thereby enhancing some semantics. In particular, considering the correlation between the semantics of different channels, we further propose a second-order attention mechanism. Secondly, considering different nodes contribute to the graph representation differently, we also design the node attention mechanism to adjust node weights adaptively. Experimental results show that channel attention and node attention can effectively learn key information from graphs, leading to performance improvement in graph classification and graph regression tasks. 3. This thesis proposes a structure-encoding graph neural network based on mixture-of-experts. In scenarios where the distribution of labels is imbalanced, graph neural networks tend to predict towards the majority class, resulting in poor performance towards the minority class. Considering the relationship between the graph structure and the imbalanced distribution of labels, this thesis proposes a structure-encoding graph neural network based on mixture of multi-view experts. Firstly, in the process of neighborhood aggregation and graph pooling, a diversified message passing mechanism and a diversified message readout mechanism are designed, respectively. These two mechanisms can extract different structural information and obtain representations of different views. Among them, the representations under each view correspond to a set of multi expert classifiers. 4. This thesis proposes a structure-guided multi-task learning graph neural network. Taking the classic scenario of multi-task molecular property prediction as an example, this thesis focuses on the common label missing phenomenon. |
关键词 | 图神经网络 结构信息 邻居交互 注意力机制 多专家融合 多任务学习 |
语种 | 中文 |
七大方向——子方向分类 | 机器学习 |
国重实验室规划方向分类 | 智能计算与学习 |
是否有论文关联数据集需要存交 | 否 |
文献类型 | 学位论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/52302 |
专题 | 毕业生_博士学位论文 |
推荐引用方式 GB/T 7714 | 呼奋宇. 基于结构信息增强的图神经网络研究[D],2023. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
毕业论文-基于结构信息增强的图神经网络研(4023KB) | 学位论文 | 限制开放 | CC BY-NC-SA |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[呼奋宇]的文章 |
百度学术 |
百度学术中相似的文章 |
[呼奋宇]的文章 |
必应学术 |
必应学术中相似的文章 |
[呼奋宇]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论