基于Transformer与迁移学习的神经精神疾病分类研究

CASIA OpenIR > 毕业生 > 硕士学位论文

	基于Transformer与迁移学习的神经精神疾病分类研究
	李超
	2022-05-20
页数	82
学位类型	硕士
中文摘要	深度学习是目前最为热门的研究领域之一，随着算法的不断改进创新以及数据资源的不断累积，深度学习被应用到越来越多的领域中并在多种任务上取得了非常突出的表现。在医学领域，已有许多研究尝试使用深度学习技术对神经影像进行分析，辅助医生进行神经精神疾病的诊断，在减少医生工作负担的同时提升诊断效率及其准确率。对于深度学习来说，深度模型训练数据的规模大小很大程度上影响着模型最终的性能表现。而与常见的自然场景数据集相比，神经影像数据集往往规模较小，这容易导致深度模型出现过拟合问题，一定程度上限制了深度学习在神经精神疾病分类任务上的进一步提升与发展。基于上述背景，本文重点研究基于磁共振影像的神经精神疾病分类问题，通过网络结构优化与引入迁移学习方法缓解深度学习在神经影像数据集上所遇到的过拟合问题，提升深度模型在神经精神疾病分类上的性能表现，主要工作内容与创新点可归纳如下：（1）提出了融合CNN与Transformer的新型网络架构Trans-ResNet。所提出模型兼具了Transformer易于对长距信息进行建模与CNN易于对位置信息进行建模的优点。我们在多种神经精神疾病数据集上对所提出方法进行验证，结果表明Trans-ResNet能够取得明显优于 CNN、Transformer模型的性能表现，所提出的融合方式能够有效缓解Transformer在磁共振影像数据集上的过拟合问题。（2）基于所提出模型，将先进的masked autoencoder (MAE)预训练方法引入至基于神经影像的神经精神疾病分类任务中，创新性地将MAE原始方法中的重建输入图像更改为重建CNN输出的特征图，使其能够适用于所提出的Trans-ResNet模型，实验结果表明MAE预训练方法能够在不利用标注信息的前提下，有效地提升深度模型在多种下游神经精神疾病分类任务上的性能表现。（3）融合多种先进的无监督预训练及有监督预训练方法，基于大规模预训练数据集对深度模型进行多阶段预训练，过程中引入生成式无监督学习、判别式无监督学习及多任务有监督学习方法，建立了适用于不同神经精神疾病分类任务的预训练模型。本文在多个神经精神疾病分类数据集上对所提出方法进行了验证。结果表明所提出的预训练方法能够显著提升深度模型的分类表现且具有良好的泛化性，在未来可用于各种其他神经精神疾病的分类任务上。
英文摘要	Deep learning is one of the most popular research areas at present. With the continuous improvement and innovation of algorithms and the accumulation of data resources, deep learning has been applied to more and more fields and achieved outstanding performance in various tasks. In the medical field, deep learning has been widely applied in studies of neuroimaging analysis, which helps doctors make better decisions in the diagnosis of neuropsychiatric diseases, and improve the efficiency and accuracy of diagnosis while reducing the workload of doctors. For deep learning, the performance of the model is highly dependent on the size of training dataset. Compared with natural image datasets, most of neuroimaging datasets are relatively small. The limited data easily leads the deep model to overfitting, which limits the further improvement of deep learning in neuropsychiatric disease classification. Based on the aforementioned background, we focus on the classification of neuropsychiatric diseases based on magnetic resonance imaging. We reduce the overfitting of deep models when training on the neuroimaging dataset by optimization of network structure and the application of transfer learning We successfully improve the performance of deep model in neuropsychiatry disease classification. The main research content and innovations can be summarized as follows: (1) We propose a new network architecture named Trans-ResNet, which combines CNN and Transformer. The proposed architecture has both the superior ability of Transformer to model long-term dependencies and superior ability of CNN to capture spatial information. We validate the proposed method on a variety of neuropsychiatric disease datasets. The results show that Trans-ResNet can achieve significantly better performance than CNN and Transformer. The proposed fusion strategy can effectively reduce the overfitting of Transformer on magnetic resonance imaging datasets. (2) Based on the proposed network architecture, we introduce the advanced pre-training technique, masked autoencoder (MAE), into the neuroimaging-based the diagnosis of neuropsychiatric diseases. While in MAE, the model is trained to reconstruct the input image. By changing the reconstruction target from input image to the output feature map of CNN, we make MAE suitable for the proposed Trans-ResNet. The experimental results show that MAE can effectively improve the performance of the deep models on the diagnosis of neuropsychiatric disease without using the annotation information. (3) We integrate a variety of advanced unsupervised pre-training and supervised pre-training methods, use a multi-stage pre-training method includes generative unsupervised learning, discriminative unsupervised learning and multi-task supervised learning methods to pre-train the deep model in on a large-scale dataset. We finally establish a pre-training model suitable for different neuropsychiatric disease classification datasets. We validate the proposed method on multiple neuropsychiatric disease classification datasets. The results show that the proposed pre-training method can significantly improve the classification performance of the deep model and has good generalization, which can be used for classification of other neuropsychiatric diseases in the future.
关键词	深度学习迁移学习神经精神疾病神经影像分析大规模预训练
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/48883
专题	毕业生_硕士学位论文
推荐引用方式 GB/T 7714	李超. 基于Transformer与迁移学习的神经精神疾病分类研究[D]. 中国科学院自动化研究所. 中国科学院自动化研究所,2022.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
学位论文_李超.pdf（3620KB）	学位论文		限制开放	CC BY-NC-SA