面向非独立同分布场景的自适应与个性化联邦控制方法 | |
陈薏竹![]() | |
2022-05-17 | |
页数 | 94 |
学位类型 | 硕士 |
中文摘要 | 大数据时代广泛存在的数据孤岛问题对人工智能的应用与落地提出了新的挑战,为进一步发挥人工智能和大数据的潜力,联邦生态从系统化的角度定义了以联邦数据、联邦控制、联邦管理、联邦服务为核心的统一整体,有助于推动从数据、服务到智能的自动化转变,这一从上游数据到下游应用的标准化、体系化的解决方案也为不同领域的实际管理需求提供了理论层面的指导。其中,联邦学习将为其提供算法层面的技术支持。 |
英文摘要 | The widespread problem of data silos in the era of big data poses new challenges for applying and implementing artificial intelligence. Federated learning aims to establish a secure collaborative mode of large-scale distributed devices, providing a new way to break down data silos. In order to further develop the potential of artificial intelligence and big data, the federated ecology defines a unified whole centred on federated data, federated control, federated management, and federated service from a systematic perspective, which helps to promote automated transformation from data, service to intelligence. This standardized and systematic solution from upstream data to downstream applications also provides theoretical guidance for practical management needs in different fields. As the core of the federated ecology, federated control can transform the abstract tasks from federated management into specific execution rules and control the scheduling of federated nodes, including formulating node selection strategies and collaboration rules. However, due to the differences in data collection conditions and equipment conditions of different federated nodes in real scenarios, the local data is usually not independent and identically distributed (IID), which will harm the final effect of the federated model. Therefore, this thesis studies the federated control methods on non-independent and identically distributed (Non-IID) scenarios to solve the above problems. The main works are presented as follows: 1. Construction and analysis of Non-IID scenarios for federated control. Aiming at the widespread Non-IID situations such as label distribution skew and data volume skew in federated collaboration scenarios, this thesis firstly simulates real scenarios to divide two public image classification datasets MNIST and Fashion-MNIST, and further constructs datasets with the imbalanced number of classes and samples. Subsequently, this thesis further analyzes the impact of different distributions and different skew degree on the federated learning model, which provides a basis for designing a federated control method with higher training efficiency and greater adaptability. 2. An adaptive federated control method based on training trend evaluation. To reduce the high communication cost of the federated control when scheduling clients for cooperation, this thesis optimizes the node selection strategy and information integration rules of the server, which play the role of the federated control centre, starting from the two steps of participant selection and global model update. For the participant selection step, a training trend evaluation method based on the Mann-Kendall trend test is designed. By tracking the local training effect of each client, the most suitable clients will be selected to participate in the current round of training. For the global model update step, this method further adds the gradient similarity coefficient as the weight of each participant. Experiments show that this method can complete the federated training task efficiently and improve the indicators of the global model, such as accuracy. 3. A personalized federated control method based on similarity measures. To further meet the individual needs of different nodes, this thesis designs a clustered personalized federated control method with low communication and storage costs, which optimizes the collaborative rules of federated control. First, the server will maintain multiple global cluster models for all clients. When the client completes local training, the server will promptly schedule the similarity measurement method to update the cluster it belongs to. In this process, neither additional data transmission nor much intermediate information needs to be maintained. In addition, the regularization-based local training method also makes the local iteration more efficient, which further improves the personalization performance of the cluster models. Experimental results show that this method can achieve the effect of federated clustering training as expected and meet the individual needs of every client without generating additional communication costs and excessive computational overhead. |
关键词 | 自适应联邦控制 个性化联邦控制 联邦学习 非独立同分布 |
语种 | 中文 |
文献类型 | 学位论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/48884 |
专题 | 毕业生_硕士学位论文 |
推荐引用方式 GB/T 7714 | 陈薏竹. 面向非独立同分布场景的自适应与个性化联邦控制方法[D]. 中国科学院自动化研究所. 中国科学院自动化研究所,2022. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
陈薏竹-面向非独立同分布场景的自适应与个(10279KB) | 学位论文 | 限制开放 | CC BY-NC-SA |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[陈薏竹]的文章 |
百度学术 |
百度学术中相似的文章 |
[陈薏竹]的文章 |
必应学术 |
必应学术中相似的文章 |
[陈薏竹]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论