A unified framework for multi-modal federated learning

doi:10.1016/j.neucom.2022.01.063

	A unified framework for multi-modal federated learning
	Xiong, Baochen 1,5; Yang, Xiaoshan2,3,5 ; Qi, Fan 4,5; Xu, Changsheng2,3,5
发表期刊	NEUROCOMPUTING
ISSN	0925-2312
	2022-04-01
卷号	480 页码:110-118
通讯作者	Xiong, Baochen(bcxiong@yeah.net)
摘要	Federated Learning (FL) is a machine learning setting that separates data and protects user privacy. Clients learn global models together without data interaction. However, due to the lack of high-quality labeled data collected from the real world, most of the existing FL methods still rely on single-modal data. In this paper, we consider a new problem of multimodal federated learning. Although multimodal data always benefits from the complementarity of different modalities, it is difficult to solve the multimodal FL problem with traditional FL methods due to the modality discrepancy. Therefore, we propose a unified framework to solve it. In our framework, we use the co-attention mechanism to fuse the complementary information of different modalities. Our enhanced FL algorithm can learn useful global features of different modalities to jointly train common models for all clients. In addition, we use a personalization method based on Model-Agnostic Meta-Learning(MAML) to adapt the final model for each client. Extensive experimental results on multimodal activity recognition tasks demonstrate the effectiveness of the proposed method. (c) 2022 Elsevier B.V. All rights reserved.
关键词	Multi-modal Federated learning Co-attention
DOI	10.1016/j.neucom.2022.01.063
收录类别	SCI
语种	英语
资助项目	National Key Research and Development Program of China[2018AAA0100604] ; National Natural Science Foundation of China[61720106006] ; National Natural Science Foundation of China[62072455] ; National Natural Science Foundation of China[61721004] ; National Natural Science Foundation of China[U1836220] ; National Natural Science Foundation of China[U1705262] ; National Natural Science Foundation of China[61872424]
项目资助者	National Key Research and Development Program of China ; National Natural Science Foundation of China
WOS研究方向	Computer Science
WOS类目	Computer Science, Artificial Intelligence
WOS记录号	WOS:000761796800009
出版者	ELSEVIER
引用统计	被引频次：19[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	http://ir.ia.ac.cn/handle/173211/48085
专题	多模态人工智能系统全国重点实验室_多媒体计算
通讯作者	Xiong, Baochen
作者单位	1.Zhengzhou Univ, Henan Inst Adv Technol, Zhengzhou, Peoples R China 2.Chinese Acad Sci, Inst Automat, NLPR, Beijing, Peoples R China 3.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China 4.Hefei Univ Technol, Hefei, Peoples R China 5.Peng Cheng Lab, Shenzhen, Peoples R China
推荐引用方式 GB/T 7714	Xiong, Baochen,Yang, Xiaoshan,Qi, Fan,et al. A unified framework for multi-modal federated learning[J]. NEUROCOMPUTING,2022,480:110-118.
APA	Xiong, Baochen,Yang, Xiaoshan,Qi, Fan,&Xu, Changsheng.(2022).A unified framework for multi-modal federated learning.NEUROCOMPUTING,480,110-118.
MLA	Xiong, Baochen,et al."A unified framework for multi-modal federated learning".NEUROCOMPUTING 480(2022):110-118.