Show simple item record

dc.contributor.author: Nguyen, Quan
dc.contributor.author: Pham, Hieu H.
dc.contributor.author: Nguyen, Phi Le
dc.contributor.author: Wong, Kok-Seng
dc.contributor.author: Nguyen, Truong Thao
dc.contributor.author: Do, Minh N.
dc.date.accessioned: 2024-11-21T16:26:49Z
dc.date.available: 2024-11-21T16:26:49Z
dc.date.issued: 2024-01
dc.identifier.uri: https://vinspace.edu.vn/handle/VIN/433
dc.description.abstract: In Federated Learning (FL), the size of local models matters. On the one hand, it is logical to use large-capacity neural networks in pursuit of high performance. On the other hand, deep convolutional neural networks (CNNs) are exceedingly parameter-hungry, which makes memory a significant bottleneck when training large-scale CNNs on hardware-constrained devices such as smartphones or wearable sensors. Current state-of-the-art (SOTA) FL approaches either test their convergence properties only on tiny CNNs with inferior accuracy or assume clients have adequate processing power to train large models, which remains a formidable obstacle in practice. To overcome these issues, we introduce FedDCT, a novel distributed learning paradigm that enables the use of large, high-performance CNNs on resource-limited edge devices. As opposed to traditional FL approaches, which require each client to train the full-size neural network independently during each training round, FedDCT allows a cluster of several clients to collaboratively train a large deep learning model by dividing it into an ensemble of several small sub-models and training them on multiple devices in parallel while maintaining privacy. In this collaborative training process, clients from the same cluster can also learn from each other, further improving their ensemble performance. In the aggregation stage, the server takes a weighted average of all the ensemble models trained by all the clusters. FedDCT reduces the memory requirements and allows low-end devices to participate in FL. We conduct extensive experiments on standardized datasets, including CIFAR-10, CIFAR-100, and two real-world medical datasets, HAM10000 and VAIPE. Experimental results show that FedDCT outperforms a set of current SOTA FL methods with interesting convergence behaviors. Furthermore, compared to other existing approaches, FedDCT achieves higher accuracy and substantially reduces the number of communication rounds (with 4–8 times lower memory requirements) needed to reach the desired accuracy on the test set, without incurring any extra training cost on the server side.
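As a rough illustration of the aggregation stage described in the abstract (the server taking a weighted average of the ensemble models trained by the clusters), the Python sketch below averages per-cluster sub-model parameters with per-cluster weights. The function name aggregate_clusters, the parameter-dict data layout, and the data-size-based weights are assumptions made for this example, not the authors' implementation.

```python
import numpy as np

def aggregate_clusters(cluster_ensembles, cluster_weights):
    """Weighted average of ensemble sub-model parameters across clusters.

    cluster_ensembles: one entry per cluster; each entry is a list of
        sub-model parameter dicts ({param_name: np.ndarray}).
    cluster_weights: relative weight of each cluster (e.g. its local data size).
    Returns the aggregated ensemble as a list of parameter dicts.
    """
    weights = np.asarray(cluster_weights, dtype=float)
    weights = weights / weights.sum()  # normalize so the weights sum to 1

    num_sub_models = len(cluster_ensembles[0])
    global_ensemble = []
    for s in range(num_sub_models):
        # Average the s-th sub-model's parameters over all clusters.
        averaged = {
            name: sum(w * ens[s][name] for w, ens in zip(weights, cluster_ensembles))
            for name in cluster_ensembles[0][s]
        }
        global_ensemble.append(averaged)
    return global_ensemble

# Toy usage: two clusters, each holding an ensemble of two small sub-models.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    ensembles = [
        [{"w": rng.normal(size=(3, 3))} for _ in range(2)],  # cluster 0's ensemble
        [{"w": rng.normal(size=(3, 3))} for _ in range(2)],  # cluster 1's ensemble
    ]
    agg = aggregate_clusters(ensembles, cluster_weights=[100, 300])
    print(agg[0]["w"].shape)  # (3, 3)
```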
dc.language.iso: en
dc.subject: federated learning
dc.subject: deep convolutional neural networks
dc.subject: split learning
dc.subject: edge devices
dc.subject: edge learning
dc.subject: collaborative training
dc.title: FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource-Constrained Devices Using Divide and Collaborative Training
dc.type: Article


This item appears in the following Collection(s)

  • Kok-Seng Wong, PhD [16]
    Associate Professor, Computer Science program, College of Engineering and Computer Science


