Sharing Knowledge without Sharing Data: Stitches can improve ensembles of disjointly trained models
By: Arthur Guijt , Dirk Thierens , Ellen Kerkhof and more
Deep learning has been shown to be very capable at performing many real-world tasks. However, this performance is often dependent on the presence of large and varied datasets. In some settings, like in the medical domain, data is often fragmented across parties, and cannot be readily shared. While federated learning addresses this situation, it is a solution that requires synchronicity of parties training a single model together, exchanging information about model weights. We investigate how asynchronous collaboration, where only already trained models are shared (e.g. as part of a publication), affects performance, and propose to use stitching as a method for combining models. Through taking a multi-objective perspective, where performance on each parties' data is viewed independently, we find that training solely on a single parties' data results in similar performance when merging with another parties' data, when considering performance on that single parties' data, while performance on other parties' data is notably worse. Moreover, while an ensemble of such individually trained networks generalizes better, performance on each parties' own dataset suffers. We find that combining intermediate representations in individually trained models with a well placed pair of stitching layers allows this performance to recover to a competitive degree while maintaining improved generalization, showing that asynchronous collaboration can yield competitive results.
Similar Papers
Choice Outweighs Effort: Facilitating Complementary Knowledge Fusion in Federated Learning via Re-calibration and Merit-discrimination
Machine Learning (CS)
Helps AI learn better from different sources.
Distributed Deep Learning using Stochastic Gradient Staleness
Machine Learning (CS)
Trains computer brains much faster for learning.
Asynchronous Personalized Federated Learning through Global Memorization
Machine Learning (CS)
Helps AI learn from phones without seeing your data.