Research On Federated Learning Methods With Noisy Labels And Imbalanced Data

Posted on:2024-08-11

Degree:Master

Type:Thesis

Country:China

Candidate:H Y Li

Full Text:PDF

GTID:2568307151960449

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Federated learning is an emerging learning paradigm aimed at jointly learning knowledge from large-scale user nodes in a way that does not expose user privacy.However,due to the significant manpower required to obtain high-quality labeled data,user nodes inevitably carry noisy labeled data.At the same time,federated learning faces long tailed data distribution in application scenarios,where the number of data of different classes is highly imbalanced between local and global data.Both of these issues can lead to a serious decrease in model training effectiveness in federated learning scenarios.This thesis considers solving two problems simultaneously,and its main research content can be summarized as follows:Firstly,a long tailed robust and noisy resistant federated learning framework called Double Net Double Teacher(DNDT)framework is proposed.This framework is divided into two stages to process noisy labels and long tailed distribution data.In stage one,double networks are used to filter small loss samples for denoising,and in stage two,double teacher distillation is used to process long tailed distribution data.Secondly,three federated learning algorithms based on the DNDT framework,Fed Match,Fed Jo Co R,and Fed Coach,are proposed.These three algorithms adopt three different double network structures in the first stage and are implemented in conjunction with the same second stage design.Then,to address the communication cost issue of double network structures in federated learning,a low communication cost algorithm Fed Single Aug based on the DNDT framework is proposed.This algorithm uses a collaborative network simulated by a single network to filter noisy label samples,combined with a self-balanced method,effectively solving the problem of model accuracy degradation in federated scenarios with long tailed and noisy labels.Finally,two datasets with long tailed distribution and noisy labels were constructed based on CIFAR10 and Fashion MNIST.Based on these two datasets that are more in line with real-world scenarios,the performance of the four algorithms proposed in the paper based on the DNDT framework will be compared with existing federated learning algorithms.The results indicate that the Fed Single Aug algorithm has the best comprehensive performance.Subsequently,ablation experiments were conducted on the best performing Fed Single Aug to validate the effectiveness of the DNDT framework.

Keywords/Search Tags:

Federated Learning, Noisy Labels, Long-tailed Learning

PDF Full Text Request

Related items

1	Research On Robust Learning Methods Against Noisy Labels In Federated Learning
2	Research On Deep Learning Methods When Learning With Noisy Labels
3	Research On Learning With Noisy Labels Based On Label Distribution
4	Research On Key Technologies Of Privacy Preserving-oriented Modeling And Debugging In Federated Learning
5	Privacy-Preserving Federated Aggregation Technology For Data Quality
6	The Research On Long-tailed Visual Recognition Algorithms
7	Learn From Incomplete Labels In Image Classification
8	Research And Applications Of Multi-label Learning In Noisy Environment
9	Long-tailed Data Classification Based On Multi-granularity Feature Optimization
10	Research On Secure Federated Learning Based On Noisy Communication Links