Optimization Method And Application Of Combinatorial Multi-Armed Bandit With Fairness Constraint

Posted on:2024-08-24

Degree:Master

Type:Thesis

Country:China

Candidate:S S Wang

Full Text:PDF

GTID:2568307100988779

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

The rapid development of artificial intelligence and machine learning has caused social concerns about fairness.Most current machine learning algorithms focus on benefit maximization,which will lead to unfair for users.A lot of research works use multi-armed bandit model to achieve the multi-objective optimization of fairness and benefit.However,they mainly focuses on theoretical verification and has shortcomings in application.As for this,this work proposes a fair combinatorial multi-armed bandit method and applies it to distributed system,which realizes benefit maximization based on meeting fairness requirements.The main work is as follows:(1)Investigated the research background of fairness algorithms and multi-armed bandit algorithms,and sorted out related research work about fair multi-armed bandit algorithms.Most of the existing methods only consider the simulation model of small number of arms,which will cause model inaccuracy in large-scale distributed systems.For this problem,this paper designs a fair lottery algorithm F-Lottery,which makes full use of the probabilistic fairness and randomness of lottery scheduling.(2)In order to realize the multi-objective optimization of fairness and benefit,based on fair lottery algorithm and the upper confidence bound algorithm,this paper proposes a fairness combined multi-armed bandit optimization algorithm named FairLUCB.Fair lottery algorithm F-Lottery is used to quantify the fairness index,and UCB algorithm is used to achieve benefit maximization.The benefit value of fair lottery algorithm and UCB algorithm are weighted and combined to design a combined benefit value.Finally,an overall framework that takes into account both fairness and benefit has been formed.Through theoretical analysis,the logarithmic regret bound of FairLUCB algorithm is derived,and the theoretical feasibility of it is proved.(3)This paper not only uses simulation experiments to evaluate the effectiveness of the optimization method,but also applies the Fair-LUCB algorithm to client selection process in federated learning.This is a new solution framework for fairness research.In experiment,the minimum utilization rate of the client is used as the fairness requirement,and the average training accuracy maximization is used as the benefit goal.The experimental results show that the Fair-LUCB algorithm can achieve 27.4%average training accuracy higher than baseline algorithm,and achieve doubled time efficiency based on meeting fairness requirements.Experiments have proved that the proposed optimization method can achieve multi-objective optimization of fairness and training accuracy maximization in the distributed system.

Keywords/Search Tags:

fairness, multi-objective optimization, multi-armed bandit, lottery scheduling, federated learning

PDF Full Text Request

Related items

1	Research On Client Selection For Federated Learning Using Combinatorial Multi-Armed Bandit
2	User Scheduling For Federated Learning In Wireless Networks Based On Reinforcement Learning And Matching Theory
3	Research On The Algorithm Of Personalized Learning System Based On Multi-armed Bandit
4	A Buffered Semi-Asynchronous Mechanism With MAB For Efficient Federated Learning
5	Research On User Scheduling And Resource Allocation Algorithm For Federated Learning
6	Research On Channel Selection Mechanism Based On Multi-armed Bandit In Cognitive Network
7	Research And Application Of Large-scale Multi-armed Bandit Algorithm
8	Study On Relay Selection Algorithm Based On Multi-Armed Bandit In Underwater Acoustic Cooperative Communication Networks
9	Research On Resource Allocation And Optimization Strategy Based On Multi-Armed Bandit In Wireless Networks
10	Research On Multi-armed Bandit Aided Online Learning Approach For Wireless Caching Strategy