The Research On Probabilistic Prediction Based On Natural Gradient Boosting(NGBoost)

Posted on:2022-02-22

Degree:Master

Type:Thesis

Country:China

Candidate:S Liang

Full Text:PDF

GTID:2480306509489154

Subject:Applied Statistics

Abstract/Summary:

PDF Full Text Request

In the real world,many of the supervised machine learning problems we have encountered have tabular features and real-valued targets.But on some of these issues,the uncertainty estimation of forecasts is often crucial,such as when dealing with tasks such as health care and weather forecasting.Probabilistic prediction,which is the approach where the model outputs a full probability distribution over the entire outcome space,is a natural way to quantify those uncertainties.Gradient Boosting Machines(GBM)have been widely successful in prediction tasks on structured input data,but a simple boosting solution for probabilistic prediction of real valued outputs is yet to be made.In this article,we try to use the natural gradient boost(NGBoost)method which uses the Natural Gradient to address technical challenges.In the theoretical knowledge part of this article,we introduced that the natural gradient boosting method is modular,and the base learner,probability distribution and scoring rules in the structure can be freely selected.Based on the logarithmic scoring rule(MLE),we use the KL divergence in information theory to measure the difference between the distributions,and lead to the concept of natural gradient,which defines the search direction in the iterative algorithm.In the simulation experiment part of this article,we use the content data of SO₂in the air observed at the Beijing Olympic Sports Center site to predict the probability of air quality.We improved the choice of the base learner in the algorithm,and solved the probability prediction problem in regression tasks by using ridge regression,support vector regression,and echo state network models.At the same time,we use the original gradient booster to perform regression prediction.By comparing the effects of the simulation experiments under the two gradient algorithms,we can find that for the probability prediction problem,using natural gradients for model enhancement will have good performance in practical applications.

Keywords/Search Tags:

Probability Prediction, KL Divergence, Natural Gradient Boosting, Air Quality Prediction, Model Comparison

PDF Full Text Request

Related items

1	Research On AQI Prediction Model Of Hefei City Based On Boosting Algorithm
2	Method Research For The Prediction Of MicroRNA-disease Associations Based On Anti-noise Gradient Boosting Tree And Learning Interactions Of Featuresbased Algorithm
3	Research And Application Of Optimizing Survival Analysis Method By Gradient Boosting Tree
4	Research On Medium And Long Term Runoff Probability Prediction Method Coupled With CFS Ensemble Forecast And Deep Learning
5	Study On Metallogenic Prediction Based On The Fusion Model Of GBDT And SVM
6	Research On Reservoir Prediction Method Based On Probability Graph Model
7	Stock Price Prediction And Analysis Based On EEMD And XGBoost Algorithm
8	Research On Singularities In The Parameter Identification Of Dynamic Systems
9	Study On Groundwater Circulation Mechanism And Dynamic Prediction Of Lake Level In Hongjiannao Basin
10	Research On Continuous Prediction Method Of Respiratory Signal Based On Human Physiological Data