Design And Implementation Of Elastic Deep Learning Model Deployment Platform Based On Kubernetes

Posted on:2023-06-08

Degree:Master

Type:Thesis

Country:China

Candidate:L X Shen

Full Text:PDF

GTID:2558306911984959

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

Artificial intelligence and cloud native are the current mainstream frontier technology fields driven by open source software.Therefore,AI platforms based on cloud native architecture are also constantly being explored and practiced.As one of the important branches in the field of artificial intelligence,deep learning has been deeply researched and achieved considerable results on how to train accurate network models.However,from data collection to the construction of network models,the ultimate goal of deep learning projects is to deploy models online to provide services.Therefore,the current hot technology of artificial intelligence extends from improving the accuracy of algorithm models to the engineering of models.There are the following problems in the process of model landing engineering:(1)Different models have complex requirements for environment and version dependencies when encapsulating,and conflicts are prone to occur between models;(2)Model services need to consume a lot of computing resources and require Fully consider the impact of elastic resource allocation on cluster stability;(3)As a delay-sensitive task,model service has higher requirements on request response time and success rate.Therefore,the main research goal of this paper is to explore and design a flexible model serving method and design and implement an efficient model deployment solution.Aiming at the problem of deep learning model landing engineering,this paper builds a deep learning model deployment platform based on Docker container and Kubernetes container management technology,which encapsulates and automates the deployment of the trained model,and realizes elastic scaling and real-time monitoring of resources.Function.The main work of the thesis includes:(1)Distributed deployment of model services based on Kubernetes technology,and optimized for the lag problem existing in the native responsive expansion and contraction mechanism,a load prediction model based on EMD-LSTM was proposed,which was predicted according to historical load conditions.The future trend is to perform expansion/reduction operations in advance before high/low loads come.In the study of the load forecasting model,through the experimental comparison with the traditional statistical method ARIMA and the neural network-based algorithm LSTM,two current mainstream time series forecasting algorithms,it is verified that the EMD-LSTM model proposed in this paper is suitable for the load in the cloud environment.Predictions have higher accuracy.(2)For the current mainstream deep learning frameworks Py Torch and Tensorflow,this paper studies how to encapsulate the model trained based on these two frameworks into a service,and how to evenly forward the traffic to the back-end nodes when the service is deployed on multiple nodes.The load balancing of requests,combined with the load prediction model proposed in this paper,improves the native elastic scaling mechanism of Kubernetes,and designs and implements an elastic deep learning model deployment platform.Finally,the platform is tested from the two aspects of function and performance.In the performance test,it is verified that the improved predictive elastic expansion and shrinkage mechanism in this paper can successfully predict the future load trend,and perform expansion/reduction operations in advance,thereby avoiding the need to achieve The sudden increase in response time and the rejection of a large number of requests caused by the expansion operation at the load threshold effectively reduce the average response time of requests,improve the success rate of requests,and enable the model to provide services more stably.(3)In order to be able to know the running status and resource usage of the cluster at all times and ensure that all nodes are in normal working state,the model deployment platform is designed to monitor the Kubernetes cluster.

Keywords/Search Tags:

Model Deployment, Kubernetes, Autoscaling, EMD, LSTM

PDF Full Text Request

Related items

1	Adaptive Autoscaling On Kubernetes:optimization And Implementation
2	Research On Kubernetes-oriented Automatic Container Elastic Scaling Technology
3	Research And Application Of LSTM Based E-Commerce Platform Kubernetes Cluster Elastic Scaling
4	Design And Implementation Of Spark Platform For Big Data Streaming Computing Based On Kubernetes
5	Research And Implementation Of Container Resource Management System Based On Kubernetes
6	Research On Flexible Scaling Method Of Kubernetes Container For Web Application
7	Research And Implementation Of Algorithm For Autoscaling Overall Cloud Services
8	Design And Implementation Of A Continuous Integration And Deployment System Based On Kubernetes
9	Design And Implementation Of Container Cluster Deployment Management Platform Based On Docker
10	Research On Autoscaling Algorithms For Microservices In Edge Clouds