Compression Of Convolutional Neural Networks Based On Groups Of Feature Templates

Posted on:2022-01-16

Degree:Master

Type:Thesis

Country:China

Candidate:G Li

Full Text:PDF

GTID:2518306338461304

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Convolutional neural networks have received increasing attention in academia and industry,achieving performance far beyond traditional methods on many machine learning tasks,such as image classification,object detection,and generative models.However,as the model performance improves,the increasing computing demand limits the deployment of the model to a large number of edge scenarios.Meanwhile,applying complex-dataset-oriented models to smaller datasets would increase network model redundancy and enhance the risk of overfitting,which will degenerate model generalization performance.As a conventional method of model compression,network pruning can remove the redundant structures in the model.However,there are still problems such as complicated iterative tuning of parameters and insignificant saliency gap between different substructures.To solve these problems,the pruning algorithm in network compression is improved and studied in this paper,and the main work and contributions conducted for this purpose are as follows:(1)A convolutional neural network compression method based on feature template sets is proposed,which performs L0 regularization to solve the problem that conventional L1 and L2 regularization cannot provide a clear pruning saliency gap.Where,the Half-Quadratic Splitting(HQS)method is applied to get the approximate solution of the L0 regularization,which enables the use of many gradient-based approaches to solving this NP-hard problem approximately.Experiments show that the proposed method can significantly expand the saliency gap between network structures,improve the confidence of pruning threshold selection,and help reduce the workload of threshold selection at each layer,compared with the conventional L1 and L2 regularization methods.(2)A parameter selection algorithm is proposed to redefine the hyperparameters in the original HQS based method.Compared with the original hyperparameter set,the redefined parameter set has obvious physical significance,and also reduces the difficulty of tuning the algorithm on different models and datasets.Experiments show that most of the parameters can be used without major changes to achieve good results on different data and models.(3)The effectiveness of proposed method in specific engineering applications is verified by applying it to the aerial powerline semantic segmentation model,and the common workflow of model compression is also be shown,which including baseline model selection,data enhancement,loss function optimization and model compression.In practical applications,the proposed method shows a better compression effect.

Keywords/Search Tags:

convolutional neural network, model compression, network pruning, regularization

PDF Full Text Request

Related items

1	Pruning-Based Compression Method For Convolutional Neural Network
2	Research On Convolutional Neural Network Compression Method Based On Dynamic Pruning And Weight Resetting
3	Research On The Simplification Method Of Neural Network Mode
4	Research And Application Of Structured Model Pruning Algorithm In Deep Convolutional Neural Networks
5	Research And Application Of Structured Pruning Of Convolutional Neural Networks
6	Research On Compression Method For Convolutional Neural Network Based On Pruning
7	The Research On Algorithm Optimization Of Convolutional Neural Network Model Compression
8	Research And Optimization Of Convolutional Neural Network Model Compression Method
9	Research On Convolutional Neural Network Model Compression Based On Pruning
10	Research On Deep Network Compression Method Based On Model Gradient Information