Driver Gaze Prediction Based On Feature Fusion

Posted on:2023-03-19

Degree:Master

Type:Thesis

Country:China

Candidate:Z H Yu

Full Text:PDF

GTID:2542306914477164

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

With the increasing popularity of unmanned driving,the assisted driving system,as the upstream of the automatic driving predictive control chain,has received extensive research and attention in recent years.The driver’s gaze prediction is the core technology in the assisted driving system.By detecting the driver’s gaze direction,the driving system can assist the driver to make corresponding actions,so that the driver’s driving process is more comfortable,safe and stable.Compared with the limitations of manual features in the past,with the rapid development of deep learning in recent years,the feature extraction method based on deep neural network has higher accuracy and stronger generalization ability,so it has begun to be widely used in driver gaze prediction task.The main work of this thesis is as follows:(1)An appearance-based feature extraction model is proposed,which is mainly divided into a global face model and a local binocular model.For the global face model,the mainstream CNN network is used for feature extraction,and the 3D supervision signal is added to strengthen the model’s feature learning of the head pose,thereby improving the gaze prediction accuracy;for the local binocular model,a branch network is designed to detect multiple face regions.The feature extraction and feature fusion can strengthen the connection between local areas.(2)A geometry-based feature extraction model is proposed,and the features are mainly face landmarks and Delaunay triangles.Two feature extractors,DNN and GCN,are used to extract the above two types of features,and an identity normalization preprocessing and pseudo-labeling method is proposed to reduce the negative impact of identity bias on geometric feature expression,which can greatly improve the accuracy of gaze prediction.(3)Two model fusion strategies are proposed,namely linear weighted fusion and Transformer-based fusion.According to the respective characteristics of the two fusion methods,an experiment is designed to analyze the contribution rate of each sub-model and feature,and try to find the feature with the strongest expressive ability in the task of gaze prediction.

Keywords/Search Tags:

Convolutional Neural Network, 3D, supervisory signal, Graph Convolutional Neural Networks, Identity Normalization, Model Fusion

PDF Full Text Request

Related items

1	Dynamic Spatio-Temporal Graph Convolutional Neural Networks For Traffic Flow Forecasting
2	Research On Traffic Data Prediction Based On Graph Convolutional Network
3	Research On Hyperspectral Image Classification Algorithm Based On Graph Convolutional Network
4	Research On Pedestrian Detection Algorithm In The Field Of Autonomous Driving Based On Convolutional Neural Network
5	Research On Vehicle Re-identification Algorithm Based On Graph Convolutional Neural Network
6	Research On Traffic Flow Forecasting Method Based On Spatio-temporal Graph Convolutional Neural Networks
7	Research On Traffic Flow Prediction Method Based On Parallel Neural Networks
8	Fault Diagnosis Method Of Rotating Machinery Based On Graph Convolutional Network
9	Research Of Parking Prediction Algorithm Based On Deep Graph Neural Network
10	Applied Research Of Spatial Temporal Graph Model In Traffic Flow Prediction