A Time Series Classification Algorithm Based On Multi-scale Separable Convolution And Self-attention

Posted on:2024-07-04

Degree:Master

Type:Thesis

Country:China

Candidate:J Y He

Full Text:PDF

GTID:2530307079491344

Subject:Applied statistics

Abstract/Summary:

PDF Full Text Request

Time series classification is widely used in various fields such as behavior recognition,disease diagnosis,speech recognition,environmental monitoring,and industrial production.The difference between time series classification and general classification tasks lies in the fact that its features are ordered.However,current time series classification algorithms have certain limitations: On the one hand,traditional algorithms have high computational complexity,require laborious feature engineering,and demand researchers to possess domain-specific background knowledge.On the other hand,convolution-based deep learning algorithms are limited by their receptive field,which can only extract local features of the sequence and fail to capture the interdependent relationships between all time positions globally.The accuracy and generalization of these models still remain some space for improvement.To address these issues,this thesis proposes an end-to-end neural network structure that simultaneously extracts local information of the sequence at multiple scales and global temporal correlations,achieved mainly through a plug-in module called MSCMA.The module consists of two parallel branches: one branch uses depthwise separable convolution to improve the Inception Time module for extracting sub-sequence information at multiple scales;the other branch uses multihead self-attention with added positional encoding to extract global temporal correlations of the sequence.This paper conducts experiments on six single/multivariate time series datasets from UCR/UEA with seven comparative algorithms,and uses Friedman rank test and Hollander-Wolfe post hoc test to compare the model accuracy,verifying the significant improvement of the MSCMA model.This paper also provides a comparison of algorithm complexity,with the time complexity of the MSCMA module being (9)8)(9(8),which is competitive among the compared algorithms.Finally,this thesis addresses the weak interpretability of deep learning algorithms by conducting multidimensional interpretability research on Lightning2 data set.First,it uses class activation mapping to identify the subsequence that contributes most to the classification decision of the model;second,it uses Gramian angular field to encode one-dimensional convolutional kernel as an image and visualize the multi-scale convolutional kernel weights of the network to study their capture of different patterns of sequence information;finally,it calculates and visualizes the attention weight matrix to analyze the temporal correlations of different patterns of sequence information extracted by the multi-head self-attention.The interpretability analysis from these three perspectives all verify the effectiveness of the proposed structure in this thesis.

Keywords/Search Tags:

Time series classification, Multi-scale convolution, Depthwise separable convolution, Self-attention, Explainability

PDF Full Text Request

Related items

1	Motor Imagery EEG Signal Recognition Method Based On Depthwise Separable Convolution
2	Research On Classification Method Of Motor Imagery EEG Signals Based On Feature Expression And Separable Convolution
3	Coral Reef Benthic Substances Extraction Based On Convolutional Neural Networks
4	Research On EEG Emotion Recognition Based On Space-time Neural Networ
5	Research On Medical Image Segmentation Algorithm Based On Deep Learning
6	Application Of Semantic Segmentation Of Remote Sensing Image Based On Improved Unet In Surface Water Change
7	Time Series Prediction And Classification Method Based On Deep Learning
8	Research On Power Load Prediction Based On Time Series Convolution Lstm
9	Research And Application Of Microalgae Lightweight Detection Algorithm Based On Improved YOLO V7
10	X-ray Prohibited Items Detection Based On Multi-level Feature Pyramid