Research On Domain Generalization Based On Image Frequency Domain Features

Posted on:2024-06-25

Degree:Master

Type:Thesis

Country:China

Candidate:J Y Wang

Full Text:PDF

GTID:2568306914965709

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Domain generalization(DG)is a very important fundamental problem in computer vision and pattern recognition.Adaptation to out-ofdistribution data is a tough challenge for all deep learning algorithms that strongly rely on the independent and identically distributed assumption.It leads to unavoidable labor costs and confidence crises in realistic applications.For that,domain generalization aims at mining domainirrelevant knowledge from the source domain that can generalize to unseen target domains,and the key is how to guide the model to learn domain invariant knowledge.In this thesis,we focus on domain generalization for image classification tasks.The previous arts mainly focus on how to learn with inter-domain interpolation or mine domain-irrelevant knowledge,but less effort is paid to studying the very question of which components of images carry the semantic information shared across domains.In this thesis,we tackle domain generalization problems with empirical prior knowledge.First,we propose a disentanglement and interaction method based on high-and low-frequency features of images(H&L)in this thesis.By visualizing and experimental analyzing the high-and low-frequency components of the image,we find that the high-frequency information of images depicts object edge structure and contains high-level semantic information,which is naturally consistent across different domains,and the low-frequency component is much more domain-specific but contains object global structure of images.Motivated by the above analysis,we design an encoder-decoder structure to disentangle high-and lowfrequency features of an image and propose a spatial-mask-based information interaction mechanism to ensure the helpful knowledge from both two parts can cooperate effectively,which can obtain more representational features.Then,we propose a frequency-domain-based data augmentation method(FDAG)in this thesis,which introduces additive noise and multiplicative noise in the image frequency domain.On the one hand,the method enriches the number of source domain data and the diversity of distribution,and reduces the risk of overfitting the source domain data;on the other hand,it can alleviate the sensitivity of the model to frequency fluctuations,thus improving the robustness of the model in extracting highand low-frequency features of the image.Therefore,FDAG can be effectively combined with H&L to further improve the generalization ability of the model.The proposed method obtains state-of-the-art performance on three widely used domain generalization benchmarks.Moreover,extensive ablation studies are implemented to conduct an in-depth study and analysis of the method.

Keywords/Search Tags:

image classification, domain generalization, data augmentation, representation disentanglement, information interaction

PDF Full Text Request

Related items

1	The Research On Multi-source Domain Generalization For Image Classification
2	Research And Implementation On Image Classification Algorithm Based On Domain Generalization
3	Research On Fine-grained Image Classification Algorithm Based On Feature Interaction
4	Methods For Improving Generalization Of Convolutional Neural Networks On Image Classification And Detection
5	Research On Domain Generalization By Feature Style Augmentation
6	Research On Unknown Target Position Detection Method Based On Domain Generalization And Information Enhancement
7	Research On Data Augmentation Method Based On Data Patterns Supplement
8	Research On Few-Shot Image Classification Oriented By Task-Generalization
9	Domain Generalization In Deep Learning-Based Image Classification
10	Research Of Attribute Generalization Towards Classification Based On Domain Knowledge