Object Detection Of Remote Sensing Images Based On Transformer

Posted on:2024-01-27

Degree:Master

Type:Thesis

Country:China

Candidate:C P Zhang

Full Text:PDF

GTID:2542306941997449

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The applications of object detection,such as face recognition on smartphones and automatic scanning of courier codes,have already become integrated into our daily lives.In contrast,object detection in remote sensing images often takes place in places we are not familiar with,such as wildlife conservation,disaster relief,and national security monitoring.These applications are closely related to our safety,and therefore require higher accuracy and efficiency.In this study,after researching mainstream detection models,we chose the Swin Transformer based on the Transformer as the main structure,combined with a feature fusion encoder and task alignment encoder to construct a detection network.We also made improvements to the backbone network,the representation of rotated boxes,and the loss function,achieving an improvement in detection effect and accuracy.The main contributions of this study are as follows:(1)Remote sensing images often contain numerous small objects,and detecting them requires stronger feature extraction capabilities.Swin Transformer uses window-based attention mechanism instead of global attention mechanism to reduce computational complexity,but it also loses contextual information of the data.Although a mobile window attention mechanism has been added,the improvement is limited.To solve this problem,this article designs a feature enhancement module to help select representative data between each window and calculate global and channel attention to increase the contextual information in the features.The feature enhancement module is added to Swin Transformer to improve the contextual information in the features.(2)In remote sensing image object detection,targets such as airplanes,ships,and cars often appear at various angles and in clusters.To reduce the risk of falsely deleting overlapping objects,rotated bounding boxes are used instead of horizontal bounding boxes.However,the rectangular coordinate system’s representation of rotated boxes is too complex and not conducive to model training.To simplify the representation of rotated boxes,polar coordinate representation is adopted,and combined with polar ring area loss function,the polar coordinate loss function is designed in this article to solve the problem of separate calculations between angle and polar radius in the loss function without linkage.(3)Experimental validation was conducted using the DOTA dataset,achieving an m AP of 74.21%.A comparative analysis was performed with state-of-the-art models,and ablation experiments were conducted.Our proposed method exhibits excellent detection performance on small targets in remote sensing images.It also demonstrates higher detection accuracy in complex scenarios with multiple clustered objects.These results provide evidence that our proposed method is more suitable for remote sensing image object detection tasks.

Keywords/Search Tags:

Object Detection, Swin Transformer, loss function, Global attention mechanism

PDF Full Text Request

Related items

1	Damage Detection Algorithm Based On Object Shape And Attention Mechanism Characteristics
2	Research And Application On Ship Detection Algorithm Based On YOLOv7 And Attention Mechanism
3	YOLOv3 Remote Sensing Image Object Detection With Auxiliary Networks
4	Research On Optimization Of Object Detection Algorithm In Highway Congestion Based On Deep Learning
5	Rotating Object Detection In High Resolution Remote Sensing Images Based On Deep Learning
6	Research On Remote Sensing And Aerial Image Object Detection Based On Attention And Transformer Network
7	Research On Small Target Detection Method Of Remote Sensing Image Based On Residual Convolutional Network And TRANSFORMER Fusion
8	Transmission Line Target Detection Algorithm Based On Deep Learning
9	Research On Single-stage Remote Sensing And Aerial Image Object Detection
10	Research On Object Detection Method Of Remote Sensing Image Based On Deep Feature Fusion And Attention Mechanism