Research On Thangka Image Object Detection Algorithm Based On Deep Learning

Posted on:2022-11-01

Degree:Master

Type:Thesis

Country:China

Candidate:G Y He

Full Text:PDF

GTID:2505306746451924

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

As a cultural relic treasure of the Chinese nation,Thangka images are related to religion,history,folk customs and many other aspects.Object detection on the headgear and seating of Thangka images can not only effectively help people understand the rich connotations contained in the images,but also provide a foundation for promoting a cultural power.Because Thangka images have the characteristics of complex background information,complex and diverse appearance of detection targets,and low image quality,higher requirements are put forward for target detection in Thangka images.The target detection algorithm based on deep learning has achieved fruitful results.However,due to the particularity of Thangka images,there is still room for improvement in detection accuracy and speed in the detection of headgear and seating.In view of the above problems,this paper selects the headgear and the seat in the Thangka image for detection research.The main research contents and innovations are as follows:(1)Build a Thangka image object detection dataset.Collect 883 Thangka images from the school library and the Academy of Fine Arts,and process them into a unified format.Secondly,use the wizard labeling assistant to label the headgear and seating of the thangka images in a rectangular frame.Finally,analyze the labeling data set and find that each sample Extremely unbalanced between categories.The Thangka image is enhanced by flipping,Gaussian noise and random occlusion,so as to improve the generalization ability of the detection algorithm in the Thangka image target detection.(2)In view of the characteristics of Thangka images with complex background information,complex and diverse appearance of detection targets,and low image quality,it is difficult for conventional detectors to effectively detect targets and there is a problem of missed detection.In this paper,based on the YOLOv5 detection algorithm,a target detection algorithm with dual optimization of hierarchical visual Transformer and attention mechanism is proposed.Firstly,the input image is sliced by the Focus module,and the sliced image features are extracted by the Swin Transformer.Secondly,the feature information of different layers is fused with the convolution kernel to obtain the multi-scale feature map.Finally,the self-attention mechanism is used to fuse the Neck module features are processed.The experimental results show that the detection accuracy of the improved YOLOv5 algorithm is 5.12%higher than that of the original YOLOv5 algorithm,and its detection accuracy is also better than other mainstream target detection algorithms.(3)Considering that the fixed anchor in the YOLOv5 algorithm damages the universality of the algorithm,and a large number of candidate frame generation also affects the performance of the detection algorithm.This paper further combines the characteristics of Thangka images,and takes Anchor free YOLOX as the baseline,and proposes an improved YOLOX Thangka image object detection algorithm.Firstly,the feature map extracted by the Darknet53 feature extraction network is fused using the coordinate attention mechanism,which strengthens the long-range dependency information and precise location information.Secondly,a new feature fusion method,Trans FPN,is proposed,which can perform better in feature fusion.It enriches the semantic information of global features and context features,and improves the accuracy of target detection.The experimental results show that the improved YOLOX algorithm has better detection accuracy,and the detection accuracy on Thangka images is improved by 6.4%.

Keywords/Search Tags:

Thangka target detection, YOLOv5, YOLOX, Swin Transformer, TransFPN

PDF Full Text Request

Related items

1	Research And Application Of Automatic Detection Of Thangka Elements Based On Deep Learning
2	Research And Implementation Of Thangka Image Object Detection Algorithm
3	Research On Image Detection And Retrieval Model Of Silk Cultural Relics Under The Framework Of Deep Learning
4	Research On Ancient Text Detection Based On Transformer
5	Research On Thangka Cultural Element Detection Based On Improved FCOS Algorithm
6	Research On Multi Scale Tibetan Thangka Dominant Object Detection Algorithm
7	Research On Few-shot Object Detection Algorithm For Thangka Image
8	Research On Thangka Image Description Generation Method Based On Deep Learning
9	Research On The Classification And Few Shot Object Detection Algorithm Of Thangka Yidam Based On Deep Learning
10	Target Detection And Emotional Memory In Different Learning Styles