Research On Printed Score Recognition Method Based On Deep Learning

Posted on:2021-09-30

Degree:Master

Type:Thesis

Country:China

Candidate:X Jia

Full Text:PDF

GTID:2505306470470034

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Optical Music Recognition(OMR)is an important way to realize the digitization of musical score images.It has wide application prospects in computer music,digital music library,computer-assisted music teaching and other fields.Recognition of musical symbols is an important research content of optical score recognition.At present,there are some difficulties in music symbol recognition: the structure of the music score is very complicated,the size of the music symbol is extremely small,and some symbols are too dense(chords),etc.,which leads to low recognition accuracy of the music symbol.With the rapid development of artificial intelligence technology,deep learning algorithms have been tried and achieved preliminary results in the field of OMR,which has opened up a new path for OMR research.This article takes printed musical scores as the research object and researches the recognition of musical symbols based on deep learning algorithms.The specific research content is as follows:Firstly,for the problem that the current OMR data set has incomplete labeling information and can not recognize the duration and pitch of the note,study and generate a dataset containing pitch and duration tags.First of all,by analyzing the MXL(Music XML)file,the score image and corresponding label data required for model training are obtained;then,the duration label of the note is obtained by connecting the information such as the beam,the dot,and the flag,and the pitch is parsed from the position information of the note Labels;Finally,data enhancement methods such as noise and random cropping are added to increase data diversity.Secondly,to solve the problems of OMR recognition process is complicated and recognition accuracy is not high,the end-to-end printed music score recognition model based on YOLOv3 is studied.The model takes the entire score image as input,and directly outputs the symbol duration,pitch and classes.The model adopts multi-task learning,learning the classification tasks of pitch and duration and the regression tasks of note coordinates,which improves the recognition speed of the model.The model of this paper is verified by the score image testset.The experimental results show that the accuracy of note recognition is high,which can reach the accuracy of duration value of 0.96 and the accuracy of pitch of 0.98,which is higher than theaccuracy of note recognition of other score recognition models.Thirdly,in view of the difficulty of detecting polyphony(chords)in the score image,polyphonic score image recognition based on Fine-Tuning technology is studied.First,manually label an appropriate amount of score images as a polyphonic recognition data set;then add additional regression branches for specific symbols(tuplets)to improve the recognition accuracy of tuplets.Finally,combined with the neural network fine-tuning technology,the end-to-end printed music score recognition model based on YOLOv3 is fine-tuned to improve the accuracy of polyphonic music score recognition.The experimental results show that the recognition accuracy of polyphony and tuplet is greatly improved while keeping the symbol recognition accuracy unchanged.

Keywords/Search Tags:

Optical Music Recognition, Deep Learning, End-To-End Model, Note Recognition

PDF Full Text Request

Related items

1	Research On Printed Score Recognition Technology Based On Deep Learning
2	Deep Learning Algorithm Composition System Based On Music Score Recognition
3	Research Of Key Technologies Of Music Intelligent Sparring Partner
4	Humming Audio Spectrum Recognition Based On Deep Learning
5	Real-Time Music Beat Recognition Based On Deep Learning
6	Research Of ChuNom Image Recognition Algorithm Based On Deep Learning
7	Research And Application Of Water Book Image Recognition Algorithm Based On Deep Learning
8	Research On Recognition Method Of Calligraphy Stylein Deep Belief Network Driven By Multi-Volume Layer
9	Research Of Stave-Aware Optical Music Recognition Technology For Camera-Based Scenarios
10	Design Of Speech Recognition System For Northern Shaanxi Dialect Based On Deep Learning