Research On Automatic Vocal Transcription Of Chinese Popular Music

Posted on:2021-03-20

Degree:Master

Type:Thesis

Country:China

Candidate:J Zheng

Full Text:PDF

GTID:2415330623969210

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Vocal transcription,as an important branch of transcription tasks,has gradually attracted the attention of the scholars in the field of music computing in recent years.However,due to the instability of vocal pronunciation and the lack of a large music dataset which has high-accuracy vocal onset annotation,the onset detection(one of the critical steps in transcription)of vocal in popular music is much more difficult than that of musical instruments.Vocal transcription is therefore limited and cannot be effectively applied in practice.In view of this,we study the automatic vocal transcription in Chinese popular music.The main research contents and results are as follows:1)We propose a sentence segmentation algorithm based on voice activity detection.The system uses this algorithm to cut music into sentences intelligently to meet the time length requirement of the onset detection model,and avoid words being cut.2)We train a high precision vocal onset detection model on a speech dataset to accurately recognize the time of the note onset required for transcription task.The UNet network is introduced into the vocal onset detection task for the first time.An input format optimization strategy is proposed to convert the input single-channel spectrogram into multi-channel spectrogram,and the extreme imbalance of sequence data is resolved through positive sequence radiation and Dice Loss.3)A small Chinese pop music test dataset with vocal onset annotation is constructed and open sourced,and the model trained on the speech dataset is transferred to a real music scene for testing.At the same time,we propose a reliability filtering layer and a breath sound filtering layer to optimize the recognition effect after transferred,so that the model can achieve a good onset recognition effect in real music scene.4)We propose a note block pitch selection algorithm based on nearest neighbor name matching.After optimizing the Harvest pitch recognition algorithm to increase its running speed to 10 times,we use this algorithm to calculate the representative pitch of each note.In summary,we propose a complete set of vocal transcription solutions that can intelligently identify onset time and pitch information,and finally generate a MIDI-file which contains the accuracy music sheet without the help of lyrics.

Keywords/Search Tags:

Automatic Music Transcription, Vocal Onset Detection, U-Net, Pitch Recognition, Chinese Pop Music

PDF Full Text Request

Related items

1	Research And Implementation Of A CNN-based Piano Music Transcription Algorithm
2	Research And Implementation Of A CNN-based Polyphonic Piano Transcription Algorithm
3	Research On Automatic Singing Transcription System:from Singing Signal To MIDI Files
4	Research On Automatic Transcription Algorithm Of Piano Music Based On CNN-HMM
5	Research And Implementation Of A Vision-based Piano Transcription System
6	Research On Music Denoising And Automatic Transcription Based On Deep Learning
7	Separation And Automatic Transcription Of Piano Music In Complex Environments
8	Concatenative Music Synthesis by Note Separation using Onset Velocity Detection and Statistical Models
9	Research On Detection Technology Of Piano Music Signals
10	Research On Recognition And Error Detection Technology For Piano Playing Music