Research On Named Entity Recognition For Chinese Electronic Medical Records

Posted on:2023-08-09

Degree:Master

Type:Thesis

Country:China

Candidate:J D Lu

Full Text:PDF

GTID:2544306914973539

Subject:Software engineering

Abstract/Summary:

With the rapid development of computers and artificial intelligence, natural language processing technology has taken on new life and is gradually penetrating various fields. Medical care is an important field closely related to people’s lives, and named entity recognition in Chinese electronic medical records (EMRs) is both an active and a difficult area of research. Many problems remain in this research. First, because of cost and patient privacy concerns, the datasets available for training are too small to yield a model with high accuracy and robustness. Second, Chinese EMRs are highly complex, and current single-task models do not generalize across different types of EMR datasets. To address these problems, this paper first reviews the major domestic and international named entity recognition techniques in the medical domain and points out their problems. On this basis, it constructs a new batch of medical named entity recognition datasets using the electronic medical record resources of tertiary hospitals. It then analyses the performance of three baseline models with different structures, BiLSTM-CRF, Transformer-CRF and BERT-CRF, and, in response to the existing problems, proposes a Chinese EMR named entity recognition model based on multi-task learning and transfer learning. Compared with the traditional single-task classification model, the innovations of this paper are as follows: (1) Introducing and improving multi-task learning methods. A multi-task structure sets up a separate decoder for each category of Chinese EMR dataset, so that all of the datasets can be fed into the model and trained jointly, which saves time and computer hardware resources. (2) Introducing and improving transfer learning methods. A shared encoder based on the BERT model enables potential common knowledge between different electronic medical records to be transferred and learned across datasets, and effectively prevents the model from overfitting and catastrophic forgetting. Finally, the experimental results demonstrate that the proposed algorithm has good compatibility and robustness, with better performance in terms of accuracy, recall and F1 score on all four datasets, especially for long-tailed data and small-scale datasets.
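
The shared-encoder, per-dataset-decoder structure described in the abstract can be illustrated with a minimal, dependency-free sketch. This is not the thesis implementation: the encoder below is a toy stand-in for BERT that derives a fixed pseudo-random vector for each token, the decoders are untrained linear layers with a per-token argmax rather than a CRF, and the class names, dataset names and label sets are all hypothetical. It only shows how one encoder can serve several dataset-specific heads.

```python
import random
from typing import Dict, List


class SharedEncoder:
    """Toy stand-in for the shared BERT encoder (assumption, not a real model)."""

    def __init__(self, dim: int = 8) -> None:
        self.dim = dim
        self._cache: Dict[str, List[float]] = {}

    def encode(self, tokens: List[str]) -> List[List[float]]:
        # One feature vector per token; the same token always maps to the same vector.
        return [self._embed(tok) for tok in tokens]

    def _embed(self, token: str) -> List[float]:
        if token not in self._cache:
            rng = random.Random(token)  # seeded by the token string itself
            self._cache[token] = [rng.uniform(-1.0, 1.0) for _ in range(self.dim)]
        return self._cache[token]


class TaskDecoder:
    """Dataset-specific head: untrained linear scores plus argmax (no CRF here)."""

    def __init__(self, labels: List[str], dim: int, seed: int) -> None:
        rng = random.Random(seed)
        self.labels = labels
        # One weight row per label in this dataset's own label set.
        self.weights = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in labels]

    def decode(self, features: List[List[float]]) -> List[str]:
        tags = []
        for vec in features:
            scores = [sum(w * x for w, x in zip(row, vec)) for row in self.weights]
            tags.append(self.labels[scores.index(max(scores))])
        return tags


class MultiTaskTagger:
    """One shared encoder, one decoder per dataset."""

    def __init__(self, label_sets: Dict[str, List[str]], dim: int = 8) -> None:
        self.encoder = SharedEncoder(dim)
        self.decoders = {
            name: TaskDecoder(labels, dim, seed=i)
            for i, (name, labels) in enumerate(label_sets.items())
        }

    def tag(self, dataset: str, tokens: List[str]) -> List[str]:
        features = self.encoder.encode(tokens)          # shared across all datasets
        return self.decoders[dataset].decode(features)  # routed to that dataset's head


# Hypothetical dataset names and label sets, for illustration only.
label_sets = {
    "admission_notes": ["O", "B-DISEASE", "I-DISEASE"],
    "discharge_summaries": ["O", "B-DRUG", "I-DRUG", "B-TEST", "I-TEST"],
}
model = MultiTaskTagger(label_sets)
tokens = list("患者头痛三天")  # character-level tokens
tags_admission = model.tag("admission_notes", tokens)
tags_discharge = model.tag("discharge_summaries", tokens)
```

Because the weights are random, the predicted tags carry no meaning; the point is the routing. In a trained version of this layout, batches from every dataset would update the one shared encoder while each batch updates only its own decoder, which is the mechanism the abstract credits for transferring common knowledge across datasets.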

Keywords/Search Tags:

named entity recognition, medical text processing, multi-task learning, transfer learning

Related items

1	Transfer Learning Based Named Entity Recognition On Electronic Medical Records
2	Study On Named Entity Recognition Model Of Cancer Patient Online Questioning Text Based On Transfer Learning
3	Study On The Phenotypic Extraction Method Of Clinical Records Based On Multi-task Learning
4	Research On Method Of Medical Named Entity Recognition Based On Pre-trained Model
5	Research Of Medical Named Entity Recognition And Development Of Electronic Medical Record Marking System
6	Research And Application Of Medical Entity Extraction Based On Multi-task Learning And Transfer Learning
7	Named Entity Recognition In Medical Field Based On Deep Learning Of Chinese
8	Medical Text Named Entity Recognition Based On Improved Sequence Labeling Model
9	Research On Chinese Medical Named Entity Recognition Combined With Active Learning
10	Medical Text Information Extraction Based On Deep Learning