Research And Application Of Entity Analysis And Automatic Coding Technology For Medical Data

Posted on:2020-09-02

Degree:Master

Type:Thesis

Country:China

Candidate:X F Hou

Full Text:PDF

GTID:2404330602968351

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

In recent years,with the popularization of the domestic Electronic Health Records system,the accumulation of medical texts has increased.The medical texts contain a large number of important information of patients,such as diseases,symptoms,diagnosis and treatment,etc.These data play an important role in the subsequent related work,such as disease analysis and disease prevention.Therefore,the mining and analysis of Electronic Health Records have been received more and more attention in the field of natural language processing.The information on Electronic Health Records is stored in text form,and the terms of disease and symptom descriptions are not uniform due to the doctor’s personal habits during writing the medical record,which will lead to the errors in the work of docking medical expense payment systems,medical data statistics and so on.Therefore,it is important to map clinical text data to a standard terminological database,that is,to represent text in code.This thesis studies the entity analysis and automatic coding of medical data.The specific research contents are as follows:1)A clinical text entity recognition method based on Att-Bi-LSTM-CRF is proposed.This method incorporates Chinese word embeddings with stroke n-gram information(cw2vec)into Bi-directional Long Short-Terms Memory(Bi-LSTM)network and uses an attention mechanism to determine how much information to use.Finally,in order to make the prediction label more reasonable,the conditional random field(CRF)is used for labeling.2)A short text clustering method based on convolutional neural network and K-means is proposed.The short text data of the disease is simple to express,so this thesis expands the short text data by the external ICD-10 terminological database,and the word2 vec learns the expanded short text representation,then uses the convolutional neural network to learn the deep feature representation and realizes the clustering through K-means.3)An automatic disease coding method based on deep learning and examples is proposed.This thesis merges multi-method,including deep learning,similarity calculation,example-based comparative table.The neural network learns the mapping relationship between text and coding from the training data to realize the coding prediction.The similarity calculation based on TF-IDF is used to select the coding that similarity with the disease.The example-based method is used to solve the problematic coding.The experimental results prove that the method proposed in this thesis is effective.For the disease or diagnosis description in medical data,the accuracy of the entity recognition method based on deep learning model is about 82%.The expansion of disease short text,convolutional neural network and traditional K-means algorithm can complete the short text clustering of disease.The deep learning method solves the most frequently used coding in hospital diagnosis,the similarity calculation and the example-based comparative table solve the coding that is infrequent and difficult to judge in the hospital.By combining deep learning and example-based methods,the coding types are covered as much as possible,and the accuracy of automatic disease coding is improved.At last,this thesis describes the exiting problem and the further research plans.

Keywords/Search Tags:

Medical Data, Clinical Text Named Entity Recognition, Short Text Clustering, Automatic Coding, Deep Learning

PDF Full Text Request

Related items

1	Medical Text Information Extraction Based On Deep Learning
2	Research On Named Entity Recognition And Entity Relationship Extraction Of Medical Data Text Based On Attention
3	GAN-based Named Entity Recognition For TCM Text
4	Medical Text Named Entity Recognition Based On Improved Sequence Labeling Model
5	Named Entity Recognition In Chinese Medical Text Based On Lattice LSTM
6	Named Entity Recognition In Medical Field Based On Deep Learning Of Chinese
7	Research On Knowledge Extraction Technology For Chinese Medical Text
8	Research On Information Extraction Technology For Medical Text
9	Research And Realization Of Medical Case Automatic Generation Based On Named Entity Recognition
10	Research On Named Entity Recognition Technology For TCM Field