Font Size: a A A

A Survey On The Coverage Rate Of Chinese Medical Terms Resources To Medical Documents In The Real World

Posted on:2021-03-31Degree:MasterType:Thesis
Country:ChinaCandidate:Y ChengFull Text:PDF
GTID:2504306308482944Subject:Biomedical engineering
Abstract/Summary:PDF Full Text Request
Background:With the rapid development of internet and information technology,there are increasingly comprehensive and deeper application of big data and artificial intelligence,which speeds up health informationization development in China.China holds extremely large amounts of healthcare big data resources,but in complex forms and diverse expressions.This data feature has hindered the application of healthcare big data resources.Fortunately,a large number of Chinese medical terms resources have been launched.It definitely lays an excellent foundation for the integration and application of healthcare big data.Objective:By counting the number of same terms recorded in Chinese medical terms resources and the real-world medical documents,this dissertation is aimed to explore and measure the represent ability of the Chinese medical terms resources to medical documents in the real world.Methods:This research systematically collected 15 Chinese medical terms resources including national standards and medical dictionaries,and also collected 6 types of real-world data such as guidelines,electronic medical records and any other types of text.From these,we extracted a series of medical terms including diseases and symptoms.Then a bilingual mapping process of medical terms with the internationally authoritative UMLS ontology system as the core has been constructed.That is to map Chinese medical terms to UMLS ontology by Chinese mapping and English mapping.Finally,two methods,directly string matching and UMLS guided synonymous matching,were applied to calculate coverage rate of Chinese medical terms resources to medical documents in real world.Results:Among the 69865 medical terms collected from the medical documents in real world,22183(31.75%)terms in it can be matched with Chinese medical terms resources.From the samples we studied,the coverage rate of common clause is the highest,reaching 74.29%.During the investigation,we found two critical factors affecting term coverage rate:(1)UMLS ontology can improve the coverage rate of medical terms;(2)The special forms of medical terms will reduce the coverage rate in some extent.Conclusions:Through this investigation,we found that the resources of Chinese medical terms were relatively abundant,which have enough represent ability in terms of the Chinese clinical documents.In order to promote the development and improvement of standard Chinese medical terms and ontology,we should give priority to integrate and promote of Chinese medical terms resources,rather than just rely on translation of the foreign medical ontology.
Keywords/Search Tags:Healthcare big data, Chinese medical terms resources, Terminology coverage, Real world study, Unified Medical Language System mapping matching
PDF Full Text Request
Related items