| With the expansion of the continuous development of science and technology, technical level of the field of their respective areas put forward higher requirements, medical field as people living in the world today is inseparable part, causing the more widespread attention. For massive medical data, how to get the most valuable, useful data content is the main content of this topic, that is, unstructured data structured processing. Recalling the past, doctors use medical records of patients with data is used simple text editor to achieve, due to the compiler to use simple, easy to get started and other effects, has received the most health care workers are ignorant of the green, but it can’t meet the medical document the diversity of needs.This paper according to the structure characteristics of the medical data combined with data processing tools and the data of structured design idea, divide the system into three core modules for data processing, the three modules respectively is: the data processing module, index name extraction module, and data structure module, according to the features of the above three modules of the specific operation. This paper uses a combination of the Chinese Academy of Sciences of the segmentation tool of data for cleaning, in combination with refers to entitle the extraction, the introduction of the a custom index name extraction library, strengthen the vast amounts of data processing and analysis, combined with the weighted probability of computing technology, to enhance the reliability of the extracted data, finally the input data according to the module have been extracted from the matching, and improve the efficiency of medical services in the field of, strengthen the benefits brought about by the development of science and technology, this is the subject to study the ultimate goal. |