| High-speed railway is an important mode of transportation in China,and the catenary system plays an important role in the high-speed railway power supply system.Therefore,the industry has put forward higher requirements for the reliability of power supply of catenary system of high-speed railway.At present,the operation and maintenance data of China’s high-speed railway catenary system is continuously recorded with the operation of the high-speed railway power supply safety detection and monitoring system(6C system).With the continuous increase of data volume and the continuous development of computer technology,the establishment of a fault prediction and health management system based on high-speed railway catenary has become an important breakthrough to improve its safety and reliability.In this paper,the high-speed railway catenary data is taken as the research object.And the coding rules of it are formulated.The data discretization method is used to verify the correctness and rationality of conceptual layering of numerical data based on the expert experience.And then the data cube algorithm has been introduced into the multi-dimensional query of high-speed railway catenary data in order to improve query efficiency.The purpose of data multi-dimensional fast query is realized.Mainly complete the following work:1)Research on data discretization methods.Through the analysis and comparison,the ChiMerge discretization algorithm is used to verify the correctness and rationality of conceptual layering of numerical data in high-speed railway catenary.Based on the data characteristics of the high-speed railway catenary data,the algorithm program is optimized.The experiment and analysis is carried out by the measured data of high-speed railway catenary over the years,which verifies the correctness and rationality of conceptual layering of numerical data based on the expert experience.2)Research on the data cube algorithm.Analyze and compare the advantages and disadvantages of the current main algorithms and the scope of application,combined with the characteristics of high-speed railway catenary current data,the Dwarf algorithm is finally introduced into the multi-dimensional searching of high-speed rail catenary data.And the purpose of data multi-dimensional fast query is realized is realized.3)In view of the data characteristics of the high-speed rail catenary and the shortcomings of the Dwarf algorithm,the author optimizes the storage structure of the Dwarf cube.And a condition for judging the uniqueness of the unit values in the node is added during the suffix merging process of the Dwarf cube to reduce the query response time and improve its efficiency of query data.Finally,the high-speed railway catenary measured datawas used for experiments.Through the experimental comparison with the original algorithm,the advantage of the optimized algorithm in query performance is verified. |