| With the development of the digital economy, informatization, cloud computing, AI, the industrial Internet, and other new technologies in China, the steady progress of the national "Internet Plus" and "cloud computing and big data" strategies, and the spread of information technology and the mobile Internet across all industries, data volumes have surged. This has driven demand for data processing, storage, and transmission capacity, and the Internet Data Center (IDC) industry has developed rapidly over the past decade. At present, data centers in China and abroad still face many problems, such as poor availability of their infrastructure in operation. Business interruptions caused by data center failures occur frequently, and the resulting impact and economic losses to enterprises and even society are severe. How to verify whether a data center is running safely and stably, and whether its daily operation and maintenance (O&M) management is adequate, has become a focus of attention in the industry, and a quantitative index is needed to evaluate these questions. Availability has therefore become the key evaluation index for data centers. Improving availability allows a data center to run more safely and reliably, which enhances its core competitiveness. This paper focuses on optimizing the O&M management system with the goal of improving data center availability. The research analyzes the factors that affect data center availability and, based on the actual operation of the J data center infrastructure, optimizes the current O&M management system. The paper draws closely on the current operation and management situation of J data center and analyzes that situation for each key element. The main content covers the equipment maintenance management, equipment inspection management, emergency management, and supplier management processes in the J data center O&M management system. On this basis, the operating status of the data center before and after the optimization is compared along two dimensions, MTTF (mean time to failure) and MTTR (mean time to repair), to evaluate how effectively the optimization improves availability. The conclusions are as follows: infrastructure failure data of J data center for 2019 were compared with the data for 2020, and quantitative analysis was used to evaluate the effectiveness of the optimization work. The results show that, after optimization of the infrastructure management, inspection management, emergency management, and after-sales service management systems, MTTF is effectively increased, MTTR is reduced, and the availability index of the data center facilities improves significantly. |
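
The abstract evaluates availability along the MTTF and MTTR dimensions but does not state how the two combine. A minimal sketch follows, assuming the standard steady-state relation availability = MTTF / (MTTF + MTTR). The `PeriodStats` type, the function names, and all numbers are hypothetical illustrations and are not taken from the paper's data.

```python
from dataclasses import dataclass


@dataclass
class PeriodStats:
    """Operating record for one observation period (hypothetical values)."""
    uptime_hours: float   # total hours the facility ran normally
    repair_hours: float   # total hours spent restoring service
    failures: int         # number of failure events in the period


def _check(stats: PeriodStats) -> None:
    # The per-failure averages are undefined without at least one failure.
    if stats.failures <= 0:
        raise ValueError("at least one failure is required to compute a mean")


def mttf(stats: PeriodStats) -> float:
    """Mean time to failure: average uptime per failure event."""
    _check(stats)
    return stats.uptime_hours / stats.failures


def mttr(stats: PeriodStats) -> float:
    """Mean time to repair: average repair time per failure event."""
    _check(stats)
    return stats.repair_hours / stats.failures


def availability(stats: PeriodStats) -> float:
    """Steady-state availability, assumed here to be MTTF / (MTTF + MTTR)."""
    m_f, m_r = mttf(stats), mttr(stats)
    return m_f / (m_f + m_r)


# Illustrative numbers only; they are NOT the J data center figures.
before = PeriodStats(uptime_hours=8740.0, repair_hours=20.0, failures=10)
after = PeriodStats(uptime_hours=8778.0, repair_hours=6.0, failures=4)

for label, s in (("before", before), ("after", after)):
    print(f"{label}: MTTF={mttf(s):.1f} h, MTTR={mttr(s):.1f} h, "
          f"availability={availability(s):.5f}")
```

Under this relation, a longer MTTF and a shorter MTTR each raise availability independently, which is why a before/after comparison can report both dimensions separately.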