| Data security Reliable storage is crucial to the safe and stable operation of the production,transmission,transaction,consumption and analysis of the Energy Internet.The existing distribution storage technology based on Erasure Code(EC)has better improved the efficiency and reliability of data storage under the Energy Internet,and reduced storage overhead.However,when the energy Internet data is damaged for recovery,the EC-based storage mode causes high latency and high repair cost of reading downgrade during data recovery due to the consumption of network traffic and disk I/O.In order to solve the above shortcomings,combined with the frequency of energy Internet data interaction,this paper proposes A Lightweight Dynamic Storage Algorithm with Adaptive Encoding(LDSA-AE)based on adaptive code to balance storage overhead and data recovery performance.Experiments have verified the effectiveness of the algorithm,and the algorithm proposed in this paper is better than other comparative algorithms in terms of reliability,storage overhead,and repair throughput.The main work of this paper includes the following points:First of all,in order to improve the performance and storage efficiency of erasure code,this paper proposes an erasure code adaptive system parameter optimization and tuning suitable for distribution storage.At the same time,for the erasure code under the distribution dynamic cloud storage system,including the parameter constraints of RS code,CRS code and MDR code,a comprehensive algorithm and rule parameter self-tuning strategy is proposed to optimize the parameters to meet the actual storage requirements of the energy Internet and maintain the optimal state of system performance.Secondly,in order to quickly distinguish between active and inactive data of the Energy Internet and the impact of the adaptive dynamic storage algorithm proposed in this paper,this paper proposes A Density Clustering-Based Unsupervised Automatic Clustering Algorithm(DCUACA).For inactive data,an inactive data storage mechanism based on CRS code is proposed to implement low overhead and high reliability of inactive data storage and recovery.For active data,an active data storage mechanism based on replication strategy and MDR code is proposed,and the implementation of active data storage and recovery is low latency,low overhead,high throughput and high reliability.Finally,in order to evaluate the performance of the LDSA-AE algorithm,this paper designs and implements a massive data distributed storage management system based on Hadoop with high reliability,security and support for highly concurrent access.The design and performance testing of each sub-module are described in detail,and then the test results are analyzed,and finally the advantages and disadvantages of the client side design of the prototype system are evaluated. |