| With the development of computer technology,data plays an important role in daily life.Data backup becomes an effective means of data protection.Remote file synchronization technology has been widely used in data backup and file synchronization,rsync(remote sync)is one of the most widely used remote file synchronization application,and its rsync algorithm can detect duplicate data between different versions of files,and synchronizes files more quickly.In view of the defects of rsync algorithm in practical application,this research proposes an improvement algorithm called Amsync(Adaptive multi-round file synchronization).It improves many aspects of rsync algorithm,as follows:(1)for the rsync algorithm,the length of chunks is single,and the chunking parameters have a great impact on the performance of the algorithm.Amsync changes the method of chunking,using CDC(Content Defined Chunking)algorithm to get chunks with variable length,reducing the sensitivity of the algorithm to the block parameters;(2)for rsync algorithm can only dig the duplicate chunks of the same file,Amsync takes a synchronization queue to mine duplicate chunks between different files,reducing the amount of data transmission during synchronization;(3)for the unbalance computation of rsync algorithm,Amsync changes the difference calculation process,making the slide window takes a distance slide in units of a block length.At the same time,in order to further optimize the chunking efficiency and computational load of the Amsync algorithm in the chunking stage,an improved algorithm WEV(Window Extreme Value)is proposed,which changes the way to determine the boundary of the block.Improve the throughput and effectively reduce the amount of calculations while chunking.Through a large number of files in the chunking experiment,the results show that the WEV algorithm is significantly superior to the CDC algorithm in terms of the cut-through throughput and CPU load.The Amsync algorithm improves the synchronization performance of the rsync algorithm.The synchronization time is reduced by about 10%compared with the rsync algorithm,and the server-side computational load is effectively reducedThis study applies the Amsync algorithm to the data sharing module of the BIM(Building Information Modeling)construction management system.Through the strategy,the files in the specified folder are automatically synchronized to other hosts,making the way of file sharing more convenient and satisfying the daily use requirements.The research results of the thesis have a good reference for remote data backup,multi-terminal file synchronization,deduplication and other aspects. |