Research And Implementation Of Strategies To Optimize The SSD-Based Key-Value System

Posted on:2015-08-04

Degree:Master

Type:Thesis

Country:China

Candidate:Y Z Liu

Full Text:PDF

GTID:2348330509960637

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The rapid development of Internet technology gave birth to massive data, more and more data-centric applications penetrated into every aspects of people’s life. These applications put forward higher requirements on storage systems. Among them, how to establish an efficient index for these data became a big challenge. Key-Value system is an effective solution to establish an index of massive data, it has features of low operational latency and high throughput. However, the access to the underlying storage device of Key-Value system is mainly random access, disk-based Key-Value system is sure to have poor performance. In contrast, flash-based SSD has good random access performance and is an ideal storage device to construct the Key-Value system. This paper mainly studies the lookup optimization strategies of SSD-based Key-Value system, the main work and innovations are as follows:First, we propose a read and write optimization mechanism that bases on chain compression, for the problem of the poor compression performance of the SSD-based Key-Value System called SkimpyStash. Compression activity can write a chain of Key-Value pairs that in different SSD pages to the same page, the strategy can reduce compression times by adding a compression counter in the hash table directory, so as to effectively reduce the impact of compression activity on the client. Experiments show that the total compression times reduce 10% to 34% on average. In addition, we test the effect of compression activity on the average lookup time, pointing out the importance of a reasonable compression parameter.Secondly, we propose a Get optimization strategy based on hot data identification and relevance detection, for the problem of massive Key-Value pairs have different access heat and some of the Key-Value pairs have relevance in the whole system. The strategy can reduce the read operations of SSD by putting the hot data in the head of the chain and putting the relevant data in the same page, so as to effectively reduce the time of Get on the hot Key-Value pairs and relevant Key-Value pairs. Experiments show that the hot data identification strategy can reduce the SSD read operations by 6%, the relevance detection strategy can reduce more than 10%.Finally, we propose using file mapping mechanism to reduce the access of SSD, for the problem of the high throughput requirement in the Key-Value system, so as to increase the system throughput and reduce access latency. The mechanism will map a certain size of file into the memory when there is a read or write operation. Experiments show that the optimization can reduce the average lookup time by 21% to 30%.

Keywords/Search Tags:

Key-Value System, SSD, Compression, hot data identification, relevance detection, file mapping

PDF Full Text Request

Related items

1	Research On File Type Identification Technology
2	Bank Payment System File Conversion Module Design And Implementation
3	Research And Design Of NAND FLASH File System Based On VxWorks Platform
4	Target Identification Of MMW Active/Passive Compounding Short-Range Detecting System
5	Research On JPEG Re-compression Detection And Anti-detection Based On Header File Modificatio
6	Research And Application On File Fragments Identification And Reassembly Technology
7	Research And Implementation Of Flash-based Storage Systems By Co-Designing File System And FTL
8	Research On Fundamental Theory And Optimization Of Differential Compression
9	Research Of Dynamic Detection System For Web Based File Management
10	Application Research In V-SLAM Loop Clusure Detection System With Deep Model Compression