Data deduplication is widely used in cloud storage because it mitigates the storage pressure on cloud servers. Among the many deduplication schemes, client-side cross-user deduplication outperforms the others due to its low space and bandwidth consumption. In this scheme, before uploading a file the client sends the file hash to the cloud server to determine whether the file has already been outsourced by another user, and the server returns a Yes or No response indicating the file's existence. However, this response can serve as a side channel and be exploited by adversaries to compromise data privacy. In particular, when an attacker already knows most of a file's content, he can recover the rest through a brute-force learn-the-remaining-information (LRI) attack. In practice, the popularity distribution of data in the cloud is highly skewed: popular data is the main source of redundancy, while sensitive data is concentrated among unpopular data. Existing deduplication schemes and LRI mitigation schemes are not well suited to this distribution, so how to deduplicate effectively and protect data security at low cost under such a distribution becomes a new problem. To solve this problem, this paper makes the following contributions.

(1) To achieve efficient deduplication of unevenly distributed data, this paper proposes a novel Bloom filter variant named the popularity dynamic bloom filter (PDBF), which incorporates data popularity into the Bloom filter. A PDBF-based deduplication scheme is then constructed to perform different degrees of deduplication depending on how popular a datum is: high-accuracy deduplication for popular data with high redundancy, and low-accuracy deduplication for unpopular data with low redundancy. The experiments demonstrate that the scheme makes an excellent tradeoff among computational time, memory consumption, and deduplication efficiency.

(2) To mitigate the LRI attack at lower cost, this paper proposes a variable randomized redundant chunk scheme (VRCS). The main idea behind VRCS is to provide more fine-grained protection based on data popularity. It focuses on protecting the sensitive chunks of unpopular files by calculating file popularity from chunk popularity and variably adding random redundant chunks to obscure the real deduplication status of files. In addition, a VRCS prototype was evaluated on a real-world dataset; the experiments demonstrate that VRCS achieves better bandwidth efficiency than existing works with no change in security.
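The popularity-aware idea behind PDBF can be illustrated with a minimal sketch. The class names, filter sizes, and the upload-count proxy for popularity below are illustrative assumptions, not the paper's exact construction; the sketch only shows the core tradeoff of tracking popular fingerprints in a large, accurate filter and unpopular ones in a small, coarse filter.

```python
import hashlib

class SimpleBloomFilter:
    """Basic Bloom filter using double hashing derived from SHA-256."""
    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        digest = hashlib.sha256(item.encode()).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big")
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all((self.bits[pos // 8] >> (pos % 8)) & 1
                   for pos in self._positions(item))

class PopularityBloomFilter:
    """Hypothetical sketch of the PDBF idea: popular fingerprints go into
    a large filter with a low false-positive rate (high-accuracy dedup);
    unpopular ones into a small, coarse filter (low-accuracy dedup)."""
    def __init__(self, popularity_threshold=3):
        self.accurate = SimpleBloomFilter(num_bits=1 << 16, num_hashes=7)
        self.coarse = SimpleBloomFilter(num_bits=1 << 10, num_hashes=2)
        self.counts = {}  # upload counts serve as a popularity proxy here
        self.threshold = popularity_threshold

    def add(self, fingerprint):
        self.counts[fingerprint] = self.counts.get(fingerprint, 0) + 1
        if self.counts[fingerprint] >= self.threshold:
            self.accurate.add(fingerprint)   # popular: high-accuracy dedup
        else:
            self.coarse.add(fingerprint)     # unpopular: low-accuracy dedup

    def probably_duplicate(self, fingerprint):
        return fingerprint in self.accurate or fingerprint in self.coarse
```

Because unpopular data carries little redundancy, a higher false-positive rate on the coarse filter costs few missed deduplication opportunities while saving most of the memory, which is the tradeoff the abstract describes.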
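The VRCS idea of variably padding unpopular files with redundant chunks can also be sketched. The popularity threshold, the min-based derivation of file popularity from chunk popularity, and the redundancy bound below are illustrative assumptions, not the paper's exact parameters.

```python
import random

def file_popularity(chunk_popularities):
    """Derive file popularity from its chunks; here the minimum is used,
    so a single rare chunk makes the whole file count as unpopular."""
    return min(chunk_popularities)

def chunks_to_upload(duplicate_flags, chunk_popularities,
                     popularity_threshold=5, max_redundant=4, rng=random):
    """Return indices of chunks the client actually uploads.

    Non-duplicate chunks are always uploaded.  For an unpopular file, a
    random number of chunks the server already holds are re-uploaded as
    redundant traffic, so an observer cannot infer the file's true
    deduplication status from the upload volume."""
    upload = [i for i, dup in enumerate(duplicate_flags) if not dup]
    if file_popularity(chunk_popularities) < popularity_threshold:
        duplicates = [i for i, dup in enumerate(duplicate_flags) if dup]
        k = rng.randint(0, min(max_redundant, len(duplicates)))
        upload.extend(rng.sample(duplicates, k))
    return sorted(set(upload))
```

Popular files skip the padding entirely, which is how the scheme keeps bandwidth overhead lower than approaches that randomize the response for every file.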