CUDA-based Inter-frame Prediction Optimization And Parallelization

Posted on:2015-10-15

Degree:Master

Type:Thesis

Country:China

Candidate:P C Wang

GTID:2308330452455848

Subject:Computer application technology

Abstract/Summary:

As the main module of H.264/AVC, inter-frame prediction is mainly used to remove temporal redundancy in video sequences and thereby improve the compression rate. The module as a whole is time-consuming, and its high resource utilization makes it the bottleneck for performance improvement. Meanwhile, with the growth of GPU computing capacity and the maturing of the CUDA (Compute Unified Device Architecture) platform, more and more compute-intensive applications have been migrated to the GPU. Because inter-frame prediction is the bottleneck of H.264, how to use CUDA to accelerate it has become a hot topic in video compression and high-performance computing.

Because of the high data dependency in traditional motion estimation algorithms, they are difficult to adapt to the CUDA SIMT computation model. On the other hand, experiments on different kinds of video data show a high correlation between the coding data of different layers. Based on this, we optimized the module in four respects: (1) reorganizing the workflow of the inter-frame prediction module to make it better suited to CUDA; (2) proposing a motion-tendency-oriented motion estimation algorithm to make full use of CUDA computing resources, which fast search algorithms underuse because of their strong data dependency; (3) proposing and implementing a preliminary search mechanism based on domain partitioning and sampled matching, to reduce the per-thread computing load and make full use of neighborhood information; (4) proposing and implementing a model merging algorithm based on inter-layer prediction, motivated by the inter-scale dependency of motion vectors.

The experimental results show that, compared with the full search algorithm, the adaptive iterative search algorithm achieves a 70~80 times speedup while keeping the coded-frame SNR loss under 0.5 dB. Compared with the mainstream full search algorithm, the proposed algorithm is not only more than 3 times faster but also achieves a better coding effect. Compared with the classic CUDA-based motion estimation algorithm, the ratio reaches about 20%, again with a coded-frame SNR loss under 0.5 dB.
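
For readers unfamiliar with the full search baseline that the results are measured against, the following is a minimal Python sketch of exhaustive block matching for a single block. It is not the thesis's adaptive algorithm. It assumes the sum of absolute differences (SAD) as the matching cost, which the abstract does not specify, and the function names `sad` and `full_search` are hypothetical. Frames are plain lists of rows of integer pixel values.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` whose
    top-left corner is (bx, by) and the block of `ref` displaced by (dx, dy).
    SAD is an assumed cost metric, chosen here for illustration."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def full_search(cur, ref, bx, by, n, search_range):
    """Test every displacement within +/- search_range and return
    (best_dx, best_dy, best_sad). Candidates that would read outside
    the reference frame are skipped."""
    height, width = len(ref), len(ref[0])
    # Start from the zero displacement, which is always in bounds.
    best = (0, 0, sad(cur, ref, bx, by, 0, 0, n))
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            rx, ry = bx + dx, by + dy
            if rx < 0 or ry < 0 or rx + n > width or ry + n > height:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if cost < best[2]:
                best = (dx, dy, cost)
    return best
```

Every candidate displacement is evaluated independently of the others, which is why full search parallelizes easily. Fast search algorithms instead pick each new candidate from the result of the previous step, and that sequential dependency is what the abstract describes as hard to fit to the SIMT model.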

Keywords/Search Tags:

Video Compression, CUDA, Inter-frame prediction, Adaptive motion estimation

Related items

1	Based On Mpeg-2 Digital Video Compression Technology
2	Research On Inter-Frame-Prediction Technology In Video Compression Based On VLSI Architecture
3	Research On Motion Estimation Algorithms In Video Compression
4	Optimization Research And Implementation Of Full Search Motion Estimation Algorithm Based On GPU Platform
5	Research Of Motion Estimation Algorithm For Video Compression Applications
6	Research On Algorithm And Application For New Generation Video Compression Standards
7	Research And Detailed Design On Inter-frame Prediction Module In Real-time H.264HD CODEC
8	Research On Video Prediction Based On H.264/AVC
9	Based Video Compression Standard H.264 Key Technology Research
10	Research On Video Compressed And Traffic Model In Wireless Multimedia Network