Please use this identifier to cite or link to this item:
http://10.1.7.192:80/jspui/handle/123456789/202
Title: | Pre-Transform Scanning for Video Compression |
Authors: | Thaker, Trushal |
Keywords: | Computer 2006 Project Report 2006 Computer Project Report Project Report 06MCE 06MCE017 |
Issue Date: | 1-Jun-2008 |
Publisher: | Institute of Technology |
Series/Report no.: | 06MCE017 |
Abstract: | Most video compression algorithms today use block based coding. An image or video frame data is coded one block at a time. A block of data (or residual data block in inter frames) is de-correlated using transform coding to exploit spatial redundancies in the data blocks. Transforms such as 2D DCT are used to de-correlate the video/image data to improve compression performance. The transforms used have a strong energy compaction property; after a transform is applied, most of the signal information is concentrated in a few low-frequency coefficients and results in fewer non-zero coefficients. Fewer non-zero coefficients lead to a lower bitrate and higher compression. In this thesis we have explored novel ways of improving the correlation of the data in a block in order to reduce the number of non-zero coefficients. The number of non-zero coefficients in the transformed data decrease with the increase in spatial correlation in the input data. Similarities among neighboring blocks are exploited by permuting block data using the sorting permutation of the neighbors to improve the correlation. We show that this permutation based approach reduces the number of non-zero coefficients significantly and improves compression. Performance of this method is demonstrated in an implementation in Xvid MPEG-4 video encoder. From the results we were able to come to certain conclusions regarding the Peak Signal to Noise Ratio (PSNR) and the average number of non zero coefficients (NZC) after transformation. We were able to decrease the average NZC per frame but with loss in PSNR. Consistency was maintained as the approach was extended and implemented for various approaches and also implementation in the Xvid MPEG-4 encoder. Here in the Xvid Mpeg-4 encoder implementation we also concluded that we need a different Variable length Coder (VLC) that supports this approach as the final bitstream resulting out of the VLC was increasing in size. From the study of the various results from all the different approaches we came to a conclusion that we need to have a new VLC for this approach as concluded from the Xvid implementation results. The proposed approach can also be extended and explored as a post transform scanning approach to see if this approach can be made to loose less PSNR. |
URI: | http://hdl.handle.net/123456789/202 |
Appears in Collections: | Dissertation, CE |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
06MCE017.pdf | 06MCE017 | 1.89 MB | Adobe PDF | ![]() View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.