Rate-Distortion Techniques in Image and Video Coding

Due to the advancement of multimedia and its requirement of communication over the network, video compression has received much attention among the researchers. One of the popular video codings is scalable video coding, referred to as H.264/AVC standard. The major drawback in the H.264 is that it performs the exhaustive search over the interlayer prediction to gain the best rate-distortion performance. To reduce the computation overhead due to exhaustive search on mode prediction process, this paper presents a new technique for inter prediction mode selection based on the fuzzy holoentropy. This proposed scheme utilizes the pixel values and probabilistic distribution of pixel symbols to decide the mode. The adaptive mode selection is introduced here by analyzing the pixel values of the current block to be coded with those of a motion compensated reference block using fuzzy holoentropy. The adaptively selected mode decision can reduce the computation time without affecting the visual quality of frames. Experimentation of the proposed scheme is evaluated by utilizing five videos, and from the analysis, it is evident that proposed scheme has overall high performance with values of 41.367 dB and 0.992 for PSNR and SSIM respectively.

Download Full-text

Texture Grouping and Statistical Optimization Based Mode Prediction Decision Algorithm for Fast HEVC Intra Coding

10.21203/rs.3.rs-286035/v1 ◽

2021 ◽

Author(s):

Jianhua Wang ◽

Feng Lin ◽

Jing Zhao ◽

Yongbing Long

Keyword(s):

Computational Complexity ◽

Video Coding ◽

Rate Distortion ◽

Statistical Optimization ◽

Optimal Prediction ◽

Statistical Probability ◽

Decision Algorithm ◽

Texture Information ◽

Prediction Mode ◽

Probability Optimization

Abstract HEVC (High Efficiency Video Coding), as one of the newest international video coding standard, can achieve about 50% bit rate reduction compared with H.264/AVC (Advanced Video Coding) at the same perceptual quality due to the use of flexible CTU(coding tree unit) structure, but at the same time, it also dramatically adds the higher computational complexity for HEVC. With the aim of reducing the computational complexity, a texture grouping and statistical optimization based mode prediction decision algorithm is proposed for HEVC intra coding in this paper. The contribution of this paper lies in the fact that we successfully use the texture information grouping and statistical probability optimization technology to rapidly determine the optimal prediction mode for the current PU, which can reduce many unnecessary prediction and calculation operations of HCost (Hadamard Cost) and RDCost (Rate Distortion Cost) in HEVC, thus saving much computation complexity for HEVC. Specially, in our scheme, firstly we group 35 intra prediction modes into 5 subsets of candidate modes list according to its texture information of edge in the current PU, and each subset only contains 11 intra prediction modes, which can greatly reduce many traversing number of candidate mode in RMD (Rough Mode Decision) from 35 to 11 prediction modes; Secondly we use the statistical probability of the first candidate modes in candidate modes list as well as MPM selected as the optimal prediction mode to reduce the number of candidate modes in RDO(Rate Distortion Optimization), which can reduce the number of candidate modes from 3+MPM or 8+MPM to 2 candidate modes; At last, we use the number of candidate modes determined above to quickly find the optimal prediction mode with the minimum RDCost by RDO process. As a result, the computational complexity of HEVC can be efficiently reduced by our proposed scheme. And the simulation results of our experiments show that our proposed intra mode prediction decision algorithm based on texture information grouping and statistical probability optimization in this paper can reduce about 46.13% computational complexity on average only at a cost of 0.67% bit rate increase and 0.056db PSNR decline compared with the standard reference HM16.1 algorithm.

Download Full-text

Adaptive Downsampling Video Coding With Spatially Scalable Rate-Distortion Modeling

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2014.2302519 ◽

2014 ◽

Vol 24 (11) ◽

pp. 1957-1968 ◽

Cited By ~ 12

Author(s):

Ren-Jie Wang ◽

Chih-Wei Huang ◽

Pao-Chi Chang

Keyword(s):

Video Coding ◽

Rate Distortion

Download Full-text

Rate-distortion optimized video coding with stopping rules: quality and complexity

2004 International Conference on Image Processing, 2004. ICIP '04. ◽

10.1109/icip.2004.1419407 ◽

2005 ◽

Author(s):

M. Moecke ◽

Rui Seara

Keyword(s):

Video Coding ◽

Rate Distortion ◽

Stopping Rules

Download Full-text

A Novel Rate-Distortion Model for Leaky Prediction Based FGS Video Coding

Advances in Multimedia Information Processing - PCM 2004 - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30543-9_84 ◽

2004 ◽

pp. 673-680

Author(s):

Jianhua Wu ◽

Jianfei Cai

Keyword(s):

Video Coding ◽

Rate Distortion ◽

Distortion Model

Download Full-text

Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC

Scientific Programming ◽

10.1155/2020/8883214 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Jinchao Zhao ◽

Yihan Wang ◽

Qiuwen Zhang

Keyword(s):

Deep Learning ◽

Video Coding ◽

High Efficiency ◽

Texture Classification ◽

Rate Distortion ◽

Classification Model ◽

High Efficiency Video Coding ◽

Fast Encoding ◽

Training Samples ◽

Coding Unit

With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.

Download Full-text

Rate Distortion Analysis for Spatially Scalable Video Coding

IEEE Transactions on Image Processing ◽

10.1109/tip.2010.2051624 ◽

2010 ◽

Vol 19 (11) ◽

pp. 2947-2957 ◽

Cited By ~ 7

Author(s):

Rong Zhang ◽

Mary L Comer

Keyword(s):

Video Coding ◽

Scalable Video Coding ◽

Rate Distortion ◽

Scalable Video ◽

Distortion Analysis

Download Full-text