Detection of Transcoding from H.264/AVC to HEVC Based on CU and PU Partition Types

Symmetry ◽  
2019 ◽  
Vol 11 (11) ◽  
pp. 1343 ◽  
Author(s):  
Zhenzhen Zhang ◽  
Changbo Liu ◽  
Zhaohong Li ◽  
Lifang Yu ◽  
Huanma Yan

High Efficiency Video Coding (HEVC) is a popular video coding standard worldwide due to its high coding efficiency. For profit, forgers prefer to transcode videos from earlier standards such as H.264/AVC to HEVC. To deal with this issue, an efficient method is proposed to expose such transcoded HEVC videos based on coding unit (CU) and prediction unit (PU) partition types. CU and PU partition types are two syntax elements unique to HEVC that reflect a video’s compression history. In this paper, the CU and PU partition types of I pictures and P pictures are first extracted. Their mean frequencies are then calculated and concatenated as distinguishing features, which are fed to a support vector machine (SVM) for classification. Experimental results show that the proposed method identifies transcoded HEVC videos with high accuracy and is strongly robust against frame-deletion and shifted Group of Pictures (GOP) structure attacks.
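The feature construction described above can be sketched in a few lines. The partition-type alphabets and the `partition_feature_vector` helper below are illustrative assumptions, not the authors' exact feature set; the SVM training step is omitted:

```python
from collections import Counter

# Hypothetical alphabets: HEVC CU depths (0..3, i.e. 64x64 down to 8x8)
# and a subset of PU partition types, for illustration only.
CU_DEPTHS = [0, 1, 2, 3]
PU_TYPES = ["2Nx2N", "2NxN", "Nx2N", "NxN"]

def mean_frequencies(symbols, alphabet):
    """Normalized histogram of observed `symbols` over a fixed `alphabet`."""
    counts = Counter(symbols)
    total = max(len(symbols), 1)
    return [counts[a] / total for a in alphabet]

def partition_feature_vector(i_cu, i_pu, p_cu, p_pu):
    """Concatenate mean CU-depth and PU-type frequencies of I and P pictures,
    mirroring the paper's idea of a joint distinguishing feature vector."""
    return (mean_frequencies(i_cu, CU_DEPTHS) + mean_frequencies(i_pu, PU_TYPES)
            + mean_frequencies(p_cu, CU_DEPTHS) + mean_frequencies(p_pu, PU_TYPES))

# Toy example: such vectors would be fed to an SVM, with transcoded clips
# expected to show a different partition-type distribution than native ones.
fv = partition_feature_vector([0, 0, 1], ["2Nx2N", "2Nx2N"], [1, 2], ["2NxN"])
```

Each of the four normalized histograms sums to 1, so the 16-dimensional vector is scale-independent of clip length.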

2020 ◽  
Vol 10 (2) ◽  
pp. 496-501
Author(s):  
Wen Si ◽  
Qian Zhang ◽  
Zhengcheng Shi ◽  
Bin Wang ◽  
Tao Yan ◽  
...  

High Efficiency Video Coding (HEVC) is the next-generation video coding standard. HEVC defines 35 intra prediction modes to improve coding efficiency, but exhaustively evaluating this large number of modes together with the flexible coding unit (CU) structure results in huge computational complexity. To reduce this burden, this paper presents a gradient-based candidate-list clipping algorithm for intra mode prediction. Experimental results show that the proposed algorithm reduces total encoding time by 29.16% with just a 1.34% BD-rate increase and a 0.07 dB BD-PSNR decrease.
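A minimal sketch of what gradient-based candidate clipping could look like. The finite-difference gradient, the linear mode-to-angle map, and the 22.5° window are all simplifying assumptions, not the paper's actual algorithm:

```python
import math

def dominant_gradient_angle(block):
    """Average gradient orientation of a 2-D luma block via finite differences,
    folded into [0, 180) degrees (intra directions are sign-agnostic)."""
    gx = gy = 0.0
    h, w = len(block), len(block[0])
    for y in range(h - 1):
        for x in range(w - 1):
            gx += block[y][x + 1] - block[y][x]
            gy += block[y + 1][x] - block[y][x]
    return math.degrees(math.atan2(gy, gx)) % 180.0

def clip_candidate_list(block, window_deg=22.5):
    """Keep Planar/DC plus only the angular modes whose direction lies near
    the block's dominant gradient; the encoder then RD-tests this short list
    instead of all 35 modes."""
    # Hypothetical linear map of HEVC angular modes 2..34 onto 0..180 degrees.
    angular = {m: (m - 2) * 180.0 / 32.0 for m in range(2, 35)}
    theta = dominant_gradient_angle(block)
    keep = [0, 1]  # Planar and DC are always kept
    keep += [m for m, a in angular.items()
             if min(abs(a - theta), 180.0 - abs(a - theta)) <= window_deg]
    return keep

modes = clip_candidate_list([[0, 10, 20, 30]] * 4)  # block with horizontal gradient
```

For this horizontal-ramp block the dominant angle is 0°, so only the modes near that direction survive, shrinking the RD search considerably.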


2019 ◽  
Vol 29 (03) ◽  
pp. 2050046
Author(s):  
Xin Li ◽  
Na Gong

The state-of-the-art High Efficiency Video Coding standard (HEVC/H.265) adopts a hierarchical quadtree-structured coding unit (CU) to enhance coding efficiency. However, computational complexity increases significantly because of the exhaustive rate-distortion (RD) optimization process needed to obtain the optimal coding tree unit (CTU) partition. In this paper, we propose a fast CU size decision algorithm to reduce the heavy computational burden of the encoding process. To achieve this, the CU splitting process is modeled as a three-stage binary classification problem according to CU size, from 64 × 64 and 32 × 32 down to 16 × 16. In each CU partition stage, a deep learning approach is applied. Appropriate and efficient features for training the deep learning models are extracted from the spatial and pixel domains to eliminate dependency on video content and encoding configurations. Furthermore, the deep learning framework is built as a third-party library and embedded into the HEVC simulator to speed up the process. Experimental results show that the proposed algorithm achieves significant complexity reduction, cutting encoding time by 49.65% (Low Delay) and 48.81% (Random Access) on average compared with traditional HEVC encoders, with negligible degradation in coding efficiency (a 2.78% BDBR loss and 0.145 dB BDPSNR loss for Low Delay, and a 2.68% BDBR loss and 0.128 dB BDPSNR loss for Random Access).
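The three-stage decision cascade can be sketched as follows. The variance-threshold stand-ins for the trained deep models, and all names, are hypothetical; the point is the early-termination control flow:

```python
def cascade_cu_decision(classifiers, features_by_size):
    """Walk the 64x64 -> 32x32 -> 16x16 split cascade: at each stage a binary
    classifier predicts split/non-split, and a non-split verdict terminates
    the walk, pruning the RDO search over all smaller sub-CUs."""
    decisions = []
    for size in (64, 32, 16):
        split = classifiers[size](features_by_size[size])
        decisions.append((size, split))
        if not split:  # early termination: no need to examine sub-CUs
            break
    return decisions

# Toy stand-in classifiers: "split" when a variance feature is large.
# In the paper each stage would be a trained deep model instead.
clf = {s: (lambda f, t=500: f["variance"] > t) for s in (64, 32, 16)}
feats = {64: {"variance": 900}, 32: {"variance": 300}, 16: {"variance": 50}}
path = cascade_cu_decision(clf, feats)
```

Here the 64 × 64 CU is split but the 32 × 32 stage says stop, so the 16 × 16 classifier (and the corresponding RDO work) is never invoked.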


2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Jinchao Zhao ◽  
Yihan Wang ◽  
Qiuwen Zhang

As technology develops, hardware requirements and users’ expectations of visual quality keep rising. The multitype tree (MTT) architecture was proposed by the Joint Video Experts Team (JVET); therefore, in H.266/Versatile Video Coding (H.266/VVC) it is necessary to determine not only the coding unit (CU) depth but also its split mode. Although H.266/VVC achieves significant coding performance gains over H.265/High Efficiency Video Coding (H.265/HEVC), it significantly increases coding complexity and encoding time, with the exhaustive rate-distortion (RD) calculation over candidate CU partitions being the most time-consuming part. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. First, we develop a threshold-based texture classification model to distinguish complex from homogeneous CUs. Second, if a complex CU is an edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is used to classify it; otherwise, an adaptive CNN structure is used. Finally, the division of the CU is determined by the trained network and the CU’s parameters. When complex CUs are split, these two CNN schemes can process the training samples and terminate the rate-distortion optimization (RDO) calculation early for some CUs. Experimental results indicate that the proposed method reduces computational complexity and saves 39.39% of encoding time, thereby achieving fast encoding in H.266/VVC.
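A minimal sketch of the threshold-based texture pre-classification step, assuming luma variance as the texture measure (the paper's actual criterion and threshold may differ):

```python
def classify_texture(block, threshold=100.0):
    """Label a CU 'homogeneous' or 'complex' by luma variance. Only complex
    CUs would be forwarded to the CNN split classifiers; homogeneous CUs can
    skip the split search entirely."""
    pixels = [p for row in block for p in row]
    mean = sum(pixels) / len(pixels)
    var = sum((p - mean) ** 2 for p in pixels) / len(pixels)
    return "complex" if var > threshold else "homogeneous"

flat = [[128] * 8 for _ in range(8)]            # uniform luma, no detail
edge = [[0] * 4 + [255] * 4 for _ in range(8)]  # strong vertical edge
```

Routing easy (flat) CUs away from the CNNs is what keeps the per-CU overhead of the learned models small on average.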


2020 ◽  
Vol 34 (07) ◽  
pp. 11580-11587
Author(s):  
Haojie Liu ◽  
Han Shen ◽  
Lichao Huang ◽  
Ming Lu ◽  
Tong Chen ◽  
...  

Traditional video compression technologies have been developed over decades in pursuit of higher coding efficiency, and efficient temporal information representation plays a key role in video coding. In this paper, we therefore propose to exploit temporal correlation using both first-order optical flow and second-order flow prediction. We suggest a one-stage learning approach that encapsulates flow as quantized features from consecutive frames, which are then entropy coded with adaptive contexts conditioned on joint spatial-temporal priors to exploit second-order correlations. The joint priors are embedded in autoregressive spatial neighbors, co-located hyper elements, and temporal neighbors aggregated recurrently with a ConvLSTM. We evaluate our approach in the low-delay scenario against High Efficiency Video Coding (H.265/HEVC), H.264/AVC, and another learned video compression method, following common test settings. Our work offers state-of-the-art performance, with consistent gains across all popular test sequences.
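The second-order flow prediction idea can be illustrated with a simple linear extrapolation under a constant-velocity assumption; this hand-written sketch stands in for the learned predictor in the paper, and only the extrapolation residual would then need to be coded:

```python
def second_order_flow_prediction(flow_t2, flow_t1):
    """Extrapolate the next frame's flow field from the two previous fields:
    f_t ≈ 2 * f_{t-1} - f_{t-2} (linear motion model). Each field is a 2-D
    grid of (dx, dy) motion vectors."""
    return [[(2 * a[0] - b[0], 2 * a[1] - b[1])
             for a, b in zip(row1, row2)]
            for row1, row2 in zip(flow_t1, flow_t2)]

# Constant motion: the prediction reproduces the previous flow exactly,
# so the second-order residual to encode is zero.
pred = second_order_flow_prediction([[(1.0, 0.0)]], [[(1.0, 0.0)]])
accel = second_order_flow_prediction([[(0.0, 0.0)]], [[(1.0, 1.0)]])
```

When motion accelerates, the extrapolation overshoots proportionally, which is exactly the second-order correlation the entropy model can exploit.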


2019 ◽  
Vol 15 (12) ◽  
pp. 155014771989256
Author(s):  
Hong-rae Lee ◽  
Eun-bin Ahn ◽  
A-young Kim ◽  
Kwang-deok Seo

Recently, as demand for high-quality video and realistic media has increased, High Efficiency Video Coding (HEVC) has been standardized. However, HEVC incurs heavy computational complexity to achieve its high coding efficiency, which hinders fast and real-time processing. Inter coding is particularly complex: HEVC inter prediction uses reference pictures to improve coding efficiency, and these reference pictures are typically signaled in two independent lists, ordered by display order, for forward and backward prediction. If an event such as a scene change occurs in the input video, inter prediction performs unnecessary computations. The reference picture list should therefore be reconfigured to improve inter prediction performance and reduce computational complexity. To address this problem, this article proposes a method to reduce the computational complexity of HEVC encoding using information, such as scene changes, obtained from the input video through preprocessing. Furthermore, the reference picture lists are reconstructed by sorting the reference pictures by similarity to the currently coded picture using Angular Second Moment, Contrast, Entropy, and Correlation, which are image texture parameters computed from the input video. Simulations show that both encoding time and coding efficiency can be improved simultaneously by applying the proposed algorithms.
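The four texture parameters named above are classical gray-level co-occurrence matrix (GLCM) statistics. Below is a minimal pure-Python sketch, assuming a single horizontal (0, 1) pixel offset and a small number of gray levels; the paper's exact GLCM configuration is not specified here:

```python
import math

def glcm(img, levels=4):
    """Normalized gray-level co-occurrence matrix for the (0, 1) offset,
    i.e. counting horizontally adjacent gray-level pairs."""
    P = [[0.0] * levels for _ in range(levels)]
    n = 0
    for row in img:
        for a, b in zip(row, row[1:]):
            P[a][b] += 1
            n += 1
    return [[v / n for v in row] for row in P]

def texture_params(P):
    """Angular Second Moment, Contrast, Entropy and Correlation of a GLCM."""
    L = len(P)
    asm = sum(p * p for row in P for p in row)
    contrast = sum((i - j) ** 2 * P[i][j] for i in range(L) for j in range(L))
    entropy = -sum(p * math.log(p) for row in P for p in row if p > 0)
    mu_i = sum(i * P[i][j] for i in range(L) for j in range(L))
    mu_j = sum(j * P[i][j] for i in range(L) for j in range(L))
    sd_i = math.sqrt(sum((i - mu_i) ** 2 * P[i][j]
                         for i in range(L) for j in range(L)))
    sd_j = math.sqrt(sum((j - mu_j) ** 2 * P[i][j]
                         for i in range(L) for j in range(L)))
    corr = (sum((i - mu_i) * (j - mu_j) * P[i][j]
                for i in range(L) for j in range(L)) / (sd_i * sd_j)
            if sd_i and sd_j else 1.0)
    return asm, contrast, entropy, corr

P = glcm([[0, 0, 1, 1], [0, 0, 1, 1]])
asm, contrast, entropy, corr = texture_params(P)
```

A reference list could then be reordered by, for example, the Euclidean distance between each reference picture's parameter vector and the current picture's, placing the most similar references first.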


Sensors ◽  
2020 ◽  
Vol 20 (5) ◽  
pp. 1405 ◽  
Author(s):  
Riccardo Peloso ◽  
Maurizio Capra ◽  
Luigi Sole ◽  
Massimo Ruo Roch ◽  
Guido Masera ◽  
...  

In recent years, the need for new, efficient video compression methods has grown rapidly as frame resolutions have increased dramatically. In 2013, the Joint Collaborative Team on Video Coding (JCT-VC) produced the H.265/High Efficiency Video Coding (HEVC) standard, which represents the state of the art in video coding standards. Nevertheless, new algorithms and techniques to improve coding efficiency have since been proposed. One promising approach relies on embedding directional capabilities into the transform stage. Recently, the Steerable Discrete Cosine Transform (SDCT) has been proposed to exploit directional DCTs using bases with different orientation angles. The SDCT leads to a sparser representation, which translates into improved coding efficiency. Preliminary results show that the SDCT can be embedded into the HEVC standard, providing better compression ratios. This paper presents a hardware architecture for the SDCT that works at a frequency of 188 MHz, reaching a throughput of 3.00 GSample/s. In particular, this architecture supports 8K Ultra-High Definition (UHD) (7680 × 4320) at a frame rate of 60 Hz, one of the highest resolutions supported by HEVC.
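The stated 3.00 GSample/s budget can be checked against the 8K @ 60 Hz requirement, assuming 4:2:0 chroma subsampling (each luma position carries 1.5 samples once the quarter-size Cb and Cr planes are counted):

```python
# Sample-rate requirement for 8K UHD at 60 fps with 4:2:0 subsampling.
width, height, fps = 7680, 4320, 60
samples_per_s = width * height * fps * 1.5  # 1.5 samples per pixel (Y + Cb/4 + Cr/4)
print(samples_per_s / 1e9)  # ≈ 2.99 GSample/s, just under the 3.00 GSample/s budget
```

So the 188 MHz architecture's 3.00 GSample/s throughput covers 8K @ 60 Hz with a small margin, consistent with the claim above.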


Symmetry ◽  
2019 ◽  
Vol 11 (4) ◽  
pp. 454 ◽  
Author(s):  
Shahid Khan ◽  
Nazeer Muhammad ◽  
Shabieh Farwa ◽  
Tanzila Saba ◽  
Zahid Mahmood

The Multi-View extension of High Efficiency Video Coding (MV-HEVC) has improved the coding efficiency of multi-view videos, but this comes at the cost of extra coding complexity in the MV-HEVC encoder. This complexity can be reduced by trimming time-consuming encoding operations. In this work, we propose two methods to reduce encoder complexity: Early Coding unit Splitting (ECS) and Efficient Reference Picture Selection (ERPS). In the ECS method, the Coding Unit (CU) splitting decision for dependent views is based on the CU splitting information obtained from the base view, while the ERPS method selects reference pictures for dependent views based on the temporal location of the picture being encoded. Simulation results reveal that our proposed methods reduce encoding time by approximately 58% compared with HTM 16.2, the reference encoder for MV-HEVC.
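The ERPS idea of favoring temporally close reference pictures can be sketched as follows; `erps_select` and the picture-order-count (POC) distance criterion are illustrative assumptions, not the paper's exact rule:

```python
def erps_select(current_poc, available_pocs, num_refs=2):
    """Keep only the `num_refs` reference pictures temporally closest to the
    picture being encoded (by picture order count); distant references are
    dropped from the list, shrinking the motion search."""
    return sorted(available_pocs, key=lambda poc: abs(poc - current_poc))[:num_refs]

# Encoding POC 8 with references {0, 2, 4, 6, 10} available:
refs = erps_select(8, [0, 2, 4, 6, 10])
```

Because temporally nearer pictures usually correlate best with the current one, pruning the far references cuts motion-estimation cost with little efficiency loss.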

