Efficient Video Frame Interpolation Using Generative Adversarial Networks

2020 ◽ Vol 10 (18) ◽ pp. 6245
Author(s): Quang Nhat Tran ◽ Shih-Hsuan Yang

Frame interpolation, which generates an intermediate frame from adjacent ones, finds various applications such as frame rate up-conversion, video compression, and video streaming. Instead of using the complex network models and additional data involved in state-of-the-art frame interpolation methods, this paper proposes an approach based on an end-to-end generative adversarial network. A combined loss function is employed that jointly considers the adversarial loss (difference between data models), the reconstruction loss, and motion blur degradation. The objective image quality metrics reach a PSNR of 29.22 dB and an SSIM of 0.835 on the UCF101 dataset, similar to those of the state-of-the-art approach. Notably, this visual quality is achieved in approximately one-fifth of the computational time, which makes real-time frame rate up-conversion possible. The interpolated output can be further improved by a GAN-based refinement network that better maintains motion and color through image-to-image translation.
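
As a rough illustration of the combined objective described in this abstract, the following PyTorch sketch blends an adversarial term with an L1 reconstruction term for the interpolation generator. The function name, the weighting coefficients, and the omission of the motion-blur term are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def combined_generator_loss(discriminator, fake_mid, real_mid,
                            lambda_adv=0.01, lambda_rec=1.0):
    """Hypothetical combined generator loss: adversarial + reconstruction."""
    # Adversarial term: encourage the discriminator to label the
    # synthesized intermediate frame as real.
    logits = discriminator(fake_mid)
    adv = F.binary_cross_entropy_with_logits(logits, torch.ones_like(logits))
    # Reconstruction term: L1 distance to the ground-truth intermediate frame.
    rec = F.l1_loss(fake_mid, real_mid)
    return lambda_adv * adv + lambda_rec * rec
```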

2020 ◽ Vol 34 (07) ◽ pp. 10607-10614
Author(s): Xianhang Cheng ◽ Zhenzhong Chen

Learning to synthesize non-existent frames from the original consecutive video frames is a challenging task. Recent kernel-based interpolation methods predict each pixel with a single convolution process, removing the dependency on optical flow. However, when scene motion is larger than the pre-defined kernel size, these methods yield poor results even though they take thousands of neighboring pixels into account. To solve this problem, we propose deformable separable convolution (DSepConv) to adaptively estimate kernels, offsets, and masks, allowing the network to gather information from far fewer but more relevant pixels. In addition, we show that kernel-based methods and conventional flow-based methods are specific instances of the proposed DSepConv. Experimental results demonstrate that our method significantly outperforms other kernel-based interpolation methods and performs on par with, or even better than, state-of-the-art algorithms both qualitatively and quantitatively.
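
To make the deformable sampling idea concrete, here is a simplified PyTorch sketch of how per-pixel offsets and blending weights (the product of kernels and masks) could be combined to synthesize each output pixel from a small set of adaptively chosen neighbors. The tensor layout and helper name are assumptions; the paper's actual DSepConv operator uses separable kernels and differs in detail.

```python
import torch
import torch.nn.functional as F

def deformable_adaptive_blend(frame, offsets, weights, kernel_size=3):
    """Blend K*K deformably sampled pixels per output location (sketch).

    frame:   (B, C, H, W) source frame
    offsets: (B, 2*K*K, H, W) per-pixel sampling offsets, in pixels
    weights: (B, K*K, H, W) per-pixel blending weights (kernel * mask)
    """
    B, C, H, W = frame.shape
    K2 = kernel_size * kernel_size
    # Base sampling grid in absolute pixel coordinates (x, y).
    ys, xs = torch.meshgrid(torch.arange(H), torch.arange(W), indexing="ij")
    base = torch.stack((xs, ys), dim=-1).float().to(frame.device)   # (H, W, 2)
    norm = torch.tensor([W - 1, H - 1], dtype=frame.dtype, device=frame.device)
    out = torch.zeros_like(frame)
    for k in range(K2):
        off = offsets[:, 2 * k:2 * k + 2].permute(0, 2, 3, 1)       # (B, H, W, 2)
        grid = base.unsqueeze(0) + off
        # Normalize pixel coordinates to [-1, 1] for grid_sample.
        grid = 2.0 * grid / norm - 1.0
        sampled = F.grid_sample(frame, grid, align_corners=True)
        out = out + weights[:, k:k + 1] * sampled
    return out
```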


Author(s): Xiang Kong ◽ Qizhe Xie ◽ Zihang Dai ◽ Eduard Hovy

Mixture of Softmaxes (MoS) has been shown to be effective at addressing the expressiveness limitation of Softmax-based models. Despite this known advantage, MoS is held back in practice by its large memory and computational cost, which stems from the need to compute multiple Softmaxes. In this work, we set out to unleash the power of MoS in practical applications by investigating improved word coding schemes, which can effectively reduce the vocabulary size and hence relieve the memory and computation burden. We show that both BPE and our proposed Hybrid-LightRNN lead to improved encoding mechanisms that can halve the time and memory consumption of MoS without performance loss. With MoS, we achieve an improvement of 1.5 BLEU on the IWSLT 2014 German-to-English corpus and an improvement of 0.76 CIDEr on image captioning. Moreover, on the larger WMT 2014 machine translation dataset, our MoS-boosted Transformer yields a 29.6 BLEU score for English-to-German and a 42.1 BLEU score for English-to-French, outperforming the single-Softmax Transformer by 0.9 and 0.4 BLEU respectively and achieving the state-of-the-art result on the WMT 2014 English-to-German task.
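
For readers unfamiliar with the MoS output layer, the following PyTorch sketch shows the basic computation: K context-dependent Softmax distributions over the vocabulary are mixed with context-dependent prior weights. The class name, layer sizes, and number of components are illustrative assumptions; the per-component vocabulary-sized Softmaxes are the memory and time bottleneck that the word-coding schemes above address.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixtureOfSoftmaxes(nn.Module):
    """Minimal MoS output layer: K mixture components over the vocabulary."""

    def __init__(self, hidden_size, vocab_size, n_components=4):
        super().__init__()
        self.prior = nn.Linear(hidden_size, n_components)            # mixture weights
        self.latent = nn.Linear(hidden_size, n_components * hidden_size)
        self.decoder = nn.Linear(hidden_size, vocab_size)            # shared projection
        self.n_components = n_components
        self.hidden_size = hidden_size

    def forward(self, h):                                            # h: (B, hidden_size)
        pi = F.softmax(self.prior(h), dim=-1)                        # (B, K)
        z = torch.tanh(self.latent(h)).view(-1, self.n_components, self.hidden_size)
        comp = F.softmax(self.decoder(z), dim=-1)                    # (B, K, V)
        return (pi.unsqueeze(-1) * comp).sum(dim=1)                  # (B, V)
```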


Symmetry ◽ 2019 ◽ Vol 11 (10) ◽ pp. 1251
Author(s): Ahn ◽ Jeong ◽ Kim ◽ Kwon ◽ Yoo

Recently, research on video frame interpolation with convolutional neural networks has shown remarkable results. However, these methods demand huge amounts of memory and run time for high-resolution videos and are unable to process a 4K frame in a single pass. In this paper, we propose a fast 4K video frame interpolation method based on a multi-scale optical flow reconstruction scheme. The proposed method predicts low-resolution bi-directional optical flow and reconstructs it at high resolution. We also propose consistency and multi-scale smoothness losses to enhance the quality of the predicted optical flow. Furthermore, we use an adversarial loss to make the interpolated frame more seamless and natural. We demonstrate that the proposed method outperforms existing state-of-the-art methods in quantitative evaluation, while running up to 4.39× faster than those methods on 4K videos.
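
A minimal sketch of the coarse-to-fine flow handling described above: bi-directional flow estimated at a low resolution is upsampled to the target (e.g. 4K) resolution, and the flow vectors are rescaled to match the new pixel grid. The function name and bilinear interpolation are assumptions for illustration; the paper's reconstruction scheme is learned, whereas this only shows the rescaling any multi-scale flow approach needs.

```python
import torch
import torch.nn.functional as F

def upscale_flow(low_res_flow, target_size):
    """Upsample a coarse optical flow field and rescale its vectors (sketch).

    low_res_flow: (B, 2, h, w) flow in pixel units at the coarse scale.
    target_size:  (H, W) of the full-resolution frame.
    """
    H, W = target_size
    h, w = low_res_flow.shape[-2:]
    flow = F.interpolate(low_res_flow, size=(H, W),
                         mode="bilinear", align_corners=False)
    # Flow vectors are measured in pixels, so they must grow with the image.
    scale = torch.tensor([W / w, H / h], dtype=flow.dtype,
                         device=flow.device).view(1, 2, 1, 1)
    return flow * scale
```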

