A Novel and Effective Image Super-Resolution Reconstruction Technique via Fast Global and Local Residual Learning Model

Jingru Hou; Yujuan Si; Xiaoqian Yu

doi:10.3390/app10051856

A Novel and Effective Image Super-Resolution Reconstruction Technique via Fast Global and Local Residual Learning Model

Applied Sciences ◽

10.3390/app10051856 ◽

2020 ◽

Vol 10 (5) ◽

pp. 1856

Author(s):

Jingru Hou ◽

Yujuan Si ◽

Xiaoqian Yu

Keyword(s):

Neural Network ◽

High Resolution ◽

Super Resolution ◽

Learning Model ◽

Linear Mapping ◽

Reconstruction Technique ◽

Low Resolution ◽

Residual Learning ◽

Image Super Resolution ◽

Global And Local

The principle of image super-resolution reconstruction (SR) is to pass one or more low-resolution (LR) images through information processing technology to obtain the final high-resolution (HR) image. Convolutional neural networks (CNN) have achieved better results than traditional methods in the process of an image super-resolution reconstruction. However, if the number of neural network layers is increased blindly, it will cause a significant increase in the amount of calculation, increase the difficulty of training the network, and cause the loss of image details. Therefore, in this paper, we use a novel and effective image super-resolution reconstruction technique via fast global and local residual learning model (FGRLR). The principle is to directly train a low-resolution small image on a neural network without enlarging it. This will effectively reduce the amount of calculation. In addition, the stacked local residual block (LRB) structure is used for non-linear mapping, which can effectively overcome the problem of image degradation. After extracting features, use 1 × 1 convolution to perform dimensional compression, and expand the dimensions after non-linear mapping, which can reduce the calculation amount of the model. In the reconstruction layer, deconvolution is used to enlarge the image to the required size. This also reduces the number of parameters. We use skip connections to use low-resolution information for reconstructing high-resolution images. Experimental results show that the algorithm can effectively shorten the running time without affecting the quality of image restoration.

Download Full-text

Dynamic Convolutional Neural Network For Image Super-resolution

Journal of Advanced College of Engineering and Management ◽

10.3126/jacem.v3i0.18808 ◽

2018 ◽

Vol 3 ◽

pp. 1

Author(s):

Anil Bhujel ◽

Dibakar Raj Pant

Keyword(s):

Neural Network ◽

High Resolution ◽

Convolutional Neural Network ◽

Super Resolution ◽

Low Resolution ◽

Scaling Factors ◽

Resolution Image ◽

High Resolution Image ◽

Image Super Resolution ◽

Up Scaling

Single image super-resolution (SISR) is a technique that reconstructs high resolution image from single low resolution image. Dynamic Convolutional Neural Network (DCNN) is used here for the reconstruction of high resolution image from single low resolution image. It takes low resolution image as input and produce high resolution image as output for dynamic up-scaling factor 2, 3, and 4. The dynamic convolutional neural network directly learns an end-to-end mapping between low resolution and high resolution images. The CNN trained simultaneously with images up-scaled by factors 2, 3, and 4 to make it dynamic. The system is then tested for the input images with up-scaling factors 2, 3 and 4. The dynamically trained CNN performs well for all three up-scaling factors. The performance of network is measured by PSNR, WPSNR, SSIM, MSSSIM, and also by perceptual.Journal of Advanced College of Engineering and Management, Vol. 3, 2017, Page: 1-10

Download Full-text

Image Super-Resolution Reconstruction Method Based on Global and Local Residual Learning

2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC) ◽

10.1109/icivc47709.2019.8981305 ◽

2019 ◽

Author(s):

Jingru Hou ◽

Yujuan Si ◽

Liangliang Li

Keyword(s):

Super Resolution ◽

Reconstruction Method ◽

Residual Learning ◽

Image Super Resolution ◽

Global And Local

Download Full-text

Semi-Coupled Convolutional Sparse Learning for Image Super-Resolution

Remote Sensing ◽

10.3390/rs11212593 ◽

2019 ◽

Vol 11 (21) ◽

pp. 2593

Author(s):

Li ◽

Zhang ◽

Jiao ◽

Liu ◽

Yang ◽

...

Keyword(s):

High Resolution ◽

Super Resolution ◽

Mapping Function ◽

Linear Mapping ◽

Sparse Learning ◽

Proposed Model ◽

Filter Size ◽

High Resolution Images ◽

Resolution Problem ◽

Image Super Resolution

In the convolutional sparse coding-based image super-resolution problem, the coefficients of low- and high-resolution images in the same position are assumed to be equivalent, which enforces an identical structure of low- and high-resolution images. However, in fact the structure of high-resolution images is much more complicated than that of low-resolution images. In order to reduce the coupling between low- and high-resolution representations, a semi-coupled convolutional sparse learning method (SCCSL) is proposed for image super-resolution. The proposed method uses nonlinear convolution operations as the mapping function between low- and high-resolution features, and conventional linear mapping can be seen as a special case of the proposed method. Secondly, the neighborhoods within the filter size are used to calculate the current pixel, improving the flexibility of our proposed model. In addition, the filter size is adjustable. In order to illustrate the effectiveness of SCCSL method, we compare it with four state-of-the-art methods of 15 commonly used images. Experimental results show that this work provides a more flexible and efficient approach for image super-resolution problem.

Download Full-text

Distillation

Machine Learning for Human Motion Analysis ◽

10.4018/978-1-60566-900-7.ch013 ◽

2010 ◽

pp. 244-264

Author(s):

Dong Seon Cheng ◽

Marco Cristani ◽

Vittorio Murino

Keyword(s):

High Resolution ◽

Super Resolution ◽

Video Data ◽

Distillation Process ◽

Low Resolution ◽

Resolution Image ◽

High Resolution Image ◽

Image Super Resolution ◽

Comparative Results ◽

Low Resolution Images

Image super-resolution is one of the most appealing applications of image processing, capable of retrieving a high resolution image by fusing several registered low resolution images depicting an object of interest. However, employing super-resolution in video data is challenging: a video sequence generally contains a lot of scattered information regarding several objects of interest in cluttered scenes. Especially with hand-held cameras, the overall quality may be poor due to low resolution or unsteadiness. The objective of this chapter is to demonstrate why standard image super-resolution fails in video data, which are the problems that arise, and how we can overcome these problems. In our first contribution, we propose a novel Bayesian framework for super-resolution of persistent objects of interest in video sequences. We call this process Distillation. In the traditional formulation of the image super-resolution problem, the observed target is (1) always the same, (2) acquired using a camera making small movements, and (3) found in a number of low resolution images sufficient to recover high-frequency information. These assumptions are usually unsatisfied in real world video acquisitions and often beyond the control of the video operator. With Distillation, we aim to extend and to generalize the image super-resolution task, embedding it in a structured framework that accurately distills all the informative bits of an object of interest. In practice, the Distillation process: i) individuates, in a semi supervised way, a set of objects of interest, clustering the related video frames and registering them with respect to global rigid transformations; ii) for each one, produces a high resolution image, by weighting each pixel according to the information retrieved about the object of interest. As a second contribution, we extend the Distillation process to deal with objects of interest whose transformations in the appearance are not (only) rigid. Such process, built on top of the Distillation, is hierarchical, in the sense that a process of clustering is applied recursively, beginning with the analysis of whole frames, and selectively focusing on smaller sub-regions whose isolated motion can be reasonably assumed as rigid. The ultimate product of the overall process is a strip of images that describe at high resolution the dynamics of the video, switching between alternative local descriptions in response to visual changes. Our approach is first tested on synthetic data, obtaining encouraging comparative results with respect to known super-resolution techniques, and a good robustness against noise. Second, real data coming from different videos are considered, trying to solve the major details of the objects in motion.

Download Full-text

CONVOLUTIONAL NEURAL NETWORK BASED DEM SUPER RESOLUTION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xli-b3-247-2016 ◽

2016 ◽

Vol XLI-B3 ◽

pp. 247-250 ◽

Cited By ~ 1

Author(s):

Zixuan Chen ◽

Xuewen Wang ◽

Zekai Xu ◽

Wenguang Hou

Keyword(s):

Neural Network ◽

High Resolution ◽

Convolutional Neural Network ◽

Super Resolution ◽

Low Resolution ◽

Practical Applications ◽

Interpolation Methods ◽

New Strategy

DEM super resolution is proposed in our previous publication to improve the resolution for a DEM on basis of some learning examples. Meanwhile, the nonlocal algorithm is introduced to deal with it and lots of experiments show that the strategy is feasible. In our publication, the learning examples are defined as the partial original DEM and their related high measurements due to this way can avoid the incompatibility between the data to be processed and the learning examples. To further extent the applications of this new strategy, the learning examples should be diverse and easy to obtain. Yet, it may cause the problem of incompatibility and unrobustness. To overcome it, we intend to investigate a convolutional neural network based method. The input of the convolutional neural network is a low resolution DEM and the output is expected to be its high resolution one. A three layers model will be adopted. The first layer is used to detect some features from the input, the second integrates the detected features to some compressed ones and the final step transforms the compressed features as a new DEM. According to this designed structure, some learning DEMs will be taken to train it. Specifically, the designed network will be optimized by minimizing the error of the output and its expected high resolution DEM. In practical applications, a testing DEM will be input to the convolutional neural network and a super resolution will be obtained. Many experiments show that the CNN based method can obtain better reconstructions than many classic interpolation methods.

Download Full-text

Cascaded SR-GAN for Scale-Adaptive Low Resolution Person Re-identification

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/541 ◽

2018 ◽

Cited By ~ 25

Author(s):

Zheng Wang ◽

Mang Ye ◽

Fan Yang ◽

Xiang Bai ◽

Shin'ichi Satoh

Keyword(s):

High Resolution ◽

Super Resolution ◽

Feature Representation ◽

Image Feature ◽

Low Resolution ◽

Open World ◽

In Series ◽

High Resolution Images ◽

Public Dataset ◽

Image Super Resolution

Person re-identification (REID) is an important task in video surveillance and forensics applications. Most of previous approaches are based on a key assumption that all person images have uniform and sufficiently high resolutions. Actually, various low-resolutions and scale mismatching always exist in open world REID. We name this kind of problem as Scale-Adaptive Low Resolution Person Re-identification (SALR-REID). The most intuitive way to address this problem is to increase various low-resolutions (not only low, but also with different scales) to a uniform high-resolution. SR-GAN is one of the most competitive image super-resolution deep networks, designed with a fixed upscaling factor. However, it is still not suitable for SALR-REID task, which requires a network not only synthesizing high-resolution images with different upscaling factors, but also extracting discriminative image feature for judging person’s identity. (1) To promote the ability of scale-adaptive upscaling, we cascade multiple SRGANs in series. (2) To supplement the ability of image feature representation, we plug-in a reidentification network. With a unified formulation, a Cascaded Super-Resolution GAN (CSR-GAN) framework is proposed. Extensive evaluations on two simulated datasets and one public dataset demonstrate the advantages of our method over related state-of-the-art methods.

Download Full-text

Self-Dictionary Regression for Hyperspectral Image Super-Resolution

Remote Sensing ◽

10.3390/rs10101574 ◽

2018 ◽

Vol 10 (10) ◽

pp. 1574 ◽

Cited By ~ 4

Author(s):

Dongsheng Gao ◽

Zhentao Hu ◽

Renzhen Ye

Keyword(s):

High Resolution ◽

Spatial Resolution ◽

Spectral Resolution ◽

Cross Correlation ◽

Hyperspectral Image ◽

High Spatial Resolution ◽

Super Resolution ◽

High Spectral Resolution ◽

Low Resolution ◽

Image Super Resolution

Due to sensor limitations, hyperspectral images (HSIs) are acquired by hyperspectral sensors with high-spectral-resolution but low-spatial-resolution. It is difficult for sensors to acquire images with high-spatial-resolution and high-spectral-resolution simultaneously. Hyperspectral image super-resolution tries to enhance the spatial resolution of HSI by software techniques. In recent years, various methods have been proposed to fuse HSI and multispectral image (MSI) from an unmixing or a spectral dictionary perspective. However, these methods extract the spectral information from each image individually, and therefore ignore the cross-correlation between the observed HSI and MSI. It is difficult to achieve high-spatial-resolution while preserving the spatial-spectral consistency between low-resolution HSI and high-resolution HSI. In this paper, a self-dictionary regression based method is proposed to utilize cross-correlation between the observed HSI and MSI. Both the observed low-resolution HSI and MSI are simultaneously considered to estimate the endmember dictionary and the abundance code. To preserve the spectral consistency, the endmember dictionary is extracted by performing a common sparse basis selection on the concatenation of observed HSI and MSI. Then, a consistent constraint is exploited to ensure the spatial consistency between the abundance code of low-resolution HSI and the abundance code of high-resolution HSI. Extensive experiments on three datasets demonstrate that the proposed method outperforms the state-of-the-art methods.

Download Full-text

Video super resolution using convolutional neural network and image fusion techniques

International Journal of Knowledge-based and Intelligent Engineering Systems ◽

10.3233/kes-190037 ◽

2021 ◽

Vol 24 (4) ◽

pp. 279-287

Author(s):

Vikas Kumar ◽

Tanupriya Choudhury ◽

Suresh Chandra Satapathy ◽

Ravi Tomar ◽

Archit Aggarwal

Keyword(s):

Neural Network ◽

High Resolution ◽

Convolutional Neural Network ◽

Image Fusion ◽

Reference Model ◽

Super Resolution ◽

Motion Blur ◽

Single Image ◽

Image Super Resolution ◽

Single Image Super Resolution

Recently, huge progress has been achieved in the field of single image super resolution which augments the resolution of images. The idea behind super resolution is to convert low-resolution images into high-resolution images. SRCNN (Single Resolution Convolutional Neural Network) was a huge improvement over the existing methods of single-image super resolution. However, video super-resolution, despite being an active field of research, is yet to benefit from deep learning. Using still images and videos downloaded from various sources, we explore the possibility of using SRCNN along with image fusion techniques (minima, maxima, average, PCA, DWT) to improve over existing video super resolution methods. Video Super-Resolution has inherent difficulties such as unexpected motion, blur and noise. We propose Video Super Resolution – Image Fusion (VSR-IF) architecture which utilizes information from multiple frames to produce a single high- resolution frame for a video. We use SRCNN as a reference model to obtain high resolution adjacent frames and use a concatenation layer to group those frames into a single frame. Since, our method is data-driven and requires only minimal initial training, it is faster than other video super resolution methods. After testing our program, we find that our technique shows a significant improvement over SCRNN and other single image and frame super resolution techniques.

Download Full-text

Super-resolution reconstruction of seismic section image via multi-scale convolution neural network

E3S Web of Conferences ◽

10.1051/e3sconf/202130301058 ◽

2021 ◽

Vol 303 ◽

pp. 01058

Author(s):

Meng-Di Deng ◽

Rui-Sheng Jia ◽

Hong-Mei Sun ◽

Xing-Li Zhang

Keyword(s):

Neural Network ◽

High Resolution ◽

Convolutional Neural Network ◽

Super Resolution ◽

Image Features ◽

Image Feature ◽

Reconstruction Method ◽

Low Resolution ◽

Seismic Section ◽

Multi Scale

The resolution of seismic section images can directly affect the subsequent interpretation of seismic data. In order to improve the spatial resolution of low-resolution seismic section images, a super-resolution reconstruction method based on multi-scale convolution is proposed. This method designs a multi-scale convolutional neural network to learn high-low resolution image feature pairs, and realizes mapping learning from low-resolution seismic section images to high-resolution seismic section images. This multi-scale convolutional neural network model consists of four convolutional layers and a sub-pixel convolutional layer. Convolution operations are used to learn abundant seismic section image features, and sub-pixel convolution layer is used to reconstruct high-resolution seismic section image. The experimental results show that the proposed method is superior to the comparison method in peak signal-to-noise ratio (PSNR) and structural similarity (SSIM). In the total training time and reconstruction time, our method is about 22% less than the FSRCNN method and about 18% less than the ESPCN method.

Download Full-text

Image Super-Resolution for MRI Images using 3D Faster Super-Resolution Convolutional Neural Network architecture

ITM Web of Conferences ◽

10.1051/itmconf/20203203044 ◽

2020 ◽

Vol 32 ◽

pp. 03044

Author(s):

Vanita Mane ◽

Suchit Jadhav ◽

Praneya Lal

Keyword(s):

Neural Network ◽

High Resolution ◽

Convolutional Neural Network ◽

Network Architecture ◽

Brain Mri ◽

Super Resolution ◽

Structural Mri ◽

Reconstruction Performance ◽

Human Connectome Project ◽

Image Super Resolution

Single image super-resolution using deep learning techniques has shown very high reconstruction performance over the last few years. We propose a novel three-dimensional convolutional neural network called 3D FSRCNN based on FSRCNN, which reinstates the high-resolution quality of structural MRI. The 3D neural network generates output brain images of high-resolution (HR) from a low-resolution (LR) input image. A simple design ensures less time complexity and high reconstruction quality. The network is trained using T1-weighted structural MRI images from the human connectome project dataset which is a large publicly available brain MRI database.

Download Full-text