Infrared and Visible Image Fusion Techniques Based on Deep Learning: A Review

Changqi Sun; Cong Zhang; Naixue Xiong

doi:10.3390/electronics9122162

Infrared and Visible Image Fusion Techniques Based on Deep Learning: A Review

Electronics ◽

10.3390/electronics9122162 ◽

2020 ◽

Vol 9 (12) ◽

pp. 2162

Author(s):

Changqi Sun ◽

Cong Zhang ◽

Naixue Xiong

Keyword(s):

Deep Learning ◽

Image Fusion ◽

Research Work ◽

Image Features ◽

Future Research ◽

Learning Methods ◽

Visible Image ◽

Advantages And Disadvantages ◽

Fusion Methods ◽

Future Work

Infrared and visible image fusion technologies make full use of different image features obtained by different sensors, retain complementary information of the source images during the fusion process, and use redundant information to improve the credibility of the fusion image. In recent years, many researchers have used deep learning methods (DL) to explore the field of image fusion and found that applying DL has improved the time-consuming efficiency of the model and the fusion effect. However, DL includes many branches, and there is currently no detailed investigation of deep learning methods in image fusion. In this work, this survey reports on the development of image fusion algorithms based on deep learning in recent years. Specifically, this paper first conducts a detailed investigation on the fusion method of infrared and visible images based on deep learning, compares the existing fusion algorithms qualitatively and quantitatively with the existing fusion quality indicators, and discusses various fusions. The main contribution, advantages, and disadvantages of the algorithm. Finally, the research status of infrared and visible image fusion is summarized, and future work has prospected. This research can help us realize many image fusion methods in recent years and lay the foundation for future research work.

Download Full-text

Infrared and visible image fusion via octave Gaussian pyramid framework

Scientific Reports ◽

10.1038/s41598-020-80189-1 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Lei Yan ◽

Qun Hao ◽

Jie Cao ◽

Rizvi Saad ◽

Kun Li ◽

...

Keyword(s):

Image Fusion ◽

Low Frequency ◽

Objective Evaluation ◽

Visible Image ◽

Composite Image ◽

Multiscale Decomposition ◽

Gaussian Pyramid ◽

Fusion Methods ◽

Scale Spaces ◽

Fusion Framework

AbstractImage fusion integrates information from multiple images (of the same scene) to generate a (more informative) composite image suitable for human and computer vision perception. The method based on multiscale decomposition is one of the commonly fusion methods. In this study, a new fusion framework based on the octave Gaussian pyramid principle is proposed. In comparison with conventional multiscale decomposition, the proposed octave Gaussian pyramid framework retrieves more information by decomposing an image into two scale spaces (octave and interval spaces). Different from traditional multiscale decomposition with one set of detail and base layers, the proposed method decomposes an image into multiple sets of detail and base layers, and it efficiently retains high- and low-frequency information from the original image. The qualitative and quantitative comparison with five existing methods (on publicly available image databases) demonstrate that the proposed method has better visual effects and scores the highest in objective evaluation.

Download Full-text

A Review of Computer-Aided Expert Systems for Breast Cancer Diagnosis

Cancers ◽

10.3390/cancers13112764 ◽

2021 ◽

Vol 13 (11) ◽

pp. 2764

Author(s):

Xin Yu Liew ◽

Nazia Hameed ◽

Jeremie Clos

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Deep Learning ◽

Main Process ◽

Learning Approaches ◽

Learning Methods ◽

Advantages And Disadvantages ◽

Computer Aided ◽

Conventional Methods ◽

The Impact

A computer-aided diagnosis (CAD) expert system is a powerful tool to efficiently assist a pathologist in achieving an early diagnosis of breast cancer. This process identifies the presence of cancer in breast tissue samples and the distinct type of cancer stages. In a standard CAD system, the main process involves image pre-processing, segmentation, feature extraction, feature selection, classification, and performance evaluation. In this review paper, we reviewed the existing state-of-the-art machine learning approaches applied at each stage involving conventional methods and deep learning methods, the comparisons within methods, and we provide technical details with advantages and disadvantages. The aims are to investigate the impact of CAD systems using histopathology images, investigate deep learning methods that outperform conventional methods, and provide a summary for future researchers to analyse and improve the existing techniques used. Lastly, we will discuss the research gaps of existing machine learning approaches for implementation and propose future direction guidelines for upcoming researchers.

Download Full-text

Infrared and visible image fusion methods and applications: A survey

Information Fusion ◽

10.1016/j.inffus.2018.02.004 ◽

2019 ◽

Vol 45 ◽

pp. 153-178 ◽

Cited By ~ 217

Author(s):

Jiayi Ma ◽

Yong Ma ◽

Chang Li

Keyword(s):

Image Fusion ◽

Visible Image ◽

Fusion Methods

Download Full-text

An infrared and visible image fusion method based on deep learning

4th Optics Young Scientist Summit (OYSS 2020) ◽

10.1117/12.2591303 ◽

2021 ◽

Author(s):

Dawei Zhang ◽

Kan Ren ◽

Jing Zhou ◽

Guohua Gu ◽

Qian Chen

Keyword(s):

Deep Learning ◽

Image Fusion ◽

Fusion Method ◽

Visible Image ◽

Image Fusion Method

Download Full-text

Deep Learning-Based Monocular Depth Estimation Methods—A State-of-the-Art Review

Sensors ◽

10.3390/s20082272 ◽

2020 ◽

Vol 20 (8) ◽

pp. 2272 ◽

Cited By ~ 5

Author(s):

Faisal Khan ◽

Saqib Salahuddin ◽

Hossein Javidnia

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Research Work ◽

Depth Estimation ◽

Autonomous Driving ◽

Estimation Methods ◽

Future Research ◽

Comprehensive Overview ◽

Ill Posed ◽

Monocular Depth

Monocular depth estimation from Red-Green-Blue (RGB) images is a well-studied ill-posed problem in computer vision which has been investigated intensively over the past decade using Deep Learning (DL) approaches. The recent approaches for monocular depth estimation mostly rely on Convolutional Neural Networks (CNN). Estimating depth from two-dimensional images plays an important role in various applications including scene reconstruction, 3D object-detection, robotics and autonomous driving. This survey provides a comprehensive overview of this research topic including the problem representation and a short description of traditional methods for depth estimation. Relevant datasets and 13 state-of-the-art deep learning-based approaches for monocular depth estimation are reviewed, evaluated and discussed. We conclude this paper with a perspective towards future research work requiring further investigation in monocular depth estimation challenges.

Download Full-text

Infrared and Visible Image Fusion through Details Preservation

Sensors ◽

10.3390/s19204556 ◽

2019 ◽

Vol 19 (20) ◽

pp. 4556 ◽

Cited By ~ 2

Author(s):

Yaochen Liu ◽

Lili Dong ◽

Yuanyuan Ji ◽

Wenhai Xu

Keyword(s):

Image Fusion ◽

Subjective Evaluation ◽

Objective Assessment ◽

Decomposition Methods ◽

Source Image ◽

Fusion Method ◽

Visible Image ◽

Weighting Method ◽

Fused Image ◽

Fusion Methods

In many actual applications, fused image is essential to contain high-quality details for achieving a comprehensive representation of the real scene. However, existing image fusion methods suffer from loss of details because of the error accumulations of sequential tasks. This paper proposes a novel fusion method to preserve details of infrared and visible images by combining new decomposition, feature extraction, and fusion scheme. For decomposition, different from the most decomposition methods by guided filter, the guidance image contains only the strong edge of the source image but no other interference information so that rich tiny details can be decomposed into the detailed part. Then, according to the different characteristics of infrared and visible detail parts, a rough convolutional neural network (CNN) and a sophisticated CNN are designed so that various features can be fully extracted. To integrate the extracted features, we also present a multi-layer features fusion strategy through discrete cosine transform (DCT), which not only highlights significant features but also enhances details. Moreover, the base parts are fused by weighting method. Finally, the fused image is obtained by adding the fused detail and base part. Different from the general image fusion methods, our method not only retains the target region of source image but also enhances background in the fused image. In addition, compared with state-of-the-art fusion methods, our proposed fusion method has many advantages, including (i) better visual quality of fused-image subjective evaluation, and (ii) better objective assessment for those images.

Download Full-text

Salient Object Detection Techniques in Computer Vision—A Survey

Entropy ◽

10.3390/e22101174 ◽

2020 ◽

Vol 22 (10) ◽

pp. 1174

Author(s):

Ashish Kumar Gupta ◽

Ayan Seal ◽

Mukesh Prasad ◽

Pritee Khanna

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Object Detection ◽

Large Scale ◽

Research Work ◽

Salient Object Detection ◽

Future Research ◽

Automatic Identification ◽

Salient Object ◽

Detection Techniques

Detection and localization of regions of images that attract immediate human visual attention is currently an intensive area of research in computer vision. The capability of automatic identification and segmentation of such salient image regions has immediate consequences for applications in the field of computer vision, computer graphics, and multimedia. A large number of salient object detection (SOD) methods have been devised to effectively mimic the capability of the human visual system to detect the salient regions in images. These methods can be broadly categorized into two categories based on their feature engineering mechanism: conventional or deep learning-based. In this survey, most of the influential advances in image-based SOD from both conventional as well as deep learning-based categories have been reviewed in detail. Relevant saliency modeling trends with key issues, core techniques, and the scope for future research work have been discussed in the context of difficulties often faced in salient object detection. Results are presented for various challenging cases for some large-scale public datasets. Different metrics considered for assessment of the performance of state-of-the-art salient object detection models are also covered. Some future directions for SOD are presented towards end.

Download Full-text

Deep learning for drug–drug interaction extraction from the literature: a review

Briefings in Bioinformatics ◽

10.1093/bib/bbz087 ◽

2019 ◽

Vol 21 (5) ◽

pp. 1609-1627 ◽

Cited By ~ 5

Author(s):

Tianlin Zhang ◽

Jiaxu Leng ◽

Ying Liu

Keyword(s):

Deep Learning ◽

Drug Research ◽

Drug Effects ◽

Learning Technologies ◽

Biomedical Literature ◽

Learning Approaches ◽

Learning Methods ◽

Feature Representations ◽

Advantages And Disadvantages ◽

Interaction Extraction

AbstractDrug–drug interactions (DDIs) are crucial for drug research and pharmacovigilance. These interactions may cause adverse drug effects that threaten public health and patient safety. Therefore, the DDIs extraction from biomedical literature has been widely studied and emphasized in modern biomedical research. The previous rules-based and machine learning approaches rely on tedious feature engineering, which is labourious, time-consuming and unsatisfactory. With the development of deep learning technologies, this problem is alleviated by learning feature representations automatically. Here, we review the recent deep learning methods that have been applied to the extraction of DDIs from biomedical literature. We describe each method briefly and compare its performance in the DDI corpus systematically. Next, we summarize the advantages and disadvantages of these deep learning models for this task. Furthermore, we discuss some challenges and future perspectives of DDI extraction via deep learning methods. This review aims to serve as a useful guide for interested researchers to further advance bioinformatics algorithms for DDIs extraction from the literature.

Download Full-text

Infrared and Visible Image Fusion Using Truncated Huber Penalty Function Smoothing and Visual Saliency Based Threshold Optimization

Electronics ◽

10.3390/electronics11010033 ◽

2021 ◽

Vol 11 (1) ◽

pp. 33

Author(s):

Chaowei Duan ◽

Yiliu Liu ◽

Changda Xing ◽

Zhisheng Wang

Keyword(s):

Image Fusion ◽

Penalty Function ◽

Visual Saliency ◽

Fusion Rule ◽

Visible Image ◽

Threshold Optimization ◽

Fused Image ◽

Fusion Methods ◽

Visible Images ◽

Image Representations

An efficient method for the infrared and visible image fusion is presented using truncated Huber penalty function smoothing and visual saliency based threshold optimization. The method merges complementary information from multimodality source images into a more informative composite image in two-scale domain, in which the significant objects/regions are highlighted and rich feature information is preserved. Firstly, source images are decomposed into two-scale image representations, namely, the approximate and residual layers, using truncated Huber penalty function smoothing. Benefiting from the edge- and structure-preserving characteristics, the significant objects and regions in the source images are effectively extracted without halo artifacts around the edges. Secondly, a visual saliency based threshold optimization fusion rule is designed to fuse the approximate layers aiming to highlight the salient targets in infrared images and remain the high-intensity regions in visible images. The sparse representation based fusion rule is adopted to fuse the residual layers with the goal of acquiring rich detail texture information. Finally, combining the fused approximate and residual layers reconstructs the fused image with more natural visual effects. Sufficient experimental results demonstrate that the proposed method can achieve comparable or superior performances compared with several state-of-the-art fusion methods in visual results and objective assessments.

Download Full-text

A Heterogeneous Image Fusion Method Based on DCT and Anisotropic Diffusion for UAVs in Future 5G IoT Scenarios

Wireless Communications and Mobile Computing ◽

10.1155/2020/8816818 ◽

2020 ◽

Vol 2020 ◽

pp. 1-11

Author(s):

Shuai Hao ◽

Beiyi An ◽

Hu Wen ◽

Xu Ma ◽

Keping Yu

Keyword(s):

Image Fusion ◽

Traffic Management ◽

Anisotropic Diffusion ◽

Weighted Averaging ◽

Base Layer ◽

Fusion Method ◽

Linear Superposition ◽

Visible Image ◽

Fusion Methods ◽

Visible Images

Unmanned aerial vehicles, with their inherent fine attributes, such as flexibility, mobility, and autonomy, play an increasingly important role in the Internet of Things (IoT). Airborne infrared and visible image fusion, which constitutes an important data basis for the perception layer of IoT, has been widely used in various fields such as electric power inspection, military reconnaissance, emergency rescue, and traffic management. However, traditional infrared and visible image fusion methods suffer from weak detail resolution. In order to better preserve useful information from source images and produce a more informative image for human observation or unmanned aerial vehicle vision tasks, a novel fusion method based on discrete cosine transform (DCT) and anisotropic diffusion is proposed. First, the infrared and visible images are denoised by using DCT. Second, anisotropic diffusion is applied to the denoised infrared and visible images to obtain the detail and base layers. Third, the base layers are fused by using weighted averaging, and the detail layers are fused by using the Karhunen–Loeve transform, respectively. Finally, the fused image is reconstructed through the linear superposition of the base layer and detail layer. Compared with six other typical fusion methods, the proposed approach shows better fusion performance in both objective and subjective evaluations.

Download Full-text