Deep-Learning Steganalysis for Removing Document Images on the Basis of Geometric Median Pruning

Shangping Zhong; Wude Weng; Kaizhi Chen; Jianhua Lai

doi:10.3390/sym12091426

Deep-Learning Steganalysis for Removing Document Images on the Basis of Geometric Median Pruning

Symmetry ◽

10.3390/sym12091426 ◽

2020 ◽

Vol 12 (9) ◽

pp. 1426

Author(s):

Shangping Zhong ◽

Wude Weng ◽

Kaizhi Chen ◽

Jianhua Lai

Keyword(s):

Deep Learning ◽

Image Quality Assessment ◽

Document Image ◽

Communication Process ◽

Secret Message ◽

Bee Colony ◽

Secret Communication ◽

Pruning Algorithms ◽

High Level ◽

Removal Model

The deep-learning steganography of current hotspots can conceal an image secret message in a cover image of the same size. While the steganography secret message is primarily removed via active steganalysis. The document image as the secret message in deep-learning steganography can deliver a considerable amount of effective information in a secret communication process. This study builds and implements deep-learning steganography removal models of document image secret messages based on the idea of adversarial perturbation removal: feed-forward denoising convolutional neural networks (DnCNN) and high-level representation guided denoiser (HGD). Further—considering the large computation cost and storage overheads of the above model—we use the document image-quality assessment (DIQA) as threshold, calculate the importance of filters using geometric median and prune redundant filters as extensively as possible through the overall iterative pruning and artificial bee colony (ABC) automatic pruning algorithms to reduce the size of the network structure of the existing vast and over-parameterized deep-learning steganography removal model, while maintaining the good removal effects of the model in the pruning process. Experiment results showed that the model generated by this method has better adaptability and scalability. Compared with the original deep-learning steganography removal model without pruning in this paper, the classic indicators params and flops are reduced by more than 75%.

Download Full-text

A deep learning approach to document image quality assessment

2014 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2014.7025520 ◽

2014 ◽

Cited By ~ 33

Author(s):

Le Kang ◽

Peng Ye ◽

Yi Li ◽

David Doermann

Keyword(s):

Deep Learning ◽

Image Quality ◽

Quality Assessment ◽

Image Quality Assessment ◽

Document Image ◽

Learning Approach

Download Full-text

Document Image Quality Assessment with Relaying Reference to Determine Minimum Readable Resolution for Compression

Electronic Imaging ◽

10.2352/issn.2470-1173.2020.9.iqsp-323 ◽

2020 ◽

Vol 2020 (9) ◽

pp. 323-1-323-8

Author(s):

Litao Hu ◽

Zhenhua Hu ◽

Peter Bauer ◽

Todd J. Harris ◽

Jan P. Allebach

Keyword(s):

Image Quality ◽

Quality Assessment ◽

Image Quality Assessment ◽

Research Area ◽

Input Image ◽

Quality Score ◽

Document Image ◽

Digital Cameras ◽

Active Research ◽

Traditional Approaches

Image quality assessment has been a very active research area in the field of image processing, and there have been numerous methods proposed. However, most of the existing methods focus on digital images that only or mainly contain pictures or photos taken by digital cameras. Traditional approaches evaluate an input image as a whole and try to estimate a quality score for the image, in order to give viewers an idea of how “good” the image looks. In this paper, we mainly focus on the quality evaluation of contents of symbols like texts, bar-codes, QR-codes, lines, and hand-writings in target images. Estimating a quality score for this kind of information can be based on whether or not it is readable by a human, or recognizable by a decoder. Moreover, we mainly study the viewing quality of the scanned document of a printed image. For this purpose, we propose a novel image quality assessment algorithm that is able to determine the readability of a scanned document or regions in a scanned document. Experimental results on some testing images demonstrate the effectiveness of our method.

Download Full-text

Explicit-implicit dual stream network for image quality assessment

EURASIP Journal on Image and Video Processing ◽

10.1186/s13640-020-00538-y ◽

2020 ◽

Vol 2020 (1) ◽

Author(s):

Guangyi Yang ◽

Xingyu Ding ◽

Tian Huang ◽

Kun Cheng ◽

Weizheng Jin

Keyword(s):

Deep Learning ◽

Image Quality ◽

Quality Assessment ◽

Frequency Domain ◽

Image Quality Assessment ◽

Feature Fusion ◽

Learning Model ◽

Stream Network ◽

Perception System ◽

Deep Learning Model

Abstract Communications industry has remarkably changed with the development of fifth-generation cellular networks. Image, as an indispensable component of communication, has attracted wide attention. Thus, finding a suitable approach to assess image quality is important. Therefore, we propose a deep learning model for image quality assessment (IQA) based on explicit-implicit dual stream network. We use frequency domain features of kurtosis based on wavelet transform to represent explicit features and spatial features extracted by convolutional neural network (CNN) to represent implicit features. Thus, we constructed an explicit-implicit (EI) parallel deep learning model, namely, EI-IQA model. The EI-IQA model is based on the VGGNet that extracts the spatial domain features. On this basis, the number of network layers of VGGNet is reduced by adding the parallel wavelet kurtosis value frequency domain features. Thus, the training parameters and the sample requirements decline. We verified, by cross-validation of different databases, that the wavelet kurtosis feature fusion method based on deep learning has a more complete feature extraction effect and a better generalisation ability. Thus, the method can simulate the human visual perception system better, and subjective feelings become closer to the human eye. The source code about the proposed EI-IQA model is available on github https://github.com/jacob6/EI-IQA.

Download Full-text

Deep-Framework: A Distributed, Scalable, and Edge-Oriented Framework for Real-Time Analysis of Video Streams

Sensors ◽

10.3390/s21124045 ◽

2021 ◽

Vol 21 (12) ◽

pp. 4045

Author(s):

Alessandro Sassu ◽

Jose Francisco Saenz-Cogollo ◽

Maurizio Agelli

Keyword(s):

Deep Learning ◽

Real Time ◽

Video Data ◽

Video Analytics ◽

Web Based ◽

Real Time Analysis ◽

Open Source Framework ◽

Cluster Configuration ◽

Time Requirements ◽

High Level

Edge computing is the best approach for meeting the exponential demand and the real-time requirements of many video analytics applications. Since most of the recent advances regarding the extraction of information from images and video rely on computation heavy deep learning algorithms, there is a growing need for solutions that allow the deployment and use of new models on scalable and flexible edge architectures. In this work, we present Deep-Framework, a novel open source framework for developing edge-oriented real-time video analytics applications based on deep learning. Deep-Framework has a scalable multi-stream architecture based on Docker and abstracts away from the user the complexity of cluster configuration, orchestration of services, and GPU resources allocation. It provides Python interfaces for integrating deep learning models developed with the most popular frameworks and also provides high-level APIs based on standard HTTP and WebRTC interfaces for consuming the extracted video data on clients running on browsers or any other web-based platform.

Download Full-text

Feature extraction-based image steganalysis using deep learning

WEENTECH Proceedings in Energy ◽

10.32438/wpe.182021 ◽

2021 ◽

pp. 188-198

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Secure Communication ◽

Information Technologies ◽

Network Models ◽

Error Rates ◽

Multimedia Data ◽

Secret Message ◽

Neural Network Models

The innovations in advanced information technologies has led to rapid delivery and sharing of multimedia data like images and videos. The digital steganography offers ability to secure communication and imperative for internet. The image steganography is essential to preserve confidential information of security applications. The secret image is embedded within pixels. The embedding of secret message is done by applied with S-UNIWARD and WOW steganography. Hidden messages are reveled using steganalysis. The exploration of research interests focused on conventional fields and recent technological fields of steganalysis. This paper devises Convolutional neural network models for steganalysis. Convolutional neural network (CNN) is one of the most frequently used deep learning techniques. The Convolutional neural network is used to extract spatio-temporal information or features and classification. We have compared steganalysis outcome with AlexNet and SRNeT with same dataset. The stegnalytic error rates are compared with different payloads.

Download Full-text

Automated Generation of a Single Shot Detector C Library from High Level Deep Learning Frameworks

2018 IEEE 4th International Forum on Research and Technology for Society and Industry (RTSI) ◽

10.1109/rtsi.2018.8548427 ◽

2018 ◽

Author(s):

Luca Ranalli ◽

Luigi Di Stefano ◽

Emanuele Plebani ◽

Mirko Falchetto ◽

Danilo Pau ◽

...

Keyword(s):

Deep Learning ◽

Single Shot ◽

Automated Generation ◽

High Level ◽

Learning Frameworks

Download Full-text

Optimum Network/Framework Selection from High-Level Specifications in Embedded Deep Learning Vision Applications

Advanced Concepts for Intelligent Vision Systems - Lecture Notes in Computer Science ◽

10.1007/978-3-030-01449-0_31 ◽

2018 ◽

pp. 369-379

Author(s):

Delia Velasco-Montero ◽

Jorge Fernández-Berni ◽

Ricardo Carmona-Galán ◽

Ángel Rodríguez-Vázquez

Keyword(s):

Deep Learning ◽

High Level

Download Full-text

Building Extraction in Very High Resolution Imagery by Dense-Attention Networks

Remote Sensing ◽

10.3390/rs10111768 ◽

2018 ◽

Vol 10 (11) ◽

pp. 1768 ◽

Cited By ~ 24

Author(s):

Hui Yang ◽

Penghai Wu ◽

Xuedong Yao ◽

Yanlan Wu ◽

Biao Wang ◽

...

Keyword(s):

Deep Learning ◽

High Resolution ◽

Building Extraction ◽

Learning Networks ◽

Feature Maps ◽

Low Level ◽

High Resolution Imagery ◽

Very High Resolution Imagery ◽

High Level ◽

Very High

Building extraction from very high resolution (VHR) imagery plays an important role in urban planning, disaster management, navigation, updating geographic databases, and several other geospatial applications. Compared with the traditional building extraction approaches, deep learning networks have recently shown outstanding performance in this task by using both high-level and low-level feature maps. However, it is difficult to utilize different level features rationally with the present deep learning networks. To tackle this problem, a novel network based on DenseNets and the attention mechanism was proposed, called the dense-attention network (DAN). The DAN contains an encoder part and a decoder part which are separately composed of lightweight DenseNets and a spatial attention fusion module. The proposed encoder–decoder architecture can strengthen feature propagation and effectively bring higher-level feature information to suppress the low-level feature and noises. Experimental results based on public international society for photogrammetry and remote sensing (ISPRS) datasets with only red–green–blue (RGB) images demonstrated that the proposed DAN achieved a higher score (96.16% overall accuracy (OA), 92.56% F1 score, 90.56% mean intersection over union (MIOU), less training and response time and higher-quality value) when compared with other deep learning methods.

Download Full-text

A Survey of Graphical Page Object Detection with Deep Neural Networks

10.20944/preprints202104.0739.v1 ◽

2021 ◽

Author(s):

Jwalin Bhatt ◽

Khurram Azeem Hashmi ◽

Muhammad Zeshan Afzal ◽

Didier Stricker

Keyword(s):

Deep Learning ◽

Object Detection ◽

Conceptual Understanding ◽

Deep Neural Networks ◽

State Of The Art ◽

Learning Approaches ◽

Document Images ◽

Essential Information ◽

Current State ◽

High Level

In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. It leads to a high-level conceptual understanding of the documents that makes digitization of documents viable. Since the advent of deep learning, the performance of deep learning-based object detection has improved many folds. In this work, we outline and summarize the deep learning approaches for detecting graphical page objects in the document images. Therefore, we discuss the most relevant deep learning-based approaches and state-of-the-art graphical page object detection in document images. This work provides a comprehensive understanding of the current state-of-the-art and related challenges. Furthermore, we discuss leading datasets along with the quantitative evaluation. Moreover, it discusses briefly the promising directions that can be utilized for further improvements.

Download Full-text

Blind Image Quality Assessment Bases On Natural Scene Statistics And Deep Learning

Proceedings of the 2015 5th International Conference on Computer Sciences and Automation Engineering ◽

10.2991/iccsae-15.2016.174 ◽

2016 ◽

Author(s):

De Ge ◽

Jianxin Song

Keyword(s):

Deep Learning ◽

Image Quality ◽

Quality Assessment ◽

Image Quality Assessment ◽

Natural Scene ◽

Natural Scene Statistics ◽

Blind Image Quality Assessment ◽

Blind Image

Download Full-text