Deep Residual Network in Network

Computational Intelligence and Neuroscience ◽

10.1155/2021/6659083 ◽

2021 ◽

Vol 2021 ◽

pp. 1-9

Author(s):

Hmidi Alaeddine ◽

Malek Jihene

Keyword(s):

Recognition Performance ◽

Nonlinear Function ◽

Linear Filter ◽

Residual Network ◽

Learning Framework ◽

Residual Learning ◽

Learning Functions ◽

Speed Up ◽

Important Extension ◽

On Chip

The deep network in network (DNIN) model is an efficient instance and an important extension of the convolutional neural network (CNN) consisting of alternating convolutional and pooling layers. In this model, a multilayer perceptron (MLP), a nonlinear function, replaces the linear filter used for convolution. Increasing the depth of DNIN can help improve classification accuracy, but training becomes more difficult, learning slows down, and accuracy saturates and then degrades. This paper presents a new deep residual network in network (DrNIN) model, a deeper variant of DNIN. The model is an attractive architecture for on-chip implementations on FPGAs and can be applied to a variety of image recognition applications. It has a homogeneous, multilength architecture controlled by the hyperparameter “L”, which defines the model length. We apply the residual learning framework to DNIN and explicitly reformulate convolutional layers as residual learning functions to address the vanishing gradient problem and to facilitate and speed up learning. We provide a comprehensive study showing that DrNIN models can gain accuracy from significantly increased depth. On the CIFAR-10 dataset, we evaluate the proposed models with a depth of up to L = 5 DrMLPconv layers, 1.66x deeper than DNIN. The experimental results demonstrate the efficiency of the proposed method and show that it gives the model a greater capacity to represent features, leading to better recognition performance.
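The residual reformulation described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' DrMLPconv layer: it assumes the MLP part is a stack of 1x1 convolutions (a per-pixel MLP over channels) and that input and output have the same number of channels, so the skip connection is a plain addition. The names `mlpconv` and `residual_mlpconv` are hypothetical.

```python
import numpy as np

def relu(x):
    """Elementwise rectified linear unit."""
    return np.maximum(x, 0.0)

def mlpconv(x, weights):
    """Apply a per-pixel MLP (a stack of 1x1 convolutions) to x of shape (H, W, C).

    Each entry of `weights` has shape (C_in, C_out); a 1x1 convolution is a
    matrix product over the channel axis. ReLU follows every layer but the last.
    """
    h = x
    for i, w in enumerate(weights):
        h = h @ w
        if i < len(weights) - 1:
            h = relu(h)
    return h

def residual_mlpconv(x, weights):
    """Residual form y = x + F(x): the block learns only the correction F."""
    return x + mlpconv(x, weights)
```

With all weights set to zero the block reduces to the identity mapping, which is the property that generally makes very deep residual stacks easier to optimize: a layer that has nothing useful to add can simply pass its input through.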


DRHNet: A Deep Residual Network Based on Heterogeneous Kernel for Steganalysis

Security and Communication Networks ◽

10.1155/2020/8847741 ◽

2020 ◽

Vol 2020 ◽

pp. 1-9

Author(s):

Yang Xu ◽

Zixi Fu ◽

Guiyong Xu ◽

Sicong Zhang ◽

Xiaoyao Xie

Keyword(s):

Neural Networks ◽

Heterogeneous Network ◽

State Of The Art ◽

Training Phase ◽

Image Size ◽

Residual Network ◽

Training Time ◽

Learning Framework ◽

Residual Learning ◽

The Rich

Convolutional neural networks used for steganalysis suffer from poor versatility, long training time, and limited image size. To address these problems, we present a heterogeneous kernel residual learning framework called DRHNet (Dual Residual Heterogeneous Network), which reduces the time spent in the training phase. Instead of using the image itself as the network input, we use the rich model to extract features from the images and merge them into a feature matrix, which then serves as the actual input of the network. The proposed architecture has good versatility and reduces both the computation and the number of parameters while still achieving higher accuracy. On BOSSbase 1.01, we evaluate the performance of DRHNet in both the spatial domain and the frequency domain. The preliminary experimental results indicate that DRHNet achieves excellent steganalysis performance against state-of-the-art steganographic algorithms.


Multiprocessor transport surveillance video system based on "system on chip" technology

MORSKIE INTELLEKTUAL`NYE TEHNOLOGII ◽

10.37220/mit.2020.49.3.023 ◽

2020 ◽

Author(s):

Ш.С. Фахми ◽

Н.В. Шаталова ◽

В.В. Вислогузов ◽

Е.В. Костикова

Keyword(s):

System On Chip ◽

Reference Points ◽

Surveillance Video ◽

Video Information ◽

Chip Technology ◽

Intelligent Video Surveillance System ◽

Speed Up ◽

Algorithmic Analysis ◽

Computational Procedures ◽

On Chip

In this paper, we propose the mathematical apparatus and architecture of a multiprocessor transport system on a chip (MPTSoC). We present a hardware and software implementation of an intelligent video surveillance system based on "system on chip" technology, using a hardware accelerator for the well-known method of forming support vectors. The architecture includes complex functional blocks for video analysis based on parallel algorithms for finding image reference points, and a set of elementary processors that perform the complex computational procedures of the analysis algorithms. It was developed with design tools based on a reconfigurable system on chip, which make it possible to estimate the amount of hardware resources required. The proposed MPTSoC architecture speeds up the processing and analysis of video information when solving problems of detecting and recognizing emergencies and suspicious behavior.


Reconfigurable field‐programmable gate array‐based on‐chip learning neuromorphic digital implementation for nonlinear function approximation

International Journal of Circuit Theory and Applications ◽

10.1002/cta.3075 ◽

2021 ◽

Author(s):

Morteza Gholami ◽

Edris Zaman Farsa ◽

Gholamreza Karimi

Keyword(s):

Field Programmable Gate Array ◽

Function Approximation ◽

Nonlinear Function ◽

Digital Implementation ◽

Field Programmable ◽

Gate Array ◽

On Chip ◽

Nonlinear Function Approximation


Privacy Preserving Machine Learning with Homomorphic Encryption and Federated Learning

Future Internet ◽

10.3390/fi13040094 ◽

2021 ◽

Vol 13 (4) ◽

pp. 94

Author(s):

Haokun Fang ◽

Quan Qian

Keyword(s):

Machine Learning ◽

Homomorphic Encryption ◽

Privacy Preserving ◽

Great Success ◽

Learning Framework ◽

Computational Overhead ◽

Important Concern ◽

Speed Up ◽

Key Length ◽

Core Idea

Privacy protection has become an important concern with the great success of machine learning. This paper proposes a multi-party privacy-preserving machine learning framework, named PFMLP, based on partially homomorphic encryption and federated learning. The core idea is that all learning parties transmit only gradients encrypted with homomorphic encryption. In experiments, the model trained by PFMLP has almost the same accuracy, with a deviation of less than 1%. To address the computational overhead of homomorphic encryption, we use an improved Paillier algorithm that speeds up training by 25–28%. Comparisons of encryption key length, learning network structure, number of learning clients, and other factors are also discussed in detail in the paper.
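The idea of exchanging only encrypted gradients relies on the additive homomorphism of Paillier encryption: multiplying two ciphertexts yields a ciphertext of the sum of the plaintexts. The sketch below is a textbook Paillier scheme, not the improved variant used in the paper. The toy primes, the fixed-point `SCALE`, and all function names are assumptions made for illustration, and the key size is far too small to be secure.

```python
import math
import random

SCALE = 1000  # assumed fixed-point scale for turning float gradients into integers

def keygen(p=293, q=433):
    """Return (public modulus n, private key (lam, mu)) from two toy primes."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p - 1, q - 1)
    mu = pow(lam, -1, n)  # modular inverse; valid because the generator is n + 1
    return n, (lam, mu)

def encrypt(n, value):
    """Encrypt a float gradient; negative values wrap around modulo n."""
    m = round(value * SCALE) % n
    n2 = n * n
    while True:  # pick a random r that is coprime to n
        r = random.randrange(1, n)
        if math.gcd(r, n) == 1:
            break
    return (pow(n + 1, m, n2) * pow(r, n, n2)) % n2

def add_encrypted(n, c1, c2):
    """Homomorphic addition: the product of ciphertexts encrypts the sum."""
    return (c1 * c2) % (n * n)

def decrypt(n, private_key, c):
    """Decrypt and map the result back to a signed float."""
    lam, mu = private_key
    u = pow(c, lam, n * n)
    m = ((u - 1) // n * mu) % n
    if m > n // 2:  # values in the upper half of the range represent negatives
        m -= n
    return m / SCALE
```

In a federated setting of this kind, each client would call `encrypt` on its gradient, an aggregator would combine the ciphertexts with `add_encrypted` without ever seeing the plaintexts, and only a holder of the private key could call `decrypt` on the aggregate.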


Face Anti-spoofing using Hybrid Residual Learning Framework

2019 International Conference on Biometrics (ICB) ◽

10.1109/icb45273.2019.8987283 ◽

2019 ◽

Author(s):

Usman Muhammad ◽

Abdenour Hadid

Keyword(s):

Learning Framework ◽

Residual Learning


How to Speed-Up Fault-Tolerant Clock Generation in VLSI Systems-on-Chip via Pipelining

2010 European Dependable Computing Conference ◽

10.1109/edcc.2010.35 ◽

2010 ◽

Cited By ~ 5

Author(s):

Matthias Függer ◽

Andreas Dielacher ◽

Ulrich Schmid

Keyword(s):

Fault Tolerant ◽

Clock Generation ◽

Systems On Chip ◽

Speed Up ◽

On Chip


Spectral–Spatial Residual Network for Hyperspectral Image Classification: A 3-D Deep Learning Framework

IEEE Transactions on Geoscience and Remote Sensing ◽

10.1109/tgrs.2017.2755542 ◽

2018 ◽

Vol 56 (2) ◽

pp. 847-858 ◽

Cited By ~ 188

Author(s):

Zilong Zhong ◽

Jonathan Li ◽

Zhiming Luo ◽

Michael Chapman

Keyword(s):

Deep Learning ◽

Image Classification ◽

Hyperspectral Image ◽

Hyperspectral Image Classification ◽

Residual Network ◽

Learning Framework


Request, Coalesce, Serve, and Forget: Miss-Optimized Memory Systems for Bandwidth-Bound Cache-Unfriendly Applications on FPGAs

ACM Transactions on Reconfigurable Technology and Systems ◽

10.1145/3466823 ◽

2022 ◽

Vol 15 (2) ◽

pp. 1-33

Author(s):

Mikhail Asiatici ◽

Paolo Ienne

Keyword(s):

Large Scale ◽

Sparse Matrix ◽

Memory Systems ◽

Graph Analytics ◽

Matrix Vector Multiplication ◽

Area Reduction ◽

Cache Line ◽

Speed Up ◽

Memory Accesses ◽

On Chip

Applications such as large-scale sparse linear algebra and graph analytics are challenging to accelerate on FPGAs because their short, irregular memory accesses result in low cache hit rates. Nonblocking caches reduce the bandwidth required by misses by requesting each cache line only once, even when multiple misses correspond to it. However, such a reuse mechanism is traditionally implemented using an associative lookup, which limits the number of misses considered for reuse to a few tens at most. In this article, we present an efficient pipeline that can process and store thousands of outstanding misses in cuckoo hash tables in on-chip SRAM with minimal stalls. This brings the same bandwidth advantage as a larger cache for a fraction of the area budget, because outstanding misses do not need a data array, which can significantly speed up irregular memory-bound latency-insensitive applications. In addition, we extend nonblocking caches to generate variable-length bursts to memory, which increases the bandwidth delivered by DRAMs and their controllers. The resulting miss-optimized memory system provides up to 25% speedup with 24× area reduction on 15 large sparse matrix-vector multiplication benchmarks evaluated on an embedded and a datacenter FPGA system.
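The miss-coalescing behaviour of a nonblocking cache (requesting each cache line only once, however many misses map to it) can be sketched in software. This is a simplified illustration, not the authors' hardware pipeline: a Python dict stands in for the cuckoo hash tables, timing and stalls are not modeled, and `LINE_SIZE`, `coalesce_misses`, and `serve` are hypothetical names.

```python
LINE_SIZE = 64  # assumed cache line size in bytes

def coalesce_misses(miss_addresses, line_size=LINE_SIZE):
    """Group outstanding misses by cache line.

    Returns (memory_requests, pending), where memory_requests lists each
    missing line exactly once, in first-miss order, and pending maps a line
    number to the addresses still waiting on it.
    """
    pending = {}
    memory_requests = []
    for addr in miss_addresses:
        line = addr // line_size
        if line not in pending:
            # First miss on this line: issue a single request to memory.
            pending[line] = []
            memory_requests.append(line)
        # Later misses on the same line only record themselves as waiters.
        pending[line].append(addr)
    return memory_requests, pending

def serve(pending, line):
    """When a line returns from memory, release every miss waiting on it."""
    return pending.pop(line, [])
```

Six misses that fall on three distinct lines produce only three memory requests here, which is the bandwidth saving the abstract attributes to tracking many outstanding misses.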
