Statistic Experience Based Adaptive One-Shot Detector (EAO) for Camera Sensing System

Xiaoning Zhu; Bojian Ding; Qingyue Meng; Lize Gu; Yixian Yang

doi:10.3390/s18093041

Statistic Experience Based Adaptive One-Shot Detector (EAO) for Camera Sensing System

Sensors ◽

10.3390/s18093041 ◽

2018 ◽

Vol 18 (9) ◽

pp. 3041

Author(s):

Xiaoning Zhu ◽

Bojian Ding ◽

Qingyue Meng ◽

Lize Gu ◽

Yixian Yang

Keyword(s):

Neural Network ◽

Image Processing ◽

Object Recognition ◽

Processing Speed ◽

Spectral Clustering ◽

State Of The Art ◽

Sensing System ◽

Proposed Model ◽

Working Performance ◽

Effectiveness And Efficiency

Object detection in a camera sensing system has been addressed by researchers in the field of image processing. Highly-developed techniques provide researchers with great opportunities to recognize objects by applying different algorithms. This paper proposes an object recognition model, named Statistic Experience-based Adaptive One-shot Detector (EAO), based on convolutional neural network. The proposed model makes use of spectral clustering to make detection dataset, generates prior boxes for object bounding and assigns prior boxes based on multi-resolution. The model is constructed and trained for improving the detection precision and the processing speed. Experiments are conducted on classical images datasets while the results demonstrate the superiority of EAO in terms of effectiveness and efficiency. Working performance of the EAO is verified by comparing it to several state-of-the-art approaches, which makes it a promising method for the development of the camera sensing technique.

Download Full-text

Fully Symmetric Convolutional Network for Effective Image Denoising

Applied Sciences ◽

10.3390/app9040778 ◽

2019 ◽

Vol 9 (4) ◽

pp. 778 ◽

Cited By ~ 3

Author(s):

Steffi Priyanka ◽

Yuan-Kai Wang

Keyword(s):

Neural Network ◽

Image Processing ◽

Image Denoising ◽

Gpu Computing ◽

State Of The Art ◽

Large Data ◽

Convolutional Network ◽

Proposed Model ◽

Feature Extractor ◽

A Chain

Neural-network-based image denoising is one of the promising approaches to deal with problems in image processing. In this work, a deep fully symmetric convolutional–deconvolutional neural network (FSCN) is proposed for image denoising. The proposed model comprises a novel architecture with a chain of successive symmetric convolutional–deconvolutional layers. This framework learns convolutional–deconvolutional mappings from corrupted images to the clean ones in an end-to-end fashion without using image priors. The convolutional layers act as feature extractor to encode primary components of the image contents while eliminating corruptions, and the deconvolutional layers then decode the image abstractions to recover the image content details. An adaptive moment optimizer is used to minimize the reconstruction loss as it is appropriate for large data and noisy images. Extensive experiments were conducted for image denoising to evaluate the FSCN model against the existing state-of-the-art denoising algorithms. The results show that the proposed model achieves superior denoising, both qualitatively and quantitatively. This work also presents the efficient implementation of the FSCN model by using GPU computing which makes it easy and attractive for practical denoising applications.

Download Full-text

Proposed Model for an Expert System for Diagnosing Degenerative Diseases – Using Digital Image Processing with Neural Network

Information and Communication Technology for Intelligent Systems (ICTIS 2017) - Volume 1 - Smart Innovation, Systems and Technologies ◽

10.1007/978-3-319-63673-3_8 ◽

2017 ◽

pp. 68-73

Author(s):

Mittal N. Desai ◽

Vishal Dahiya ◽

A. K. Singh

Keyword(s):

Neural Network ◽

Image Processing ◽

Expert System ◽

Digital Image Processing ◽

Digital Image ◽

Degenerative Diseases ◽

Proposed Model

Download Full-text

Enhancing the performance of cancer text classification model based on cancer hallmarks

IAES International Journal of Artificial Intelligence (IJ-AI) ◽

10.11591/ijai.v10.i2.pp316-323 ◽

2021 ◽

Vol 10 (2) ◽

pp. 316

Author(s):

Noha Ali ◽

Ahmed H. AbuEl-Atta ◽

Hala H. Zayed

Keyword(s):

Neural Network ◽

Language Processing ◽

Text Classification ◽

State Of The Art ◽

Classification Model ◽

Biomedical Text ◽

Cancer Hallmarks ◽

Embedding Technique ◽

Proposed Model ◽

Biomedical Text Classification

Deep learning (DL) algorithms achieved state-of-the-art performance in computer vision, speech recognition, and natural language processing (NLP). In this paper, we enhance the convolutional neural network (CNN) algorithm to classify cancer articles according to cancer hallmarks. The model implements a recent word embedding technique in the embedding layer. This technique uses the concept of distributed phrase representation and multi-word phrases embedding. The proposed model enhances the performance of the existing model used for biomedical text classification. The result of the proposed model overcomes the previous model by achieving an F-score equal to 83.87% using an unsupervised technique that trained on PubMed abstracts called PMC vectors (PMCVec) embedding. Also, we made another experiment on the same dataset using the recurrent neural network (RNN) algorithm with two different word embeddings Google news and PMCVec which achieving F-score equal to 74.9% and 76.26%, respectively.

Download Full-text

Relational Graph Neural Network with Hierarchical Attention for Knowledge Graph Completion

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6508 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9612-9619

Author(s):

Zhao Zhang ◽

Fuzhen Zhuang ◽

Hengshu Zhu ◽

Zhiping Shi ◽

Hui Xiong ◽

...

Keyword(s):

Neural Network ◽

Missing Values ◽

State Of The Art ◽

Attention Mechanism ◽

Knowledge Graph ◽

Incomplete Knowledge ◽

Neighborhood Information ◽

Local Neighborhood ◽

Proposed Model ◽

Knowledge Graphs

The rapid proliferation of knowledge graphs (KGs) has changed the paradigm for various AI-related applications. Despite their large sizes, modern KGs are far from complete and comprehensive. This has motivated the research in knowledge graph completion (KGC), which aims to infer missing values in incomplete knowledge triples. However, most existing KGC models treat the triples in KGs independently without leveraging the inherent and valuable information from the local neighborhood surrounding an entity. To this end, we propose a Relational Graph neural network with Hierarchical ATtention (RGHAT) for the KGC task. The proposed model is equipped with a two-level attention mechanism: (i) the first level is the relation-level attention, which is inspired by the intuition that different relations have different weights for indicating an entity; (ii) the second level is the entity-level attention, which enables our model to highlight the importance of different neighboring entities under the same relation. The hierarchical attention mechanism makes our model more effective to utilize the neighborhood information of an entity. Finally, we extensively validate the superiority of RGHAT against various state-of-the-art baselines.

Download Full-text

Graph Dilated Network with Rejection Mechanism

Applied Sciences ◽

10.3390/app10072421 ◽

2020 ◽

Vol 10 (7) ◽

pp. 2421

Author(s):

Bencheng Yan ◽

Chaokun Wang ◽

Gaoyang Guo

Keyword(s):

Neural Network ◽

State Of The Art ◽

Great Success ◽

Convolution Kernel ◽

Graph Structure ◽

Proposed Model ◽

Rejection Mechanism ◽

Laplacian Smoothing ◽

Graph Neural Networks ◽

High Level

Recently, graph neural networks (GNNs) have achieved great success in dealing with graph-based data. The basic idea of GNNs is iteratively aggregating the information from neighbors, which is a special form of Laplacian smoothing. However, most of GNNs fall into the over-smoothing problem, i.e., when the model goes deeper, the learned representations become indistinguishable. This reflects the inability of the current GNNs to explore the global graph structure. In this paper, we propose a novel graph neural network to address this problem. A rejection mechanism is designed to address the over-smoothing problem, and a dilated graph convolution kernel is presented to capture the high-level graph structure. A number of experimental results demonstrate that the proposed model outperforms the state-of-the-art GNNs, and can effectively overcome the over-smoothing problem.

Download Full-text

Sky Monitoring System for Flying Object Detection Using 4K Resolution Camera

Sensors ◽

10.3390/s20247071 ◽

2020 ◽

Vol 20 (24) ◽

pp. 7071

Author(s):

Takehiro Kashiyama ◽

Hideaki Sobue ◽

Yoshihide Sekimoto

Keyword(s):

Neural Network ◽

Image Processing ◽

Monitoring System ◽

State Of The Art ◽

Weather Conditions ◽

Network Technology ◽

Real Time Processing ◽

Time Processing ◽

High Rise ◽

Neural Network Technology

The use of drones and other unmanned aerial vehicles has expanded rapidly in recent years. These devices are expected to enter practical use in various fields, such as taking measurements through aerial photography and transporting small and lightweight objects. Simultaneously, concerns over these devices being misused for terrorism or other criminal activities have increased. In response, several sensor systems have been developed to monitor drone flights. In particular, with the recent progress of deep neural network technology, the monitoring of systems using image processing has been proposed. This study developed a monitoring system for flying objects using a 4K camera and a state-of-the-art convolutional neural network model to achieve real-time processing. We installed a monitoring system in a high-rise building in an urban area during this study and evaluated the precision with which it could detect flying objects at different distances under different weather conditions. The results obtained provide important information for determining the accuracy of monitoring systems with image processing in practice.

Download Full-text

A modified Bayesian Convolutional Neural Network for Breast Histopathology Image Classification and Uncertainty Quantification

10.31224/osf.io/5xf8c ◽

2020 ◽

Author(s):

Pushkar Khairnar ◽

Ponkrshnan Thiagarajan ◽

Susanta Ghosh

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Uncertainty Quantification ◽

State Of The Art ◽

False Negative ◽

Activation Function ◽

Ensemble Averaging ◽

Proposed Model ◽

Improve Accuracy ◽

Histopathological Images

Convolutional neural network (CNN) based classification models have been successfully used on histopathological images for the detection of diseases. Despite its success, CNN may yield erroneous or overfitted results when the data is not sufficiently large or is biased. To overcome these limitations of CNN and to provide uncertainty quantification Bayesian CNN is recently proposed. However, we show that Bayesian-CNN still suffers from inaccuracies, especially in negative predictions. In the present work, we extend the Bayesian-CNN to improve accuracy and the rate of convergence. The proposed model is called modified Bayesian-CNN. The novelty of the proposed model lies in an adaptive activation function that contains a learnable parameter for each of the neurons. This adaptive activation function dynamically changes the loss function thereby providing faster convergence and better accuracy. The uncertainties associated with the predictions are obtained since the model learns a probability distribution on the network parameters. It reduces overfitting through an ensemble averaging over networks, which in turn improves accuracy on the unknown data. The proposed model demonstrates significant improvement by nearly eliminating overfitting and remarkably reducing (about 38%) the number of false-negative predictions. We found that the proposed model predicts higher uncertainty for images having features of both the classes. The uncertainty in the predictions of individual images can be used to decide when further human-expert intervention is needed. These findings have the potential to advance the state-of-the-art machine learning-based automatic classification for histopathological images.

Download Full-text

Using high-performance deep learning platform to accelerate object detection

Information Technology and Nanotechnology ◽

10.18287/1613-0073-2019-2416-354-360 ◽

2019 ◽

pp. 354-360 ◽

Cited By ~ 1

Author(s):

S O Stepanenko ◽

P Y Yakimov

Keyword(s):

Neural Network ◽

Neural Networks ◽

Image Processing ◽

Processing Speed ◽

High Performance ◽

Object Classification ◽

High Accuracy ◽

Learning Platform ◽

The Neural Network

Object classification with use of neural networks is extremely current today. YOLO is one of the most often used frameworks for object classification. It produces high accuracy but the processing speed is not high enough especially in conditions of limited performance of a computer. This article researches use of a framework called NVIDIA TensorRT to optimize YOLO with the aim of increasing the image processing speed. Saving efficiency and quality of the neural network work TensorRT allows us to increase the processing speed using an optimization of the architecture and an optimization of calculations on a GPU.

Download Full-text

Domain Adaptation and Domain Generalization with Representation Learning

10.26686/wgtn.17014700 ◽

2021 ◽

Author(s):

◽

Muhammad Ghifary

Keyword(s):

Neural Network ◽

Object Recognition ◽

Domain Adaptation ◽

State Of The Art ◽

Representation Learning ◽

Training Data ◽

Data Representations ◽

Source Data ◽

Target Environment ◽

Target Data

Machine learning has achieved great successes in the area of computer vision, especially in object recognition or classification. One of the core factors of the successes is the availability of massive labeled image or video data for training, collected manually by human. Labeling source training data, however, can be expensive and time consuming. Furthermore, a large amount of labeled source data may not always guarantee traditional machine learning techniques to generalize well; there is a potential bias or mismatch in the data, i.e., the training data do not represent the target environment. To mitigate the above dataset bias/mismatch, one can consider domain adaptation: utilizing labeled training data and unlabeled target data to develop a well-performing classifier on the target environment. In some cases, however, the unlabeled target data are nonexistent, but multiple labeled sources of data exist. Such situations can be addressed by domain generalization: using multiple source training sets to produce a classifier that generalizes on the unseen target domain. Although several domain adaptation and generalization approaches have been proposed, the domain mismatch in object recognition remains a challenging, open problem – the model performance has yet reached to a satisfactory level in real world applications. The overall goal of this thesis is to progress towards solving dataset bias in visual object recognition through representation learning in the context of domain adaptation and domain generalization. Representation learning is concerned with finding proper data representations or features via learning rather than via engineering by human experts. This thesis proposes several representation learning solutions based on deep learning and kernel methods. This thesis introduces a robust-to-noise deep neural network for handwritten digit classification trained on “clean” images only, which we name Deep Hybrid Network (DHN). DHNs are based on a particular combination of sparse autoencoders and restricted Boltzmann machines. The results show that DHN performs better than the standard deep neural network in recognizing digits with Gaussian and impulse noise, block and border occlusions. This thesis proposes the Domain Adaptive Neural Network (DaNN), a neural network based domain adaptation algorithm that minimizes the classification error and the domain discrepancy between the source and target data representations. The experiments show the competitiveness of DaNN against several state-of-the-art methods on a benchmark object dataset. This thesis develops the Multi-task Autoencoder (MTAE), a domain generalization algorithm based on autoencoders trained via multi-task learning. MTAE learns to transform the original image into its analogs in multiple related domains simultaneously. The results show that the MTAE’s representations provide better classification performance than some alternative autoencoder-based models as well as the current state-of-the-art domain generalization algorithms. This thesis proposes a fast kernel-based representation learning algorithm for both domain adaptation and domain generalization, Scatter Component Analysis (SCA). SCA finds a data representation that trades between maximizing the separability of classes, minimizing the mismatch between domains, and maximizing the separability of the whole data points. The results show that SCA performs much faster than some competitive algorithms, while providing state-of-the-art accuracy in both domain adaptation and domain generalization. Finally, this thesis presents the Deep Reconstruction-Classification Network (DRCN), a deep convolutional network for domain adaptation. DRCN learns to classify labeled source data and also to reconstruct unlabeled target data via a shared encoding representation. The results show that DRCN provides competitive or better performance than the prior state-of-the-art model on several cross-domain object datasets.

Download Full-text

Knowledge Transfer for Out-of-Knowledge-Base Entities : A Graph Neural Network Approach

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/250 ◽

2017 ◽

Cited By ~ 26

Author(s):

Takuo Hamaguchi ◽

Hidekazu Oiwa ◽

Masashi Shimbo ◽

Yuji Matsumoto

Keyword(s):

Neural Network ◽

Knowledge Base ◽

State Of The Art ◽

Test Time ◽

Network Approach ◽

Missing Information ◽

Neural Network Approach ◽

Training Time ◽

Proposed Model ◽

Graph Neural Networks

Knowledge base completion (KBC) aims to predict missing information in a knowledge base. In this paper, we address the out-of-knowledge-base (OOKB) entity problem in KBC: how to answer queries concerning test entities not observed at training time. Existing embedding-based KBC models assume that all test entities are available at training time, making it unclear how to obtain embeddings for new entities without costly retraining. To solve the OOKB entity problem without retraining, we use graph neural networks (Graph-NNs) to compute the embeddings of OOKB entities, exploiting the limited auxiliary knowledge provided at test time. The experimental results show the effectiveness of our proposed model in the OOKB setting. Additionally, in the standard KBC setting in which OOKB entities are not involved, our model achieves state-of-the-art performance on the WordNet dataset.

Download Full-text