Detection of Inflatable Boats and People in Thermal Infrared with Deep Learning Methods

Marcin Łukasz Kowalski; Norbert Pałka; Jarosław Młyńczak; Mateusz Karol; Elżbieta Czerwińska; Marek Życzkowski; Wiesław Ciurapiński; Zbigniew Zawadzki; Sebastian Brawata

doi:10.3390/s21165330

Detection of Inflatable Boats and People in Thermal Infrared with Deep Learning Methods

Sensors ◽

10.3390/s21165330 ◽

2021 ◽

Vol 21 (16) ◽

pp. 5330

Author(s):

Marcin Łukasz Kowalski ◽

Norbert Pałka ◽

Jarosław Młyńczak ◽

Mateusz Karol ◽

Elżbieta Czerwińska ◽

...

Keyword(s):

Neural Networks ◽

Feature Extraction ◽

Deep Learning ◽

Processing Time ◽

Graphics Processing Unit ◽

Weather Conditions ◽

Processing Unit ◽

Long Time ◽

Financial Interests ◽

Graphics Processing

Smuggling of drugs and cigarettes in small inflatable boats across border rivers is a serious threat to the EU’s financial interests. Early detection of such threats is challenging due to difficult and changing environmental conditions. This study reports on the automatic detection of small inflatable boats and people in a rough wild terrain in the infrared thermal domain. Three acquisition campaigns were carried out during spring, summer, and fall under various weather conditions. Three deep learning algorithms, namely, YOLOv2, YOLOv3, and Faster R-CNN working with six different feature extraction neural networks were trained and evaluated in terms of performance and processing time. The best performance was achieved with Faster R-CNN with ResNet101, however, processing requires a long time and a powerful graphics processing unit.

Download Full-text

Prediksi Kelas Jamak dengan Deep Learning Berbasis Graphics Processing Units

Jurnal Kajian Ilmiah ◽

10.31599/jki.v20i1.71 ◽

2020 ◽

Vol 20 (1) ◽

pp. 67-76

Author(s):

Rahmadya Trias Handayanto ◽

Herlawati Herlawati

Keyword(s):

Neural Networks ◽

Support Vector Machine ◽

Artificial Neural Networks ◽

Deep Learning ◽

Decision Tree ◽

Graphics Processing Unit ◽

Support Vector ◽

Processing Unit ◽

Class 1 ◽

Graphics Processing

For the first time, machine learning did the classical classification process using two classes (bi-class) such as class -1 and class +1, 0 and 1, or the form of categories such as true and false. Famous methods used are Artificial Neural Networks (ANN) and Support Vector Machine (SVM). The current development was a problem with more than two classes, known as multi-class classes. For SVM sometimes the plural classes are overcome by doing a gradual process like a decision tree (DT) method. Meanwhile, ANN has experienced rapid development and is currently being developed with a large number of layers with the new activation functions, i.e. the rectified linear units (ReLu), and the probabilistic-based activation, i.e. softmax, including its optimizer methods (adam, sgd, and others). Then the term changed to Deep Learning (DL). This study aimed to compare two well-known methods (DL and SVM) in classifying multiple classes. The number of DL layers was six with the neuron composition are 128, 64, 32, 8, 4, and 3, while SVM uses a radial kernel base function with gamma and c respectively 0.7 and 5. Besides, this study intends to compare the use of the Graphics Processing Unit (GPU) available on Google Interactive Notebook (Google Colab), an online Python language programming application. The results showed that DL accuracy outperformed SVM but required large computational resources, with the accuracy for DL and SVM are 99% and 98%, respectively. However, the use of the GPU can overcome these problems and is proven to increase the speed of the process as much as 47 times. Keywords: Artificial Neural Networks, Graphics Processing Unit, Google Interactive Notebook, Rectified Linear units, Support Vector Machine. Abstrak Di awal perkembangannya mesin pembelajaran melakukan proses klasikfikasi menggunakan dua kelas (bi-class) misalnya kelas -1 dan kelas +1, 0 dan 1, atau bentuk kategori seperti benar dan salah. Metode terkenal yang digunakan adalah Jaringan Syaraf Tiruan (JST) dan Support Vector Machine (SVM). Perkembangan selanjutnya adalah problem dengan kelas yang lebih dari dua kelas, dikenal dengan istilah kelas jamak (multi-class). Untuk SVM terkadang kelas jamak diatasi dengan melakukan proses berjenjang mirip pohon keputusan (decision tree). Sementara itu JST telah mengalami perkembangan yang pesat dan saat ini sudah dikembangkan dengan jumlah layer yang banyak disertai dengan fungsi-fungsi aktivasi terkini seperti rectified linear unit (ReLu), dan softmax yang berbasis probabilistik, termasuk juga metode-metode optimizernya (adam, sgd, dan lain-lain). Kemudian istilahnya berubah menjadi Deep Learning (DL). Penelitian ini mencoba membandingkan dua metode terkenal (DL dan SVM) dalam melakukan klasifikasi kelas jamak. Jumlah layer DL sebanyak enam dengan masing-masing neuron sebesar 128, 64, 32, 8, 4, dan 3, sementara SVM menggunakan kernel radial basis function dengan gamma dan c berturut-turut 0.7 dan 5. Selain itu penelitian ini bermaksud membandingkan penggunaan Graphics Processing Unit (GPU) yang tersedia di Google Interactive Notebook (Google Colab), sebuah aplikasi online pemrograman bahasa Python. Hasil penelitian menunjukan akurasi DL unggul tipis dibanding SVM namun memerlukan sumber daya komputasi yang besar masing-masing dengan akurasi 99% dan 98%. Namun penggunaan GPU mampu mengatasi permasalahan tersebut dan terbukti meningkatkan kecepatan proses sebanyak 47 kali. Kata kunci: Jaringan Syaraf Tiruan, Graphics Processing Unit, Google Interactive Notebook, Rectified Linear units, Support Vector Machine.

Download Full-text

Deep Learning Framework for Vehicle and Pedestrian Detection in Rural Roads on an Embedded GPU

Electronics ◽

10.3390/electronics9040589 ◽

2020 ◽

Vol 9 (4) ◽

pp. 589 ◽

Cited By ~ 3

Author(s):

Luis Barba-Guaman ◽

José Eugenio Naranjo ◽

Anthony Ortiz

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Embedded System ◽

Rural Areas ◽

High Performance ◽

Processing Time ◽

Graphics Processing Unit ◽

Pedestrian Detection ◽

Processing Unit

Object detection, one of the most fundamental and challenging problems in computer vision. Nowadays some dedicated embedded systems have emerged as a powerful strategy for deliver high processing capabilities including the NVIDIA Jetson family. The aim of the present work is the recognition of objects in complex rural areas through an embedded system, as well as the verification of accuracy and processing time. For this purpose, a low power embedded Graphics Processing Unit (Jetson Nano) has been selected, which allows multiple neural networks to be run in simultaneous and a computer vision algorithm to be applied for image recognition. As well, the performance of these deep learning neural networks such as ssd-mobilenet v1 and v2, pednet, multiped and ssd-inception v2 has been tested. Moreover, it was found that the accuracy and processing time were in some cases improved when all the models suggested in the research were applied. The pednet network model provides a high performance in pedestrian recognition, however, the sdd-mobilenet v2 and ssd-inception v2 models are better at detecting other objects such as vehicles in complex scenarios.

Download Full-text

Pattern Recognition in Myoelectric Signals Using Deep Learning, Features Engineering, and a Graphics Processing Unit

IEEE Access ◽

10.1109/access.2020.3038992 ◽

2020 ◽

Vol 8 ◽

pp. 208952-208960

Author(s):

Gabriel Cirac M. Souza ◽

Robson L. Moreno ◽

Tales C. Pimenta

Keyword(s):

Pattern Recognition ◽

Deep Learning ◽

Graphics Processing Unit ◽

Processing Unit ◽

Myoelectric Signals ◽

Deep Learning Features ◽

Graphics Processing

Download Full-text

Massively Parallel Signal Processing Using the Graphics Processing Unit for Real-Time Brain-Computer Interface Feature Extraction

Frontiers in Neuroengineering ◽

10.3389/neuro.16.011.2009 ◽

2009 ◽

Vol 2 ◽

Cited By ~ 26

Author(s):

J. Adam Wilson

Keyword(s):

Signal Processing ◽

Feature Extraction ◽

Real Time ◽

Graphics Processing Unit ◽

Brain Computer Interface ◽

Massively Parallel ◽

Computer Interface ◽

Processing Unit ◽

Graphics Processing ◽

Parallel Signal Processing

Download Full-text

Deep learning image recognition enables efficient genome editing in zebrafish by automated injections

10.1101/384735 ◽

2018 ◽

Cited By ~ 2

Author(s):

Maria Lorena Cordero-Maldonado ◽

Simon Perathoner ◽

Kees-Jan van der Kolk ◽

Ralf Boland ◽

Ursula Heins-Marroquin ◽

...

Keyword(s):

Deep Learning ◽

Graphics Processing Unit ◽

Second Step ◽

Processing Unit ◽

Cell Stage ◽

Knock Out ◽

Larval Stages ◽

Learning Software ◽

Graphics Processing ◽

Automated Injection

AbstractOne of the most popular techniques in zebrafish research is microinjection, as it is a rapid and efficient way to genetically manipulate early developing embryos, and to introduce microbes or tracers at larval stages.Here we demonstrate the development of a machine learning software that allows for microinjection at a trained target site in zebrafish eggs at unprecedented speed. The software is based on the open-source deep-learning library Inception v3.In a first step, the software distinguishes wells containing embryos at one-cell stage from wells to be skipped with an accuracy of 93%. A second step was developed to pinpoint the injection site. Deep learning allows to predict this location on average within 42 µm to manually annotated sites. Using a Graphics Processing Unit (GPU), both steps together take less than 100 milliseconds. We first tested our system by injecting a morpholino into the middle of the yolk and found that the automated injection efficiency is as efficient as manual injection (~ 80%). Next, we tested both CRISPR/Cas9 and DNA construct injections into the zygote and obtained a comparable efficiency to that of an experienced experimentalist. Combined with a higher throughput, this results in a higher yield. Hence, the automated injection of CRISPR/Cas9 will allow high-throughput applications to knock out and knock in relevant genes to study their mechanisms or pathways of interest in diverse areas of biomedical research.

Download Full-text

TIME-DOMAIN INTERPOLATION ON GRAPHICS PROCESSING UNIT

Journal of Innovative Optical Health Sciences ◽

10.1142/s1793545811001277 ◽

2011 ◽

Vol 04 (01) ◽

pp. 89-95 ◽

Cited By ~ 3

Author(s):

XIQI LI ◽

GUOHUA SHI ◽

YUDONG ZHANG

Keyword(s):

Signal Processing ◽

Real Time ◽

Time Domain ◽

Processing Time ◽

Graphics Processing Unit ◽

Interpolation Method ◽

Time Signal ◽

Processing Unit ◽

Graphics Processing ◽

Sd Oct

The signal processing speed of spectral domain optical coherence tomography (SD-OCT) has become a bottleneck in a lot of medical applications. Recently, a time-domain interpolation method was proposed. This method can get better signal-to-noise ratio (SNR) but much-reduced signal processing time in SD-OCT data processing as compared with the commonly used zero-padding interpolation method. Additionally, the resampled data can be obtained by a few data and coefficients in the cutoff window. Thus, a lot of interpolations can be performed simultaneously. So, this interpolation method is suitable for parallel computing. By using graphics processing unit (GPU) and the compute unified device architecture (CUDA) program model, time-domain interpolation can be accelerated significantly. The computing capability can be achieved more than 250,000 A-lines, 200,000 A-lines, and 160,000 A-lines in a second for 2,048 pixel OCT when the cutoff length is L = 11, L = 21, and L = 31, respectively. A frame SD-OCT data (400A-lines × 2,048 pixel per line) is acquired and processed on GPU in real time. The results show that signal processing time of SD-OCT can be finished in 6.223 ms when the cutoff length L = 21, which is much faster than that on central processing unit (CPU). Real-time signal processing of acquired data can be realized.

Download Full-text

Parallel Computing, Graphics Processing Unit (GPU) and New Hardware for Deep Learning in Computational Intelligence Research

Deep Learning and Parallel Computing Environment for Bioengineering Systems ◽

10.1016/b978-0-12-816718-2.00008-7 ◽

2019 ◽

pp. 1-15 ◽

Cited By ~ 2

Author(s):

M. Madiajagan ◽

S. Sridhar Raj

Keyword(s):

Parallel Computing ◽

Deep Learning ◽

Computational Intelligence ◽

Graphics Processing Unit ◽

Processing Unit ◽

Graphics Processing

Download Full-text

Cellular Neural Networks simulation on a parallel graphics processing unit

2008 11th International Workshop on Cellular Neural Networks and Their Applications ◽

10.1109/cnna.2008.4588679 ◽

2008 ◽

Cited By ~ 7

Author(s):

Andres Fernandez ◽

Ruben San Martin ◽

Enric Farguell ◽

Giovanni Egidio Pazienza

Keyword(s):

Neural Networks ◽

Graphics Processing Unit ◽

Cellular Neural Networks ◽

Processing Unit ◽

Parallel Graphics ◽

Graphics Processing

Download Full-text

Artificial intelligence-based classification with classical Turkish music makams: Possibilities to Turkish music education

African Educational Research Journal ◽

10.30918/aerj.92.21.055 ◽

2021 ◽

Vol 9 (2) ◽

pp. 570-580

Author(s):

Mert Kayış ◽

Keyword(s):

Artificial Intelligence ◽

Feature Extraction ◽

Music Education ◽

Success Rate ◽

Classical Music ◽

Graphics Processing Unit ◽

Support Vector ◽

Processing Unit ◽

Percent Success ◽

Graphics Processing

Makams of Classical Turkish Music have been tried to be classified through various studies for the past years. Significant differences of opinion have emerged in the classification process of the makams in Music Education and Literacy from past to present. This situation creates problems in learning the makams related to music education and recognizing the makams heard. Additionally, there are uncertainties in the classification of the makam genre of the song, as individual mistakes were made while notating the musical notes. Apart from that, this situation constitutes a problem not only for the ones studying Turkish Classical Music but also for the ones interested in this certain type of Music. Therefore, the objective of the research is to contribute to the makam classification in Classical Turkish Music Education by developing an MIR system that determines the makam of the songs. Theoretically, we can extract the properties of sound signals with Time Wavelet Scattering Feature Extraction, classify them with SVM and distinguish between types of makams. In this study, upon eight different Makams, a Musical Information Retrieval system has been created via the Artificial Intelligence (AI) method of Support Vector Machines (SVM) and Time Wavelet Scattering Feature Extraction and through using a Graphics Processing Unit (GPU) accelerator for the sake of feature extraction. We performed the classification process by modeling it on the MATLAB program. The study's success rate was identified as 98.21% and it acquired a higher success rate compared to the other studies in the literature. After completing the classification procedure, the Makams were identified by sending samples belonging to different sound files from the system consisting of a database belonging to eight different Makams. In our study, the classification and detection processes were realized with nearly a hundred percent success. The difficulties encountered in classifying the makams in Classical Turkish Music mentioned above, with the application of artificial intelligence, the classification difficulty of individuals who have received this type of training or are interested in this subject has been overcome.

Download Full-text

True 4D Image Denoising on the GPU

International Journal of Biomedical Imaging ◽

10.1155/2011/952819 ◽

2011 ◽

Vol 2011 ◽

pp. 1-16 ◽

Cited By ~ 14

Author(s):

Anders Eklund ◽

Mats Andersson ◽

Hans Knutsson

Keyword(s):

Image Denoising ◽

Processing Time ◽

Graphics Processing Unit ◽

Spatial Filtering ◽

Processing Unit ◽

Clinical Value ◽

Ct Data ◽

Graphics Processing ◽

Common Application ◽

Work Done

The use of image denoising techniques is an important part of many medical imaging applications. One common application is to improve the image quality of low-dose (noisy) computed tomography (CT) data. While 3D image denoising previously has been applied to several volumes independently, there has not been much work done on true 4D image denoising, where the algorithm considers several volumes at the same time. The problem with 4D image denoising, compared to 2D and 3D denoising, is that the computational complexity increases exponentially. In this paper we describe a novel algorithm for true 4D image denoising, based on local adaptive filtering, and how to implement it on the graphics processing unit (GPU). The algorithm was applied to a 4D CT heart dataset of the resolution 512 × 512 × 445 × 20. The result is that the GPU can complete the denoising in about 25 minutes if spatial filtering is used and in about 8 minutes if FFT-based filtering is used. The CPU implementation requires several days of processing time for spatial filtering and about 50 minutes for FFT-based filtering. The short processing time increases the clinical value of true 4D image denoising significantly.

Download Full-text