A YOLOv2 Convolutional Neural Network-Based Human–Machine Interface for the Control of Assistive Robotic Manipulators

Gianluca Giuffrida; Gabriele Meoni; Luca Fanucci

doi:10.3390/app9112243

A YOLOv2 Convolutional Neural Network-Based Human–Machine Interface for the Control of Assistive Robotic Manipulators

Applied Sciences ◽

10.3390/app9112243 ◽

2019 ◽

Vol 9 (11) ◽

pp. 2243

Author(s):

Gianluca Giuffrida ◽

Gabriele Meoni ◽

Luca Fanucci

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Low Cost ◽

Test Procedure ◽

Target Position ◽

Video Stream ◽

Human Machine Interface ◽

Position Information ◽

Machine Interface ◽

Feedback Algorithm

During the last years, the mobility of people with upper limb disabilities and constrained on power wheelchairs is empowered by robotic arms. Nowadays, even though modern manipulators offer a high number of functionalities, some users cannot exploit all those potentialities due to their reduced manual skills, even if capable of driving the wheelchair by means of proper Human–Machine Interface (HMI). Owing to that, this work proposes a low-cost manipulator realizing only simple tasks and controllable by three different graphical HMI. The latter are empowered using a You Only Look Once (YOLO) v2 Convolutional Neural Network that analyzes the video stream generated by a camera placed on the robotic arm end-effector and recognizes the objects with which the user can interact. Such objects are shown to the user in the HMI surrounded by a bounding box. When the user selects one of the recognized objects, the target position information is exploited by an automatic close-feedback algorithm which leads the manipulator to automatically perform the desired task. A test procedure showed that the accuracy in reaching the desired target is 78%. The produced HMIs were appreciated by different user categories, obtaining a mean score of 8.13/10.

Download Full-text

Voice Keyword Recognition Based on Spiking Convolutional Neural Network for Human-Machine Interface

2020 3rd International Conference on Intelligent Autonomous Systems (ICoIAS) ◽

10.1109/icoias49312.2020.9081859 ◽

2020 ◽

Author(s):

Jinhai Hu ◽

Wang Ling Goh ◽

Zhongyi Zhang ◽

Yuan Gao

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Human Machine Interface ◽

Machine Interface ◽

Keyword Recognition

Download Full-text

End-to-end convolutional neural network enables COVID-19 detection from breath and cough audio: a pilot study

BMJ Innovations ◽

10.1136/bmjinnov-2021-000668 ◽

2021 ◽

Vol 7 (2) ◽

pp. 356-362

Author(s):

Harry Coppock ◽

Alex Gaskell ◽

Panagiotis Tzirakis ◽

Alice Baird ◽

Lyn Jones ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Low Cost ◽

Area Under The Curve ◽

Alternative Form ◽

Economic Damage ◽

Audio Signals ◽

Operating Characteristics ◽

Data Set ◽

Empirical Performance

BackgroundSince the emergence of COVID-19 in December 2019, multidisciplinary research teams have wrestled with how best to control the pandemic in light of its considerable physical, psychological and economic damage. Mass testing has been advocated as a potential remedy; however, mass testing using physical tests is a costly and hard-to-scale solution.MethodsThis study demonstrates the feasibility of an alternative form of COVID-19 detection, harnessing digital technology through the use of audio biomarkers and deep learning. Specifically, we show that a deep neural network based model can be trained to detect symptomatic and asymptomatic COVID-19 cases using breath and cough audio recordings.ResultsOur model, a custom convolutional neural network, demonstrates strong empirical performance on a data set consisting of 355 crowdsourced participants, achieving an area under the curve of the receiver operating characteristics of 0.846 on the task of COVID-19 classification.ConclusionThis study offers a proof of concept for diagnosing COVID-19 using cough and breath audio signals and motivates a comprehensive follow-up research study on a wider data sample, given the evident advantages of a low-cost, highly scalable digital COVID-19 diagnostic tool.

Download Full-text

Design of Human Machine Interface Based on ARM and Linux

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.241-244.2714 ◽

2012 ◽

Vol 241-244 ◽

pp. 2714-2717

Author(s):

Kun Zhang ◽

Xi Wei Peng

Keyword(s):

Low Cost ◽

Hardware Design ◽

Control Unit ◽

Embedded Linux ◽

Human Machine Interface ◽

Device Drivers ◽

Machine Interface ◽

Micro Control Unit

In order to provide more convenient options for users and developers, the design of Human Machine Interface (HMI) based on ARM and embedded Linux is put forward. It makes full use of multiple peripherals of ARM and flexibility of Linux OS. Firstly, hardware design of the HMI system is presented. Then methods of embedded Linux transplanting and the device drivers programming are discussed. Finally, running results and applications of the designed HMI are considered. The design combines the features of traditional HMI and Micro Control Unit (MCU) HMI, including low cost, rich interfaces and easy programming.

Download Full-text

BengaliNet: A Low-Cost Novel Convolutional Neural Network for Bengali Handwritten Characters Recognition

Applied Sciences ◽

10.3390/app11156845 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6845

Author(s):

Abu Sayeed ◽

Jungpil Shin ◽

Md. Al Mehedi Hasan ◽

Azmain Yakin Srizon ◽

Md. Mehedi Hasan

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Architecture ◽

Character Recognition ◽

High Performance ◽

Low Cost ◽

Traditional Learning ◽

Neural Network Architecture ◽

Handwritten Character Recognition ◽

Handwritten Character

As it is the seventh most-spoken language and fifth most-spoken native language in the world, the domain of Bengali handwritten character recognition has fascinated researchers for decades. Although other popular languages i.e., English, Chinese, Hindi, Spanish, etc. have received many contributions in the area of handwritten character recognition, Bengali has not received many noteworthy contributions in this domain because of the complex curvatures and similar writing fashions of Bengali characters. Previously, studies were conducted by using different approaches based on traditional learning, and deep learning. In this research, we proposed a low-cost novel convolutional neural network architecture for the recognition of Bengali characters with only 2.24 to 2.43 million parameters based on the number of output classes. We considered 8 different formations of CMATERdb datasets based on previous studies for the training phase. With experimental analysis, we showed that our proposed system outperformed previous works by a noteworthy margin for all 8 datasets. Moreover, we tested our trained models on other available Bengali characters datasets such as Ekush, BanglaLekha, and NumtaDB datasets. Our proposed architecture achieved 96–99% overall accuracies for these datasets as well. We believe our contributions will be beneficial for developing an automated high-performance recognition tool for Bengali handwritten characters.

Download Full-text

Binary and Multiclass Text Classification by Means of Separable Convolutional Neural Network

Inventions ◽

10.3390/inventions6040070 ◽

2021 ◽

Vol 6 (4) ◽

pp. 70

Author(s):

Elena Solovyeva ◽

Ali Abdullah

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Recurrent Neural Networks ◽

Low Cost ◽

Computational Cost ◽

High Accuracy ◽

Activation Functions ◽

Fully Connected ◽

Fully Connected Networks

In this paper, the structure of a separable convolutional neural network that consists of an embedding layer, separable convolutional layers, convolutional layer and global average pooling is represented for binary and multiclass text classifications. The advantage of the proposed structure is the absence of multiple fully connected layers, which is used to increase the classification accuracy but raises the computational cost. The combination of low-cost separable convolutional layers and a convolutional layer is proposed to gain high accuracy and, simultaneously, to reduce the complexity of neural classifiers. Advantages are demonstrated at binary and multiclass classifications of written texts by means of the proposed networks under the sigmoid and Softmax activation functions in convolutional layer. At binary and multiclass classifications, the accuracy obtained by separable convolutional neural networks is higher in comparison with some investigated types of recurrent neural networks and fully connected networks.

Download Full-text

Optimizing Convolutional Neural Network Accelerator on Low-Cost FPGA

Journal of Circuits System and Computers ◽

10.1142/s0218126621501930 ◽

2021 ◽

pp. 2150193

Author(s):

Truong Quang Vinh ◽

Dinh Viet Hai

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Low Cost ◽

Optimal Number ◽

Data Reuse ◽

Logic Element ◽

Input Buffer ◽

Classification Tasks ◽

Processing Engine ◽

Better Than

Convolutional neural network (CNN) is one of the most promising algorithms that outweighs other traditional methods in terms of accuracy in classification tasks. However, several CNNs, such as VGG, demand a huge computation in convolutional layers. Many accelerators implemented on powerful FPGAs have been introduced to address the problems. In this paper, we present a VGG-based accelerator which is optimized for a low-cost FPGA. In order to optimize the FPGA resource of logic element and memory, we propose a dedicated input buffer that maximizes the data reuse. In addition, we design a low resource processing engine with the optimal number of Multiply Accumulate (MAC) units. In the experiments, we use VGG16 model for inference to evaluate the performance of our accelerator and achieve a throughput of 38.8[Formula: see text]GOPS at a clock speed of 150[Formula: see text]MHz on Intel Cyclone V SX SoC. The experimental results show that our design is better than previous works in terms of resource efficiency.

Download Full-text

Low-cost sensor-integrated 3D-printed personalized prosthetic hands for children with amniotic band syndrome: A case study in sensing pressure distribution on an anatomical human-machine interface (AHMI) using 3D-printed conformal electrode arrays

PLoS ONE ◽

10.1371/journal.pone.0214120 ◽

2019 ◽

Vol 14 (3) ◽

pp. e0214120 ◽

Cited By ~ 8

Author(s):

Yuxin Tong ◽

Ezgi Kucukdeger ◽

Justin Halper ◽

Ellen Cesewski ◽

Elena Karakozoff ◽

...

Keyword(s):

Pressure Distribution ◽

Low Cost ◽

Human Machine Interface ◽

Electrode Arrays ◽

Amniotic Band ◽

Prosthetic Hands ◽

Amniotic Band Syndrome ◽

3D Printed ◽

Machine Interface

Download Full-text

Multi-layer neural network classification of tongue movement ear pressure signal for human machine interface

2010 13th International Conference on Computer and Information Technology (ICCIT) ◽

10.1109/iccitechn.2010.5723896 ◽

2010 ◽

Cited By ~ 2

Author(s):

Khondaker A. Mamun ◽

Manoj Banik ◽

Michael Mace ◽

Mark E. Lutmen ◽

Ravi Vaidyanathan ◽

...

Keyword(s):

Neural Network ◽

Human Machine Interface ◽

Tongue Movement ◽

Pressure Signal ◽

Neural Network Classification ◽

Machine Interface

Download Full-text

Diagnosis of left ventricular hypertrophybased on convolutional neural network

10.21203/rs.2.19295/v1 ◽

2019 ◽

Author(s):

Zini Jian ◽

Xianpei Wang ◽

Jingzhe Zhang ◽

Xinyu Wang ◽

Youbin Deng

Keyword(s):

Neural Network ◽

Left Ventricle ◽

Convolutional Neural Network ◽

Posterior Wall ◽

Video Stream ◽

Detection Algorithm ◽

Left Ventricular ◽

Diagnostic Significance ◽

Design Algorithm ◽

Actual Measurement

Abstract Background: Clinically, doctors obtain the left ventricular posterior wall thickness (LVPWT) mainly by observing ultrasonic echocardiographic video stream to capture a single frame of images with diagnostic significance, and then mark two key points on both sides of the posterior wall of the left ventricle with their own experience for computer measurement. In the actual measurement, the doctor's selection point is subjective, which is not only time-consuming and laborious, but also difficult to accurately locate the edge, which will bring errors to the measurement results. Methods: In this paper, a convolutional neural network model of left ventricular posterior wall positioning was built under the TensorFlow framework, and the target region images were obtained after the positioning results were processed by non-local mean filtering and opening operation. Then the edge detection algorithm based on threshold segmentation is used. After the contour was extracted by adjusting the segmentation threshold through prior analysis and the OTSU algorithm, the design algorithm completed the computer selection point measurement of the thickness of the posterior wall of the left ventricle. Results: The proposed method can effectively extract the left ventricular posterior wall contour and measure its thickness. The experimental results show that the relative error between the measurement result and the hospital measurement value is less than 15%, which is less than 20% of the acceptable repeatability error in clinical practice. Conclusions: Therefore, the method proposed in this paper not only has the advantage of less manual intervention, but also can reduce the workload of doctors.

Download Full-text

ON SELECTING IMAGES FROM AN UNAIMED VIDEO STREAM FOR PHOTOGRAMMETRIC MODELLING

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-v-2-2020-389-2020 ◽

2020 ◽

Vol V-2-2020 ◽

pp. 389-394

Author(s):

P. Rönnholm ◽

M. T. Vaaja ◽

H. Kauhanen ◽

T. Klockars

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

3D Models ◽

Video Stream ◽

Test Case ◽

Data Sets ◽

Image Block ◽

Imaging Geometry ◽

Good Imaging ◽

Voxel Representation

Abstract. In this paper, we illustrate how convolutional neural networks and voxel-based processing together with voxel visualizations can be utilized for the selection of unaimed images for a photogrammetric image block. Our research included the detection of an ear from images with a convolutional neural network, computation of image orientations with a structure-from-motion algorithm, visualization of camera locations in a voxel representation to detect the goodness of the imaging geometry, rejection of unnecessary images with an XYZ buffer, the creation of 3D models in two different example cases, and the comparison of resulting 3D models. Two test data sets were taken of an ear with the video recorder of a mobile phone. In the first test case, a special emphasis was taken to ensure good imaging geometry. On the contrary, in the second test case the trajectory was limited to approximately horizontal movement, leading to poor imaging geometry. A convolutional neural network together with an XYZ buffer managed to select a useful set of images for the photogrammetric 3D measuring phase. The voxel representation well illustrated the imaging geometry and has potential for early detection where data is suitable for photogrammetric modelling. The comparison of 3D models revealed that the model from poor imaging geometry was noisy and flattened. The results emphasize the importance of good imaging geometry.

Download Full-text