Design and Validation of a Portable Machine Learning-Based Electronic Nose

Yixu Huang; Iyll-Joon Doh; Euiwon Bae

doi:10.3390/s21113923

Design and Validation of a Portable Machine Learning-Based Electronic Nose

Sensors ◽

10.3390/s21113923 ◽

2021 ◽

Vol 21 (11) ◽

pp. 3923

Author(s):

Yixu Huang ◽

Iyll-Joon Doh ◽

Euiwon Bae

Keyword(s):

Machine Learning ◽

Metal Oxide ◽

Electronic Nose ◽

Gas Sensors ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Support Vector ◽

Metal Oxide Sensors ◽

Metal Oxide Gas Sensors

Volatile organic compounds (VOCs) are chemicals emitted by various groups, such as foods, bacteria, and plants. While there are specific pathways and biological features significantly related to such VOCs, detection of these is achieved mostly by human odor testing or high-end methods such as gas chromatography–mass spectrometry that can analyze the gaseous component. However, odor characterization can be quite helpful in the rapid classification of some samples in sufficient concentrations. Lower-cost metal-oxide gas sensors have the potential to allow the same type of detection with less training required. Here, we report a portable, battery-powered electronic nose system that utilizes multiple metal-oxide gas sensors and machine learning algorithms to detect and classify VOCs. An in-house circuit was designed with ten metal-oxide sensors and voltage dividers; an STM32 microcontroller was used for data acquisition with 12-bit analog-to-digital conversion. For classification of target samples, a supervised machine learning algorithm such as support vector machine (SVM) was applied to classify the VOCs based on the measurement results. The coefficient of variation (standard deviation divided by mean) of 8 of the 10 sensors stayed below 10%, indicating the excellent repeatability of these sensors. As a proof of concept, four different types of wine samples and three different oil samples were classified, and the training model reported 100% and 98% accuracy based on the confusion matrix analysis, respectively. When the trained model was challenged against new sets of data, sensitivity and specificity of 98.5% and 98.6% were achieved for the wine test and 96.3% and 93.3% for the oil test, respectively, when the SVM classifier was used. These results suggest that the metal-oxide sensors are suitable for usage in food authentication applications.

Download Full-text

Classification of Sentiment of Reviews using Supervised Machine Learning Techniques

International Journal of Rough Sets and Data Analysis ◽

10.4018/ijrsda.2017010104 ◽

2017 ◽

Vol 4 (1) ◽

pp. 56-74 ◽

Cited By ~ 14

Author(s):

Abinash Tripathy ◽

Santanu Kumar Rath

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Performance Parameters ◽

Linear Discriminant ◽

Learning Techniques

Sentiment analysis helps to determine hidden intention of the concerned author of any topic and provides an evaluation report on the polarity of any document. The polarity may be positive, negative or neutral. It is observed that very often the data associated with the sentiment analysis consist of the feedback given by various specialists on any topic or product. Thus, the review may be categorized properly into any sort of class based on the polarity, in order to have a good knowledge about the product. This article proposes an approach to classify the review dataset made on basis of sentiment analysis into different polarity groups. Four machine learning algorithms viz., Naive Bayes (NB), Support Vector Machine (SVM), Random Forest, and Linear Discriminant Analysis (LDA) have been considered in this paper for classification process. The obtained result on values of accuracy of the algorithms are critically examined by using different performance parameters, applied on two different datasets.

Download Full-text

Classification of Sentiment of Reviews using Supervised Machine Learning Techniques

Cognitive Analytics ◽

10.4018/978-1-7998-2460-2.ch009 ◽

2020 ◽

pp. 143-163

Author(s):

Abinash Tripathy ◽

Santanu Kumar Rath

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Machine Learning Techniques ◽

Support Vector ◽

Evaluation Report ◽

Linear Discriminant ◽

Learning Techniques

Download Full-text

Classification of Sentiment on Business Data for Decision Making using Supervised Machine Learning Methods

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c6086.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 3595-3600

Keyword(s):

Machine Learning ◽

Random Forest ◽

Sentiment Analysis ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Support Vector ◽

Product Review ◽

Data Set ◽

Random Forest Classification

Sentiment analysis is deals with the classification of sentiments expressed in a particular document. The analysis of user generated data by using sentiment analysis is very useful for knowing the opinion of a crowd. This paper is mainly aimed to tackle the problem of polarity categorization of sentiment analysis. A Detailed description of the sentiment analysis process is also given. Product review data set from UCI repository is used for analysis. This paper is giving a comparative analysis of four supervised machine learning algorithms namely Naive Bayes, Support Vector Machine, Decision Tree and Random Forest which are used for product review analysis. The result shows that, Random Forest classification algorithm provides better accuracy than other three algorithms

Download Full-text

Delineating Smallholder Maize Farms from Sentinel-1 Coupled with Sentinel-2 Data Using Machine Learning

Sustainability ◽

10.3390/su13094728 ◽

2021 ◽

Vol 13 (9) ◽

pp. 4728

Author(s):

Zinhle Mashaba-Munghemezulu ◽

George Johannes Chirima ◽

Cilence Munghemezulu

Keyword(s):

Machine Learning ◽

Food Security ◽

Rural Communities ◽

Machine Learning Algorithms ◽

Support Vector ◽

Subsistence Agriculture ◽

Smallholder Farms ◽

Main Driver ◽

Sentinel 2

Rural communities rely on smallholder maize farms for subsistence agriculture, the main driver of local economic activity and food security. However, their planted area estimates are unknown in most developing countries. This study explores the use of Sentinel-1 and Sentinel-2 data to map smallholder maize farms. The random forest (RF), support vector (SVM) machine learning algorithms and model stacking (ST) were applied. Results show that the classification of combined Sentinel-1 and Sentinel-2 data improved the RF, SVM and ST algorithms by 24.2%, 8.7%, and 9.1%, respectively, compared to the classification of Sentinel-1 data individually. Similarities in the estimated areas (7001.35 ± 1.2 ha for RF, 7926.03 ± 0.7 ha for SVM and 7099.59 ± 0.8 ha for ST) show that machine learning can estimate smallholder maize areas with high accuracies. The study concludes that the single-date Sentinel-1 data were insufficient to map smallholder maize farms. However, single-date Sentinel-1 combined with Sentinel-2 data were sufficient in mapping smallholder farms. These results can be used to support the generation and validation of national crop statistics, thus contributing to food security.

Download Full-text

Financial Context News Sentiment Analysis for the Lithuanian Language

Applied Sciences ◽

10.3390/app11104443 ◽

2021 ◽

Vol 11 (10) ◽

pp. 4443

Author(s):

Rokas Štrimaitis ◽

Pavel Stefanovič ◽

Simona Ramanauskaitė ◽

Asta Slotkienė

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Short Term Memory ◽

Machine Learning Algorithms ◽

Supervised Machine Learning ◽

Experimental Investigations ◽

Support Vector ◽

Applied Machine Learning ◽

Bayes Algorithm ◽

Website Content

Financial area analysis is not limited to enterprise performance analysis. It is worth analyzing as wide an area as possible to obtain the full impression of a specific enterprise. News website content is a datum source that expresses the public’s opinion on enterprise operations, status, etc. Therefore, it is worth analyzing the news portal article text. Sentiment analysis in English texts and financial area texts exist, and are accurate, the complexity of Lithuanian language is mostly concentrated on sentiment analysis of comment texts, and does not provide high accuracy. Therefore in this paper, the supervised machine learning model was implemented to assign sentiment analysis on financial context news, gathered from Lithuanian language websites. The analysis was made using three commonly used classification algorithms in the field of sentiment analysis. The hyperparameters optimization using the grid search was performed to discover the best parameters of each classifier. All experimental investigations were made using the newly collected datasets from four Lithuanian news websites. The results of the applied machine learning algorithms show that the highest accuracy is obtained using a non-balanced dataset, via the multinomial Naive Bayes algorithm (71.1%). The other algorithm accuracies were slightly lower: a long short-term memory (71%), and a support vector machine (70.4%).

Download Full-text

Classification of Diffusion Tensor Metrics for the Diagnosis of a Myelopathic Cord Using Machine Learning

International Journal of Neural Systems ◽

10.1142/s0129065717500368 ◽

2018 ◽

Vol 28 (02) ◽

pp. 1750036 ◽

Cited By ~ 8

Author(s):

Shuqiang Wang ◽

Yong Hu ◽

Yanyan Shen ◽

Hanxiong Li

Keyword(s):

Machine Learning ◽

Surgical Planning ◽

Diffusion Tensor ◽

Mean Value ◽

Machine Learning Algorithms ◽

Support Vector ◽

Svm Classifier ◽

Control Groups ◽

Diffusion Tensor Imaging Dti

In this study, we propose an automated framework that combines diffusion tensor imaging (DTI) metrics with machine learning algorithms to accurately classify control groups and groups with cervical spondylotic myelopathy (CSM) in the spinal cord. The comparison between selected voxel-based classification and mean value-based classification were performed. A support vector machine (SVM) classifier using a selected voxel-based dataset produced an accuracy of 95.73%, sensitivity of 93.41% and specificity of 98.64%. The efficacy of each index of diffusion for classification was also evaluated. Using the proposed approach, myelopathic areas in CSM are detected to provide an accurate reference to assist spine surgeons in surgical planning in complicated cases.

Download Full-text

PREDICTIVE MODELLING AND ANALYTICS FOR DIABETES USING A MACHINE LEARNING APPROACH

INFORMATION TECHNOLOGY IN INDUSTRY ◽

10.17762/itii.v9i1.121 ◽

2021 ◽

Vol 9 (1) ◽

pp. 215-223

Author(s):

Prateek Mishra, Dr.Anurag Sharma, Dr. Abhishek Badholia

Keyword(s):

Machine Learning ◽

Support Vector Machine ◽

Machine Learning Algorithms ◽

Computational Method ◽

Supervised Machine Learning ◽

Undiagnosed Diabetes ◽

Support Vector ◽

Entire Body ◽

Data Manipulation ◽

Kernel Support Vector Machine

Adverse effects can be seen in the entire body due to the major disorders known as Diabetes. The risk of dangers like diabetic nephropathy, cardiac stroke and other disorders can increase severally because of the undiagnosed diabetes. Around the globe the people are suffering from this disease. For a healthy life early detection of this disease is very curtail. As the causes of the diabetes is increasing rapidly this disease might turn up as a reason for worldwide concern. Increasing the chances for a more accurate predictions and form experiences automatic learning by computational method may be provided by Machine Learning (ML). With the help of R data manipulation tool for trends development and with risk factor patterns detection in Pima Indian diabetes technique of machine learning is been used in the current researches. With the use of R data manipulation tool analysis and development five different predictive models is done for the categorization of patients into diabetic and non- diabetic. supervised machine learning algorithms namely multifactor dimensionality reduction (MDR), k-nearest neighbor (k-NN), artificial neural network (ANN) radial basis function (RBF) kernel support vector machine and linear kernel support vector machine (SVM-linear) are used for this purpose.

Download Full-text

A Comparison of Machine Learning Algorithms for the Segmentation and Classification of Snow Micro Penetrometer Profiles on Arctic Sea Ice

10.5194/egusphere-egu21-15637 ◽

2021 ◽

Author(s):

Julia Kaltenborn ◽

Viviane Clay ◽

Amy R. Macfarlane ◽

Joshua Michael Lloyd King ◽

Martin Schneebeli

Keyword(s):

Machine Learning ◽

Sea Ice ◽

Arctic Sea Ice ◽

Machine Learning Algorithms ◽

Training Data ◽

Support Vector ◽

Snow Layer ◽

Arctic Sea ◽

Execution Speed

Snow-layer classification is an essential diagnostic task for a wide variety of cryospheric science and climate research applications. Traditionally, these measurements are made in snow pits, requiring trained operators and a substantial time commitment. The SnowMicroPen (SMP), a portable high-resolution snow penetrometer, has been demonstrated as a capable tool for rapid snow grain classification and layer type segmentation through statistical inversion of its mechanical signal. The manual classification of the SMP profiles requires time and training and becomes infeasible for large datasets.Here, we introduce a novel set of SMP measurements collected during the MOSAiC expedition and apply Machine Learning (ML) algorithms to automatically classify and segment SMP profiles of snow on Arctic sea ice. To this end, different supervised and unsupervised ML methods, including Random Forests, Support Vector Machines, Artificial Neural Networks, and k-means Clustering, are compared. A subsequent segmentation of the classified data results in distinct layers and snow grain markers for the SMP profiles. The models are trained with the dataset by King et al. (2020) and the MOSAiC SMP dataset. The MOSAiC dataset is a unique and extensive dataset characterizing seasonal and spatial variation of snow on the central Arctic sea-ice.We will test and compare the different algorithms and evaluate the algorithms&#8217; effectiveness based on the need for initial dataset labeling, execution speed, and ease of implementation. In particular, we will compare supervised to unsupervised methods, which are distinguished by their need for labeled training data.The implementation of different ML algorithms for SMP profile classification could provide a fast and automatic grain type classification and snow layer segmentation. Based on the gained knowledge from the algorithms&#8217; comparison, a tool can be built to provide scientists from different fields with an immediate SMP profile classification and segmentation.&#160;&#160;King, J., Howell, S., Brady, M., Toose, P., Derksen, C., Haas, C., & Beckers, J. (2020). Local-scale variability of snow density on Arctic sea ice. The Cryosphere, 14(12), 4323-4339, https://doi.org/10.5194/tc-14-4323-2020.

Download Full-text

Tremor Identification Using Machine Learning in Parkinson's Disease

Early Detection of Neurological Disorders Using Machine Learning Systems - Advances in Medical Technologies and Clinical Practice ◽

10.4018/978-1-5225-8567-1.ch008 ◽

2019 ◽

pp. 128-151

Author(s):

Angana Saikia ◽

Vinayak Majhi ◽

Masaraf Hussain ◽

Sudip Paul ◽

Amitava Datta

Keyword(s):

Machine Learning ◽

Parkinson’S Disease ◽

Support Vector Machine ◽

Parkinson's Disease ◽

Discriminant Analysis ◽

Learning Algorithms ◽

The Body ◽

Machine Learning Algorithms ◽

Support Vector

Tremor is an involuntary quivering movement or shake. Characteristically occurring at rest, the classic slow, rhythmic tremor of Parkinson's disease (PD) typically starts in one hand, foot, or leg and can eventually affect both sides of the body. The resting tremor of PD can also occur in the jaw, chin, mouth, or tongue. Loss of dopamine leads to the symptoms of Parkinson's disease and may include a tremor. For some people, a tremor might be the first symptom of PD. Various studies have proposed measurable technologies and the analysis of the characteristics of Parkinsonian tremors using different techniques. Various machine-learning algorithms such as a support vector machine (SVM) with three kernels, a discriminant analysis, a random forest, and a kNN algorithm are also used to classify and identify various kinds of tremors. This chapter focuses on an in-depth review on identification and classification of various Parkinsonian tremors using machine learning algorithms.

Download Full-text

MACHINE LEARNING ALGORITHMS FOR IDENTIFICATION OF ABNORMAL GLOW CURVES AND ASSOCIATED ABNORMALITY IN CaSO4:DY-BASED PERSONNEL MONITORING DOSIMETERS

Radiation Protection Dosimetry ◽

10.1093/rpd/ncaa108 ◽

2020 ◽

Vol 190 (3) ◽

pp. 342-351

Author(s):

Munir S Pathan ◽

S M Pradhan ◽

T Palani Selvam

Keyword(s):

Machine Learning ◽

Glow Curve ◽

Good Accuracy ◽

Machine Learning Algorithms ◽

Support Vector ◽

Computationally Efficient ◽

Artificial Neural Network Ann ◽

First Time

Abstract In the present study, machine learning (ML) methods for the identification of abnormal glow curves (GC) of CaSO4:Dy-based thermoluminescence dosimeters in individual monitoring are presented. The classifier algorithms, random forest (RF), artificial neural network (ANN) and support vector machine (SVM) are employed for identifying not only the abnormal glow curve but also the type of abnormality. For the first time, the simplest and computationally efficient algorithm based on RF is presented for GC classifications. About 4000 GCs are used for the training and validation of ML algorithms. The performance of all algorithms is compared by using various parameters. Results show a fairly good accuracy of 99.05% for the classification of GCs by RF algorithm. Whereas 96.7% and 96.1% accuracy is achieved using ANN and SVM, respectively. The RF-based classifier is recommended for GC classification as well as in assisting the fault determination of the TLD reader system.

Download Full-text