Diagnosis of early gastric cancer based on fluorescence hyperspectral imaging technology combined with partial‐least‐square discriminant analysis and support vector machine

2019 ◽  
Vol 12 (5) ◽  
Author(s):  
Yuanpeng Li ◽  
Xiaojuan Xie ◽  
Xinhao Yang ◽  
Liu Guo ◽  
Zhao Liu ◽  
...  

Worldwide, breast cancer is the leading type of cancer in women accounting for 25% of all cases. Survival rates in the developed countries are comparatively higher with that of developing countries. This had led to the importance of computer aided diagnostic methods for early detection of breast cancer disease. This eventually reduces the death rate. This paper intents the scope of the biomarker that can be used to predict the breast cancer from the anthropometric data. This experimental study aims at computing and comparing various classification models (Binary Logistic Regression, Ball Vector Machine (BVM), C4.5, Partial Least Square (PLS) for Classification, Classification Tree, Cost sensitive Classification Tree, Cost sensitive Decision Tree, Support Vector Machine for Classification, Core Vector Machine, ID3, K-Nearest Neighbor, Linear Discriminant Analysis (LDA), Log-Reg TRIRLS, Multi Layer Perceptron (MLP), Multinomial Logistic Regression (MLR), Naïve Bayes (NB), PLS for Discriminant Analysis, PLS for LDA, Random Tree (RT), Support Vector Machine SVM) for the UCI Coimbra breast cancer dataset. The feature selection algorithms (Backward Logit, Fisher Filtering, Forward Logit, ReleifF, Step disc) are worked out to find out the minimum attributes that can achieve a better accuracy. To ascertain the accuracy results, the Jack-knife cross validation method for the algorithms is conducted and validated. The Core vector machine classification algorithm outperforms the other nineteen algorithms with an accuracy of 82.76%, sensitivity of 76.92% and specificity of 87.50% for the selected three attributes, Age, Glucose and Resistin using ReleifF feature selection algorithm.


2020 ◽  
Vol 28 (2) ◽  
pp. 70-80 ◽  
Author(s):  
Perez Mukasa ◽  
Collins Wakholi ◽  
Akbar Faqeerzada Mohammad ◽  
Eunsoo Park ◽  
Jayoung Lee ◽  
...  

The combination of hyperspectral imaging with multivariate data analysis methods has recently been applied to develop a nondestructive technique, required to determine the seed viability of artificially aged vegetable and cereal seeds. In this study, the potential of shortwave infrared hyperspectral imaging to determine the viability of naturally aged seeds was investigated and thereafter a model for online seed sorting system was developed. The hyperspectral images of 400 Hinoki cypress tree seeds were acquired, and germination tests were conducted for viability confirmation, which indicated 31.5% of the viable seeds. Partial least square discriminant analysis models with 179 variables in the wavelength region of 1000–1800 nm were developed with a maximum model accuracy of 98.4% and 93.8% in both the calibration and validation sets, respectively. The partial least square discriminant analysis beta coefficient revealed the key wavelengths to differentiate viable from nonviable seeds, determined based on the differences in the chemical compositions of the seeds, including their lipid and fatty acid contents, which may control the germination ability of the seeds. The most effective wavelengths were selected using two model-based variable selection methods (i.e., the variable importance of projection (15 variables) and the successive projections algorithm (8 variables)) to develop the model. The successive projections algorithm wavelength selection method was considered to develop a viability model, and its application to the raw data resulted in a prediction accuracy of 94.7% in the calibration set and 92.2% in the validation set. These results demonstrate the potential of shortwave infrared hyperspectral imaging spectroscopy as a powerful nondestructive method to determine the viability of Hinoki cypress seeds. This method could be applied to develop an online seed sorting system for seed companies and nurseries.


Author(s):  
Binu Devassy ◽  
Sony George

Firmness is one of the most important quality measures of strawberries, and is related to other aspects of the fruit, such as flavour, ripeness and internal characteristics. The most popular method for measuring firmness is puncturing with a penetrometer, which is destructive and time-consuming. In the present study, we make an attempt to predict the firmness of strawberries in a fast, non-destructive and non-contact way using hyperspectral imaging (HSI) and data analysis with various regression techniques. The primary goal of this research is to investigate and compare the firmness prediction capability of seven prominent regression techniques. We have performed HSI data acquisition of 150 strawberries and optimised seven regression models using the spectral information to predict strawberry firmness. These models are linear, ridge, lasso, k-neighbours, random forest, support vector and partial least square regression. The res ults show that HSI data with regression models has the potential to predict firmness in a rapid, non-destructive manner. Out of these seven regression models, the k-neighbours regression model outperformed all other methods with a standard error of prediction of 0.14, which is better than that of the state-of-the-art results.


2020 ◽  
Vol 28 (5-6) ◽  
pp. 255-266 ◽  
Author(s):  
Elise A Kho ◽  
Jill N Fernandes ◽  
Andrew C Kotze ◽  
Glen P Fox ◽  
Maggy Lord ◽  
...  

Heavy infestations of the blood-sucking gastrointestinal nematodes, Haemonchus contortus can cause severe anaemia in sheep and leakage of blood into the faeces, leading to morbidity and mortality. Early and accurate diagnosis of infections is critical for timely treatment of sheep, minimizing production and sheep welfare impacts. In pursuit of a quick and easy measure of H. contortus infections, we investigated the use of portable visible near infrared spectrometers for detecting the presence of haemoglobin in sheep faeces as an indicator of H. contortus infection. Calibration models built within the 400–600 nm region by partial least square regression resulted in acceptable prediction accuracies (r 2 p > 0.70 and root mean squared error of prediction <2.64 µg Hb mg−1 faeces) for haemoglobin quantification using two spectrometers. The prediction results from support vector machine regression further improved the prediction of haemoglobin in moist sheep faeces (r 2 p > 0.87 and root mean squared error of prediction <2.00 µg haemoglobin mg−1 faeces). Based on a threshold for anthelmintic treatment of 3 µg Hb mg−1 faeces, both the partial least square and support vector machine models showed high sensitivity (89%) and high specificity (>77%). The specificity of the prediction model for detecting haemoglobin in sheep faeces may be improved by adding more variations in faecal composition into the calibration model. Our success in detecting haemoglobin in sheep faeces, following minimal sample preparation, suggests that with further development, vis–near infrared spectroscopy can provide a sensitive and convenient method for on-farm diagnosis of H. contortus infections.


2020 ◽  
Vol 10 (19) ◽  
pp. 6724
Author(s):  
Youngwook Seo ◽  
Ahyeong Lee ◽  
Balgeum Kim ◽  
Jongguk Lim

(1) Background: The general use of food-processing facilities in the agro-food industry has increased the risk of unexpected material contamination. For instance, grain flours have similar colors and shapes, making their detection and isolation from each other difficult. Therefore, this study is aimed at verifying the feasibility of detecting and isolating grain flours by using hyperspectral imaging technology and developing a classification model of grain flours. (2) Methods: Multiple hyperspectral images were acquired through line scanning methods from reflectance of visible and near-infrared wavelength (400–1000 nm), reflectance of shortwave infrared wavelength (900–1700 nm), and fluorescence (400–700 nm) by 365 nm ultraviolet (UV) excitation. Eight varieties of grain flours were prepared (rice: 4, starch: 4), and the particle size and starch damage content were measured. To develop the classification model, four multivariate analysis methods (linear discriminant analysis (LDA), partial least-square discriminant analysis, support vector machine, and classification and regression tree) were implemented with several pre-processing methods, and their classification results were compared with respect to accuracy and Cohen’s kappa coefficient obtained from confusion matrices. (3) Results: The highest accuracy was achieved as 97.43% through short-wavelength infrared with normalization in the spectral domain. The submission of the developed classification model to the hyperspectral images showed that the fluorescence method achieves the highest accuracy of 81% using LDA. (4) Conclusions: In this study, the potential of non-destructive classification of rice and starch flours using multiple hyperspectral modalities and chemometric methods were demonstrated.


2019 ◽  
Vol 11 (14) ◽  
pp. 1966-1975 ◽  
Author(s):  
Marcelo C. A. Marcelo ◽  
Frederico L. F. Soares ◽  
Jorge A. Ardila ◽  
Jailson C. Dias ◽  
Ricardo Pedó ◽  
...  

Classification systems are frequently used in tobacco Green Leaf Threshing (GLT) facilities to assess the chemical characteristics and quality of tobacco leaves.


2014 ◽  
Vol 11 (1) ◽  
pp. 1
Author(s):  
Mas Ezatul Nadia Mohd Ruah ◽  
Nor Fazila Rasaruddin ◽  
Siong Fong Sim ◽  
Mohd Zuli Jaafar

This paper outlines the application of chemometrics and pattern recognition tools to classify palm oil using Fourier Transform Mid Infrared spectroscopy (FT-MIR). FT-MIR spectroscopy is used as an effective analytical tool in order to categorise the oil into the category of unused palm oil and used palm oil for frying. The samples used in this study consist of 28 types of pure palm oil, and 28 types of frying palm oils. FT-MIR spectral was obtained in absorbance mode at the spectral range from 650 cm·1 to 4000 cm·1 using FT-MIR-ATR sample handling. The aim of this work is to develop fast method in discriminating the palm oil by implementing Partial Least Square Discriminant Analysis (PLS-DA), Leaming Vector Quantisation (LVQ) and Support Vector Machine (SVM). Raw FT-MIR spectra were subjected to Savitzky-Golay smoothing and standardised before developing the classification models. The classification model was validated by finding the value of percentage correctly classified using test set for every model in order to show which classifier provided the best classification. In order to improve the performance of the classification model, variable selection method known as /-statistic method was applied. The significant variable in developing classification model was selected through this method. The result revealed that PLS-DA classifier of the standardised data with application of t-statistic showed the best performance with highest percentage correctly classified among the classifiers.


Sign in / Sign up

Export Citation Format

Share Document