scholarly journals Research on the Prediction of Green Plum Acidity Based on Improved XGBoost

Sensors ◽  
2021 ◽  
Vol 21 (3) ◽  
pp. 930
Author(s):  
Yang Liu ◽  
Honghong Wang ◽  
Yeqi Fei ◽  
Ying Liu ◽  
Luxiang Shen ◽  
...  

The acidity of green plum has an important influence on the fruit’s deep processing. Traditional physical and chemical analysis methods for green plum acidity detection are destructive, time-consuming, and unable to achieve online detection. In response, a rapid and non-destructive detection method based on hyperspectral imaging technology was studied in this paper. Research on prediction performance comparisons between supervised learning methods and unsupervised learning methods is currently popular. To further improve the accuracy of component prediction, a new hyperspectral imaging system was developed, and the kernel principle component analysis—linear discriminant analysis—extreme gradient boosting algorithm (KPCA-LDA-XGB) model was proposed to predict the acidity of green plum. The KPCA-LDA-XGB model is a supervised learning model combined with the extreme gradient boosting algorithm (XGBoost), kernel principal component analysis (KPCA), and linear discriminant analysis (LDA). The experimental results proved that the KPCA-LDA-XGB model offers good acidity predictions for green plum, with a correlation coefficient (R) of 0.829 and a root mean squared error (RMSE) of 0.107 for the prediction set. Compared with the basic XGBoost model, the KPCA-LDA-XGB model showed a 79.4% increase in R and a 31.2% decrease in RMSE. The use of linear, radial basis function (RBF), and polynomial (Poly) kernel functions were also compared and analyzed in this paper to further optimize the KPCA-LDA-XGB model.

Sensors ◽  
2022 ◽  
Vol 22 (1) ◽  
pp. 367
Author(s):  
Janez Lapajne ◽  
Matej Knapič ◽  
Uroš Žibrat

Hyperspectral imaging is a popular tool used for non-invasive plant disease detection. Data acquired with it usually consist of many correlated features; hence most of the acquired information is redundant. Dimensionality reduction methods are used to transform the data sets from high-dimensional, to low-dimensional (in this study to one or a few features). We have chosen six dimensionality reduction methods (partial least squares, linear discriminant analysis, principal component analysis, RandomForest, ReliefF, and Extreme gradient boosting) and tested their efficacy on a hyperspectral data set of potato tubers. The extracted or selected features were pipelined to support vector machine classifier and evaluated. Tubers were divided into two groups, healthy and infested with Meloidogyne luci. The results show that all dimensionality reduction methods enabled successful identification of inoculated tubers. The best and most consistent results were obtained using linear discriminant analysis, with 100% accuracy in both potato tuber inside and outside images. Classification success was generally higher in the outside data set, than in the inside. Nevertheless, accuracy was in all cases above 0.6.


Author(s):  
David Zhang ◽  
Xiao-Yuan Jing ◽  
Jian Yang

This chapter presents two straightforward image projection techniques — two-dimensional (2D) image matrix-based principal component analysis (IMPCA, 2DPCA) and 2D image matrix-based Fisher linear discriminant analysis (IMLDA, 2DLDA). After a brief introduction, we first introduce IMPCA. Then IMLDA technology is given. As a result, we summarize some useful conclusions.


Electronics ◽  
2019 ◽  
Vol 8 (8) ◽  
pp. 870
Author(s):  
Tengteng Wen ◽  
Dehan Luo ◽  
Yongjie Ji ◽  
Pingzhong Zhong

Odor reproduction, a branch of machine olfaction, is a technology through which a machine represents various odors by blending several odor sources in different proportions and releases them. In this paper, an odor reproduction system is proposed. The system includes an atomization-based odor dispenser using 16 micro-porous piezoelectric transducers. The authors propose the use of an electronic nose combined with a Principal Component Analysis–Linear Discriminant Analysis (PCA–LDA) model to evaluate the effectiveness of the system. The results indicate that the model can be used to evaluate the system.


2019 ◽  
Vol 3 (2) ◽  
pp. 72
Author(s):  
Widi Astuti ◽  
Adiwijaya Adiwijaya

Cancer is one of the leading causes of death globally. Early detection of cancer allows better treatment for patients. One method to detect cancer is using microarray data classification. However, microarray data has high dimensions which complicates the classification process. Linear Discriminant Analysis is a classification technique which is easy to implement and has good accuracy. However, Linear Discriminant Analysis has difficulty in handling high dimensional data. Therefore, Principal Component Analysis, a feature extraction technique is used to optimize Linear Discriminant Analysis performance. Based on the results of the study, it was found that usage of Principal Component Analysis increases the accuracy of up to 29.04% and f-1 score by 64.28% for colon cancer data.


2021 ◽  
Vol 2021 ◽  
pp. 1-11
Author(s):  
Heping Li ◽  
Yu Ren ◽  
Fan Yu ◽  
Dongliang Song ◽  
Lizhe Zhu ◽  
...  

To facilitate the enhanced reliability of Raman-based tumor detection and analytical methodologies, an ex vivo Raman spectral investigation was conducted to identify distinct compositional information of healthy (H), ductal carcinoma in situ (DCIS), and invasive ductal carcinoma (IDC). Then, principal component analysis-linear discriminant analysis (PCA-LDA) and principal component analysis-support vector machine (PCA-SVM) models were constructed for distinguishing spectral features among different tissue groups. Spectral analysis highlighted differences in levels of unsaturated and saturated lipids, carotenoids, protein, and nucleic acid between healthy and cancerous tissue and variations in the levels of nucleic acid, protein, and phenylalanine between DCIS and IDC. Both classification models were principal component analysis-linear discriminant analysis to be extremely efficient on discriminating tissue pathological types with 99% accuracy for PCA-LDA and 100%, 100%, and 96.7% for PCA-SVM analysis based on linear kernel, polynomial kernel, and radial basis function (RBF), respectively, while PCA-SVM algorithm greatly simplified the complexity of calculation without sacrificing performance. The present study demonstrates that Raman spectroscopy combined with multivariate analysis technology has considerable potential for improving the efficiency and performance of breast cancer diagnosis.


Sign in / Sign up

Export Citation Format

Share Document