Optimised Scaling (OS-2) Regression Applied to near Infrared Diffuse Spectroscopy Data from Food Products

Tomas Isaksson; Ziyi Wang; Bruce Kowalski

doi:10.1255/jnirs.12

Optimised Scaling (OS-2) Regression Applied to near Infrared Diffuse Spectroscopy Data from Food Products

Journal of Near Infrared Spectroscopy ◽

10.1255/jnirs.12 ◽

1993 ◽

Vol 1 (2) ◽

pp. 85-97 ◽

Cited By ~ 7

Author(s):

Tomas Isaksson ◽

Ziyi Wang ◽

Bruce Kowalski

Keyword(s):

Near Infrared ◽

Scatter Correction ◽

Principal Component Regression ◽

Predictive Ability ◽

Principal Component ◽

Calibration Method ◽

Meat Sample ◽

Data Sets ◽

Regression Methods ◽

Calibration Models

A recently presented calibration method, called optimised scaling (OS-2) was tested and compared to multiplicative scatter correction (MSC) and principal component regression (PCR). The predictive ability of these regression methods was tested on eight data sets consisting of diffuse near infrared (NIR) reflectance and transmittance continuous spectra of meat, sausages, soya bean and designed sample sets. Calibration was performed for constituents such as fat, protein, water, carbohydrate, temperature, lactate and glucose. A total of 21 calibration models were validated and compared. OS-2 gave good or promising prediction results for the major constituents with large variation, such as prediction of fat in two of the studied meat sample sets. OS-2 gave poorer prediction results of minor constituents compared to MSC or first derivatives of the data and PCR.

Download Full-text

The Effect of Multiplicative Scatter Correction (MSC) and Linearity Improvement in NIR Spectroscopy

Applied Spectroscopy ◽

10.1366/0003702884429869 ◽

1988 ◽

Vol 42 (7) ◽

pp. 1273-1284 ◽

Cited By ~ 362

Author(s):

Tomas Isaksson ◽

Tormod Næs

Keyword(s):

Near Infrared ◽

Scatter Correction ◽

Principal Component Regression ◽

Nir Spectroscopy ◽

Principal Component ◽

Data Sets ◽

Prediction Errors ◽

Regression Methods ◽

Multiplicative Scatter Correction ◽

Linearity Improvement

Near-infrared (NIR) reflectance spectra of five different food products were measured. The spectra were transformed by multiplicative scatter correction (MSC). Principal component regression (PCR) was performed, on both scatter-corrected and uncorrected spectra. Calibration and prediction were performed for four food constituents: protein, fat, water, and carbohydrates. All regressions gave lower prediction errors (7–68% improvement) by the use of MSC spectra than by the use of uncorrected absorbance spectra. One of these data sets was studied in more detail to clarify the effects of the MSC, by using PCR score, residual, and leverage plots. The improvement by using nonlinear regression methods is indicated.

Download Full-text

A Comparison of Techniques for Modelling Data with Non-Linear Structure

Journal of Near Infrared Spectroscopy ◽

10.1255/jnirs.354 ◽

2003 ◽

Vol 11 (1) ◽

pp. 55-70 ◽

Cited By ~ 6

Author(s):

Laila Stordrange ◽

Olav M. Kvalheim ◽

Per A. Hassel ◽

Dick Malthe-Sørenssen ◽

Fred Olav Libnau

Keyword(s):

Near Infrared ◽

Principal Component Regression ◽

Predictive Ability ◽

Principal Component ◽

Linear Structure ◽

Data Set ◽

Linear Pattern ◽

Near Infrared Spectra ◽

Non Linear ◽

Regression Techniques

Partial least squares (PLS) is a powerful tool for multivariate linear regression. But what if the data show a non-linear structure? Near infrared spectra from a pharmaceutical process were used as a case study. An ANOVA test revealed that the data are well described by a 2nd order polynomial. This work investigates the application of regression techniques that account for slightly non-linear data. The regression techniques investigated are: linearising data by applying transformations, local PLS, i.e. splitting of data, and quadratic PLS. These models were compared with ordinary PLS and principal component regression (PCR). The predictive ability of the models was tested on an independent data set acquired a year later. Using the knowledge of non-linear pattern and important spectral regions, simpler models with better predictive ability can be obtained.

Download Full-text

Multivariate Calibration and Chemometrics for near Infrared Spectroscopy: Which Method?

Journal of Near Infrared Spectroscopy ◽

10.1255/jnirs.283 ◽

2000 ◽

Vol 8 (4) ◽

pp. 229-237 ◽

Cited By ~ 98

Author(s):

Pierre Dardenne ◽

George Sinnaeve ◽

Vincent Baeten

Keyword(s):

Near Infrared ◽

Multivariate Calibration ◽

Scatter Correction ◽

Large Data ◽

Data Sets ◽

Agricultural Commodity ◽

Regression Methods ◽

Standard Normal Variate ◽

Pre Treatment ◽

Artificial Neural Network Ann

The four most important regression methods are evaluated on very large data sets: Multiple Linear Regression (MLR), Partial Least Squares (PLS), Artificial Neural Network (ANN) and a new concept called “LOCAL” (PLS with selection of a calibration sample subset of the closest neighbours for each sample to predict). The Standard Errors of Prediction ( SEPs) are statistically tested and the results show that the regression methods are almost equal and that the data matrices are more important than the fitting methods themselves. The types of pre-treatments (Multiplicative Scatter Correction, Detrend, Standard Normal Variate, derivative etc.) of the spectra are too numerous to be able to test all the combinations. For each test, the pre-treatment found as the best with the PLS method is fixed for the other ones. The second part of the paper emphasises the importance of the number of samples. If any agricultural commodity, and probably any kind of product measured by an NIR instrument, can be considered as a mixture of several constituents, the databases built by collecting actual samples bringing new information can reach hundreds, if not thousands, of samples.

Download Full-text

Measurements of Urea and Glucose in Aqueous Solutions with Dual-Beam Near-Infrared Fourier Transform Spectroscopy

Applied Spectroscopy ◽

10.1366/000370202321115869 ◽

2002 ◽

Vol 56 (12) ◽

pp. 1593-1599 ◽

Cited By ~ 6

Author(s):

Peter Snoer Jensen ◽

Jimmy Bak

Keyword(s):

Aqueous Solutions ◽

Near Infrared ◽

Principal Component Regression ◽

Principal Component ◽

Prediction Errors ◽

Single Beam ◽

Dual Beam ◽

Calibration Models ◽

Ft Ir ◽

Beam Technique

This study investigates the use of a dual-beam, optical null, FT-IR spectrometer to measure trace organic components in aqueous solutions in the combination band region 5000–4000 cm−1. The spectrometer may be used for both single- and dual-beam measurements, thereby facilitating comparison of these two modes of operation. The concentrations of aqueous solutions of urea and glucose in the ranges 0–40 mg/dL and 0–250 mg/dL, respectively, were determined by principal component regression using both modes. The dual-beam technique eliminated instrumental variations present in the single-beam measurements that must be taken into account when quantifying trace components from single-beam spectra. The data obtained with the dual-beam technique resulted in more stable calibration models based on principal component regression. These calibration models need fewer factors and yield lower prediction errors than those based on traditional single-beam data.

Download Full-text

Estimation of Glucosinolates and Anthocyanins in Kale Leaves Grown in a Plant Factory Using Spectral Reflectance

Horticulturae ◽

10.3390/horticulturae7030056 ◽

2021 ◽

Vol 7 (3) ◽

pp. 56

Author(s):

Milon Chowdhury ◽

Viet-Duc Ngo ◽

Md Nafiul Islam ◽

Mohammod Ali ◽

Sumaiya Islam ◽

...

Keyword(s):

Spectral Reflectance ◽

Near Infrared ◽

Diffuse Reflectance Spectroscopy ◽

Principal Component Regression ◽

Principal Component ◽

Least Squares Regression ◽

Regression Methods ◽

Freeze Dried ◽

Plant Factory ◽

Functional Components

The spectral reflectance technique for the quantification of the functional components was applied in different studies for different crops, but related research on kale leaves is limited. This study was conducted to estimate the glucosinolate and anthocyanin components of kale leaves cultivated in a plant factory based on diffuse reflectance spectroscopy through regression methods. Kale was grown in a plant factory under different treatments. After specific periods of transplantation, leaf samples were collected, and reflectance spectra were measured immediately from nine different points on each leaf. The same leaf samples were freeze-dried and stored for analysis of the functional components. Regression procedures, such as principal component regression (PCR), partial least squares regression (PLSR), and stepwise multiple linear regression (SMLR), were applied to relate the functional components with the spectral data. In the laboratory analysis, progoitrin and glucobrassicin, as well as cyanidin and malvidin, were found to be dominating components in glucosinolates and anthocyanins, respectively. From the overall analysis, the SMLR model showed better performance, and the identified wavelengths for estimating the glucosinolates and anthocyanins were in the early near-infrared (NIR) region. Specifically, reflectance at 742, 761, 787, 796, 805, 833, 855, 932, 947, and 1000 nm showed a strong correlation.

Download Full-text

Influence of environmental factors on the detection of blood in sheep faeces using visible–near-infrared spectroscopy as a measure of Haemonchus contortus infection

Parasites & Vectors ◽

10.1186/s13071-020-04468-6 ◽

2020 ◽

Vol 13 (1) ◽

Author(s):

Elise A. Kho ◽

Jill N. Fernandes ◽

Andrew C. Kotze ◽

Glen P. Fox ◽

Maggy T. Sikulu-Lord ◽

...

Keyword(s):

Principal Component Analysis ◽

Haemonchus Contortus ◽

Prediction Accuracy ◽

Near Infrared ◽

Nir Spectroscopy ◽

Principal Component ◽

Component Analysis ◽

Diagnostic Methods ◽

Calibration Models ◽

Good Prediction Accuracy

Abstract Background Existing diagnostic methods for the parasitic gastrointestinal nematode, Haemonchus contortus, are time consuming and require specialised expertise, limiting their utility in the field. A practical, on-farm diagnostic tool could facilitate timely treatment decisions, thereby preventing losses in production and flock welfare. We previously demonstrated the ability of visible–near-infrared (Vis–NIR) spectroscopy to detect and quantify blood in sheep faeces with high accuracy. Here we report our investigation of whether variation in sheep type and environment affect the prediction accuracy of Vis–NIR spectroscopy in quantifying blood in faeces. Methods Visible–NIR spectra were obtained from worm-free sheep faeces collected from different environments and sheep types in South Australia (SA) and New South Wales, Australia and spiked with various sheep blood concentrations. Spectra were analysed using principal component analysis (PCA), and calibration models were built around the haemoglobin (Hb) wavelength region (387–609 nm) using partial least squares regression. Models were used to predict Hb concentrations in spiked faeces from SA and naturally infected sheep faeces from Queensland (QLD). Samples from QLD were quantified using Hemastix® test strip and FAMACHA© diagnostic test scores. Results Principal component analysis showed that location, class of sheep and pooled versus individual samples were factors affecting the Hb predictions. The models successfully differentiated ‘healthy’ SA samples from those requiring anthelmintic treatment with moderate to good prediction accuracy (sensitivity 57–94%, specificity 44–79%). The models were not predictive for blood in the naturally infected QLD samples, which may be due in part to variability of faecal background and blood chemistry between samples, or the difference in validation methods used for blood quantification. PCA of the QLD samples, however, identified a difference between samples containing high and low quantities of blood. Conclusion This study demonstrates the potential of Vis–NIR spectroscopy for estimating blood concentration in faeces from various types of sheep and environmental backgrounds. However, the calibration models developed here did not capture sufficient environmental variation to accurately predict Hb in faeces collected from environments different to those used in the calibration model. Consequently, it will be necessary to establish models that incorporate samples that are more representative of areas where H. contortus is endemic.

Download Full-text

Cox Survival Analysis of Microarray Gene Expression Data Using Correlation Principal Component Regression

Statistical Applications in Genetics and Molecular Biology ◽

10.2202/1544-6115.1153 ◽

2007 ◽

Vol 6 (1) ◽

Cited By ~ 4

Author(s):

Qiang Zhao ◽

Jianguo Sun

Keyword(s):

Gene Expression ◽

Gene Expression Data ◽

Principal Component Regression ◽

Predictive Ability ◽

Principal Component ◽

Microarray Gene Expression Data ◽

Expression Data ◽

Microarray Gene Expression ◽

New Approach ◽

Microarray Gene

Statistical analysis of microarray gene expression data has recently attracted a great deal of attention. One problem of interest is to relate genes to survival outcomes of patients with the purpose of building regression models for the prediction of future patients' survival based on their gene expression data. For this, several authors have discussed the use of the proportional hazards or Cox model after reducing the dimension of the gene expression data. This paper presents a new approach to conduct the Cox survival analysis of microarray gene expression data with the focus on models' predictive ability. The method modifies the correlation principal component regression (Sun, 1995) to handle the censoring problem of survival data. The results based on simulated data and a set of publicly available data on diffuse large B-cell lymphoma show that the proposed method works well in terms of models' robustness and predictive ability in comparison with some existing partial least squares approaches. Also, the new approach is simpler and easy to implement.

Download Full-text

Nondestructive NIR and NIT Determination of Protein, Fat, and Water in Plastic-Wrapped, Homogenized Meat

Applied Spectroscopy ◽

10.1366/0003702924926745 ◽

1992 ◽

Vol 46 (11) ◽

pp. 1685-1694 ◽

Cited By ~ 37

Author(s):

Tomas Isaksson ◽

Charles E. Miller ◽

Tormod Næs

Keyword(s):

Diffuse Reflectance ◽

Near Infrared ◽

Multivariate Calibration ◽

Principal Component Regression ◽

Principal Component ◽

Prediction Errors ◽

Optimal Test ◽

Meat Samples ◽

Water Contents

In this work, the abilities of near-infrared diffuse reflectance (NIR) and transmittance (NIT) spectroscopy to noninvasively determine the protein, fat, and water contents of plastic-wrapped homogenized meat are evaluated. One hundred homogenized beef samples, ranging from 1 to 23% fat, wrapped in polyamide/polyethylene laminates, were used. Results of multivariate calibration and prediction for protein, fat, and water contents are presented. The optimal test set prediction errors (root mean square error of prediction, RMSEP), obtained with the use of the principal component regression method with NIR data, were 0.45, 0.29 and 0.50 weight % for protein, fat, and water, respectively, for plastic-wrapped meat (compared to 0.40, 0.28 and 0.45 wt % for unwrapped meat). The optimal prediction errors for the NIT method were 0.31, 0.52 and 0.42 wt % for protein, fat, and water, respectively, for plastic-wrapped meat samples (compared to 0.27, 0.38, and 0.37 wt % for unwrapped meat). We can conclude that the addition of the laminate only slightly reduced the abilities of the NIR and NIT method to predict protein, fat, and water contents in homogenized meat.

Download Full-text

Identification of Liquor Brands Based on near Infrared Spectroscopy

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.834-836.935 ◽

2013 ◽

Vol 834-836 ◽

pp. 935-938

Author(s):

Lian Shun Zhang ◽

Chao Guo ◽

Bao Quan Wang

Keyword(s):

Principal Component Analysis ◽

Infrared Spectroscopy ◽

Near Infrared Spectroscopy ◽

Principal Components ◽

Near Infrared ◽

Scatter Correction ◽

Principal Component ◽

Component Analysis ◽

Correction Method ◽

Variance Contribution

In this paper, the liquor brands were identified based on the near infrared spectroscopy method and the principal component analysis. 60 samples of 6 different brands liquor were measured by the spectrometer of USB4000. Then, in order to eliminate the noise caused by the external factors, the smoothing method and the multiplicative scatter correction method were used. After the preprocessing, we got the revised spectra of the 60 samples. The difference of the spectrum shape of different brands is not much enough to classify them. So the principal component analysis was applied for further analysis. The results showed that the first two principal components variance contribution rate had reached 99.06%, which can effectively represent the information of the spectrums after preprocessing. From the scatter plot of the two principal components, the 6 different brands of liquor were identified more accurate and easier than the spectra curves.

Download Full-text

Principal component articial neural network calibration models for the simultaneous spectrophotometric estimation of mefenamic acid and paracetamol in tablets

Journal of the Serbian Chemical Society ◽

10.2298/jsc0611207s ◽

2006 ◽

Vol 71 (11) ◽

pp. 1207-1218

Author(s):

Dondeti Satyanarayana ◽

Kamarajan Kannan ◽

Rajappan Manavalan

Keyword(s):

Neural Network ◽

Neural Networks ◽

Mefenamic Acid ◽

Principal Component Regression ◽

Principal Component ◽

Simultaneous Estimation ◽

Uv Spectrophotometry ◽

Novel Approach ◽

Marquardt Algorithm ◽

Calibration Models

Simultaneous estimation of all drug components in a multicomponent analgesic dosage form with artificial neural networks calibration models using UV spectrophotometry is reported as a simple alternative to using separate models for each component. A novel approach for calibration using a compound spectral dataset derived from three spectra of each component is described. The spectra of mefenamic acid and paracetamol were recorded as several concentrations within their linear range and used to compute a calibration mixture between the wavelengths 220 to 340 nm. Neural networks trained by a Levenberg-Marquardt algorithm were used for building and optimizing the calibration models using MATALAB? Neural Network Toolbox and were compared with the principal component regression model. The calibration models were thoroughly evaluated at several concentration levels using 104 spectra obtained for 52 synthetic binary mixtures prepared using orthogonal designs. The optimized model showed sufficient robustness even when the calibration sets were constructed from a different set of pure spectra of the components. The simultaneous prediction of both components by a single neural network with the suggested calibration approach was successful. The model could accurately estimate the drugs, with satisfactory precision and accuracy, in tablet dosage with no interference from excipients as indicated by the results of a recovery study.

Download Full-text