• Spectroscopy and Spectral Analysis
  • Vol. 38, Issue 10, 3058 (2018)
LI Hong-qiang1、2、*, SUN Hong1, and LI Min-zan1
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less
    DOI: 10.3964/j.issn.1000-0593(2018)10-3058-06 Cite this Article
    LI Hong-qiang, SUN Hong, LI Min-zan. Prediction of Soluble Sugar Content in Cabbage by Near Infrared Spectrometer[J]. Spectroscopy and Spectral Analysis, 2018, 38(10): 3058 Copy Citation Text show less

    Abstract

    Soluble sugar is an effective regulator of the taste of vegetables and fruits. It is also a necessary carbohydrate absorbed and used by human beings. The head cabbage is a common vegetable rich in carbohydrates. Soluble sugar content is an important parameter in determining the nutrient quality of head cabbage. Carbohydrates are made up of carbon, hydrogen and oxygen, and the molecular absorption spectra are mainly composed of the combination bands and overtone bands of C—H, O—H and CO groups, and contain abundant organic matter information. The experiment was conducted to study the rapid detection method of soluble sugar content in head cabbage by near infrared spectroscopy and chemometrics. The experiment collected a total of 161 samples of head cabbage. The spectral data were measured by the MATRIX-I FT-NIR spectrometer made in Bruker Company, Germany, and the soluble sugar was measured by the anthrone colorimetric method. Mahalanobis Distance (MD) method and Monte Carlo cross validation (MCCV) method were used to eliminate the abnormal samples. And then the Kennard-Stone (K-S) method was used to divide all samples into a calibration set and a validation set according to the given ratio. All 12 spectral pretreatment methods including Savitzky-Golay convolution smoothing (S-G), first derivative (FD), second derivative (SD), multiple scatter correction (MSC), variable Standardization (SNV), and their combinations, were used to improve the S/N ratio to find the best pretreatment method from them. The competitive adaptive reweighted sampling (CARS) algorithm was used to select and screen out the optimal wavenumbers with the greater absolute values of the regression coefficients in the PLS model, and to remove the wavenumbers with the small regression coefficients. Thus, the best wave number combination related to the nature of the measurement can be selected to get a good calibration model with good robustness and prediction ability. The coefficient of determination (R2), root mean squared error of cross validation (RMSECV), and root mean squared error of prediction (RMSEP) were used to evaluate models. According to the principles of Monte Carlo cross validation method and Mahalanobis distance method, 10 abnormal samples were eliminated, and finally 151 samples were used in modelling. The samples were divided into calibration set (110 samples) and validation set (41 samples) according to 3∶1 by K-S method. Three PLS models were established by using the original spectral data, the preprocessed spectral data, and the spectral data with optimal wavenumbers, respectively. The modeling results showed that the spectral preprocessing method using MSC combined with FD could well improve modeling accuracy, and the R2 of the calibration model increased from 0.68 to 0.93 and was thought to be the best spectral data preprocessing method in this experiment. The CARS method was applied to select optimal wave numbers for modelling. From 12 000 to 10 000 cm-1, there exist O—H str. second overtone and C—H third overtone, and the main background information in this area is water and other groups containing hydrogen. In this region 36 optimal wavenumbers were selected. From 8 500 to 10 000 cm-1, there exist sugar’s and water’s O—H str. first overtone and glucosamine O—H str. first overtone. This region is the main spectral region containing soluble sugar information and is less affected by the background. 15 optimal wavenumbers were selected in this region. The region of 5 800 to 4 000 cm-1 is similar to the region of 12 000 to 10 000 cm-1, and contains 36 selected optimal wavenumbers. Based on the results of the CARS wave number optimization, a full spectrum PLS model and a CARS-PLS model to estimate the head cabbage soluble sugar content were established. The R2, RMSECV, and RMSEP of the full spectrum PLS model were 0.93, 0.157 2%, and 0.132 8%, respectively. While the R2, RMSECV, and RMSEP of the CARS-PLS model were 0.96, 0.076 8%, and 0.059 4%, respectively. The experimental results showed that both CARS-PLS model and full spectrum PLS model had the similar R2, but the RMSECV of the CARS-PLS model was the 1/2 of that of the full spectrum PLS model, and the RMSEP of the CARS-PLS model was also close to 1/2 of that of the full spectrum PLS model. The CARS algorithm reduced the modeling variables so that the complexity of the model was reduced, and the accuracy of CARS-PLS model was improved. The CARS-PLS model is used to predict 41 samples of the validation set. The R2 of the prediction set is 0.86 and the prediction standard error is 0.059 4%, which meant that the prediction model of soluble sugar content in head cabbage was practical. CARS algorithm can reduce the unrelated information and the complexity of the model, and the wavenumbers selected can introduce both the spectra related components information and the spectra related the background information to improve the adaptivity of the calibration model. It provides a new approach for quality evaluation of head cabbage.
    LI Hong-qiang, SUN Hong, LI Min-zan. Prediction of Soluble Sugar Content in Cabbage by Near Infrared Spectrometer[J]. Spectroscopy and Spectral Analysis, 2018, 38(10): 3058
    Download Citation