• Spectroscopy and Spectral Analysis
  • Vol. 41, Issue 4, 1119 (2021)
JIANG Wei-wei1,*, LU Chang-hua1,2, ZHANG Yu-jun2, JU Wei3, WANG Ji-zhou4, OU Chun-sheng1, and XIAO Ming-xia1
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • 4[in Chinese]
    DOI: 10.3964/j.issn.1000-0593(2021)04-1119-06
    JIANG Wei-wei, LU Chang-hua, ZHANG Yu-jun, JU Wei, WANG Ji-zhou, OU Chun-sheng, XIAO Ming-xia. Research on a Quantitative Regression Model of the Infrared Spectrum Based on the Integrated Learning Algorithm[J]. Spectroscopy and Spectral Analysis, 2021, 41(4): 1119

    Abstract

    In recent years, deep learning has been studied increasingly in data mining, and integrated (ensemble) learning algorithms in deep learning have been applied more and more to classification and quantitative regression, yet integrated learning has seen little use in infrared spectrum analysis. This paper proposes an integrated learning quantitative regression algorithm based on the Blending model. The GBDT algorithm, a linear-kernel support vector machine (LinearSVM) and a radial-kernel support vector machine (RBF SVM) serve as the base learners, and a LinearSVM fuses their predictions. First-derivative preprocessing was applied to the spectral data. The predictions of the GBDT, LinearSVM, RBF SVM and Blending integrated learning models were then analyzed and compared. For the content of active substance and for hardness, the RBF SVM model is the best, with the highest R², the smallest RMSEP and the largest RPD, and the GBDT model is the worst. For tablet quality, the Blending model gives the highest R², 0.8374, while RBF SVM gives the lowest RMSEP, 2.1406, and the largest RPD, 7.4878. For the boiling point, flash point and total aromatics of diesel oil, the Blending model is the best and outperforms every single model. For the cetane number, the GBDT and RBF SVM models are better than the Blending model. For density, both the single models and the integrated model predict well: the R² of the LinearSVM model is 0.9445, and the R² of every other model is higher than 0.99. For freezing point, RBF SVM and LinearSVM are both better than the Blending model. For viscosity, only RBF SVM is better than the Blending model.
The results show that the Blending model integrates the characteristics of the GBDT, LinearSVM and RBF SVM models and that, compared with the single models, its predictions are either among the better ones or the best. This demonstrates that the Blending integrated learning model is well suited to infrared quantitative regression, with high prediction accuracy and generalization ability, and it is of great significance for further research on applying integrated learning algorithms to infrared quantitative regression.
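
The abstract reports R², RMSEP and RPD and mentions first-derivative preprocessing without giving formulas. The following is a minimal Python sketch of how these quantities are commonly computed in chemometrics, not the authors' implementation. It assumes a finite-difference derivative (the paper may have used a different method, such as a smoothing derivative filter) and takes RPD as the sample standard deviation of the reference values divided by RMSEP. All function names are hypothetical.

```python
import math


def first_derivative(spectrum, step=1.0):
    """Finite-difference first derivative of one spectrum (assumed method).

    Central differences are used for interior points and one-sided
    differences at the two ends; `step` is the wavelength spacing.
    """
    n = len(spectrum)
    if n < 2:
        raise ValueError("a spectrum needs at least two points")
    deriv = [0.0] * n
    deriv[0] = (spectrum[1] - spectrum[0]) / step
    deriv[-1] = (spectrum[-1] - spectrum[-2]) / step
    for i in range(1, n - 1):
        deriv[i] = (spectrum[i + 1] - spectrum[i - 1]) / (2.0 * step)
    return deriv


def rmsep(y_true, y_pred):
    """Root mean square error of prediction on a test set."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def rpd(y_true, y_pred):
    """Ratio of performance to deviation: SD(reference values) / RMSEP.

    Uses the sample standard deviation (n - 1 denominator), which is an
    assumed convention; some authors use other variants.
    """
    n = len(y_true)
    mean = sum(y_true) / n
    sd = math.sqrt(sum((t - mean) ** 2 for t in y_true) / (n - 1))
    return sd / rmsep(y_true, y_pred)
```

Under these definitions a better model has R² closer to 1, a smaller RMSEP and a larger RPD, which is how the abstract ranks the GBDT, LinearSVM, RBF SVM and Blending models.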