• Spectroscopy and Spectral Analysis
  • Vol. 41, Issue 2, 473 (2021)
Yun-fei SHA1,*, Wen HUANG1, Liang WANG1, Tai-ang LIU1, Bao-hua YUE1, Min-jie LI1, Jing-lin YOU1, Jiong GE1, and Wen-yan XIE1
Author Affiliations
  • 1. Technology Center of Shanghai Tobacco Group Co., Ltd., Shanghai 200082, China
    DOI: 10.3964/j.issn.1000-0593(2021)02-0473-05
    Yun-fei SHA, Wen HUANG, Liang WANG, Tai-ang LIU, Bao-hua YUE, Min-jie LI, Jing-lin YOU, Jiong GE, Wen-yan XIE. Merging MIR and NIR Spectral Data for Flavor Style Determination[J]. Spectroscopy and Spectral Analysis, 2021, 41(2): 473

    Abstract

    Determining the flavor type of tobacco is an important task in the tobacco industry. In this work, 189 tobacco samples of different flavor styles were measured by mid-infrared (MIR) and near-infrared (NIR) spectroscopy. From these measurements, 21 characteristic absorbance values at selected wavelengths in the MIR spectrum and 13 characteristic absorbance values at selected wavelengths in the NIR spectrum were chosen as the main variables. The characteristic data extracted from the MIR and NIR spectra were first submitted to principal component analysis (PCA) separately. The PCA projections showed poor classification when the MIR or NIR data were used alone. The MIR and NIR variables were then submitted to PCA as a merged data set, and the PCA projection calculated from the merged data showed good classification: the three flavor styles (fen-flavor style, medium flavor style and robust flavor style) were clearly separated into their own categories. After the PCA, two variable selection methods, a step-back algorithm and a genetic algorithm, were applied to the 34 variables used in the PCA model. The step-back algorithm selected 24 variables and the genetic algorithm selected 19. Comparing the projections obtained with the variables chosen by each method, we found that although the genetic algorithm used the fewest variables, its classification result was as good as that of the PCA on all 34 variables and that of the step-back selection. The genetic algorithm selection was therefore used to draw a projection that separates the three flavor styles into distinct regions using the fewest variables from the merged MIR and NIR data. Finally, a support vector classification (SVC) model was built on the variables selected by the genetic algorithm to determine the tobacco flavor style. The accuracy of the model was 92.72%, and the accuracies in discriminating fen-flavor style, medium flavor style and robust flavor style were 93.75%, 92.11% and 91.84%, respectively. The predictive accuracy was tested by leave-one-out cross-validation (LOOCV). The LOOCV accuracy was 88.24%, and the accuracies in discriminating fen-flavor style, medium flavor style and robust flavor style were 90.63%, 86.84% and 87.76%, respectively. The accuracy in predicting unknown samples was 86.84%, and the accuracies in discriminating fen-flavor style, medium flavor style and robust flavor style were 88.24%, 85.71% and 85.71%, respectively. The accuracy is therefore above 85% in the model test, the LOOCV test and the prediction of unknown samples. These results show that merging data from the MIR and NIR spectra provides more information for building the mathematical model and offers an efficient route to fast discrimination of tobacco flavor styles.
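The merging and validation steps described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic (MIR is constructed to separate only the fen-flavor class and NIR only the robust class), the function names are hypothetical, and a simple nearest-centroid classifier stands in for the SVC model, which the abstract does not specify in detail. Only the variable counts (21 MIR, 13 NIR) and the use of leave-one-out cross-validation come from the abstract.

```python
import random

N_MIR, N_NIR = 21, 13  # variable counts quoted in the abstract


def make_synthetic(n_per_class=20, seed=0):
    """Hypothetical data: MIR separates only 'fen', NIR separates only 'robust'."""
    rng = random.Random(seed)
    mir, nir, labels = [], [], []
    for label in ("fen", "medium", "robust"):
        mir_shift = 5.0 if label == "fen" else 0.0
        nir_shift = 5.0 if label == "robust" else 0.0
        for _ in range(n_per_class):
            mir.append([rng.gauss(mir_shift, 1.0) for _ in range(N_MIR)])
            nir.append([rng.gauss(nir_shift, 1.0) for _ in range(N_NIR)])
            labels.append(label)
    return mir, nir, labels


def merge_blocks(mir_rows, nir_rows):
    """Merge the two blocks by concatenating each sample's MIR and NIR variables."""
    if len(mir_rows) != len(nir_rows):
        raise ValueError("MIR and NIR blocks must describe the same samples")
    return [list(m) + list(n) for m, n in zip(mir_rows, nir_rows)]


def nearest_centroid_predict(train_x, train_y, query):
    """Stand-in classifier (not the SVC of the abstract): closest class mean wins."""
    sums, counts = {}, {}
    for row, label in zip(train_x, train_y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1

    def squared_distance(label):
        n = counts[label]
        return sum((s / n - q) ** 2 for s, q in zip(sums[label], query))

    return min(sums, key=squared_distance)


def loocv_accuracy(x, y):
    """Leave-one-out CV: hold out each sample once, train on all the others."""
    hits, totals = {}, {}
    for i in range(len(x)):
        train_x = x[:i] + x[i + 1:]
        train_y = y[:i] + y[i + 1:]
        predicted = nearest_centroid_predict(train_x, train_y, x[i])
        totals[y[i]] = totals.get(y[i], 0) + 1
        hits[y[i]] = hits.get(y[i], 0) + (predicted == y[i])
    per_class = {label: hits[label] / totals[label] for label in totals}
    overall = sum(hits.values()) / len(x)
    return overall, per_class


mir, nir, labels = make_synthetic()
for name, block in (("MIR only", mir), ("NIR only", nir),
                    ("merged", merge_blocks(mir, nir))):
    overall, per_class = loocv_accuracy(block, labels)
    print(name, round(overall, 4), per_class)
```

On data built this way, each single block leaves two classes indistinguishable, while the merged block carries both sources of separation. This mirrors the qualitative claim of the abstract but says nothing about the reported accuracies, which depend on the real spectra and the actual SVC model.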