• Spectroscopy and Spectral Analysis
  • Vol. 43, Issue 3, 861 (2023)
YIN Xiong-yi1、*, SHI Yuan-bo1, WANG Sheng-jun2, JIAO Xian-he2, and KONG Xian-ming2
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • show less
    DOI: 10.3964/j.issn.1000-0593(2023)03-0861-06 Cite this Article
    YIN Xiong-yi, SHI Yuan-bo, WANG Sheng-jun, JIAO Xian-he, KONG Xian-ming. Quantitative Analysis of Polycyclic Aromatic Hydrocarbons by Raman Spectroscopy Based on ML-PCA-BP Model[J]. Spectroscopy and Spectral Analysis, 2023, 43(3): 861 Copy Citation Text show less

    Abstract

    Pyrene, a kind of polycyclic aromatic hydrocarbons (PAHs), widely exists in the natural environment. It has strong lipophilicity and carcinogenic effect on the human body. Therefore, the rapid analysis of pyrene content in edible oil has far-reaching significance for quality control. The quantitative analysis of polycyclic aromatic hydrocarbons using Raman spectroscopy and artificial intelligence algorithm is a current research hotspot. One milliliter of edible oil is mixed with pyrene liquid with different fixed concentrations to make samples, and then a thin-layer chromatography plate and gold particles are made. The experiment is carried out by combining thin-layer chromatography, and surface-enhanced Raman scattering (SERS) spectrum to obtain the spectral data. The adaptive iterative weighted penalty least square algorithm is selected for preprocessing, Then the Multi parameter-Principal Component Analysis- Back Propagation Neural Network model was used for quantitative analysis. Firstly, two characteristic peaks are selected in the preprocessed spectrum for peak fitting, and the parameters such as height, half-width, height and area of characteristic peaks are obtained. Normalized the Raman data of the two characteristic peaks and the parameters obtained by fitting, and then use the principal component analysis to obtain the key parameters. The obtained key parameters are input into the BP neural network based on L2 regularization as the input layer to output the predicted concentration. The experimental results show that the R2 determination coefficient of the test set is 0.58 and the root mean square error (RMSEC) is 1.85; The linear regression is used to fit the law between the characteristic peak area and pyrene concentration. The final predicted pyrene concentration has an R2 determination coefficient of 0.26, and a root mean square error (RMSEC) of 2.28; For the pyrene concentration predicted by the Multi parameter-Principal Component Analysis-Back Propagation Neural Network model, the R2 determination coefficient of the test set is 0.99, and the root mean square error (RMSEC) is 0.31. The multi-parameter principal component analysis-back propagation neural network model has higher measurement accuracy and less error. The model is aimed at the nonlinear and high-dimensional relationship between spectral data information and sample concentration. The prediction accuracy and modeling efficiency are higher than similar comparison algorithms. The model fits the characteristic peak to obtain the key variables and takes the Raman displacement of the variable and the characteristic peak as the characteristic vector, so the characteristic vector is sufficient. The model uses PCA to extract the nonlinear characteristics of the Raman spectrum and adopts the advantages of strong generalization based on L2 regularization BP neural network to prevent overfitting, so that it can predict the concentration of naphthalene more accurately and quickly.
    YIN Xiong-yi, SHI Yuan-bo, WANG Sheng-jun, JIAO Xian-he, KONG Xian-ming. Quantitative Analysis of Polycyclic Aromatic Hydrocarbons by Raman Spectroscopy Based on ML-PCA-BP Model[J]. Spectroscopy and Spectral Analysis, 2023, 43(3): 861
    Download Citation