• Spectroscopy and Spectral Analysis
  • Vol. 41, Issue 11, 3474 (2021)
De-ming KONG1、*, Hong-jie CHEN1、1;, Xiao-yu CHEN2、2; *;, Rui DONG1、1;, and Shu-tao WANG1、1;
Author Affiliations
  • 11. School of Electrical Engineering, Yanshan University, Qinhuangdao 066004, China
  • 22. School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China
  • show less
    DOI: 10.3964/j.issn.1000-0593(2021)11-3474-06 Cite this Article
    De-ming KONG, Hong-jie CHEN, Xiao-yu CHEN, Rui DONG, Shu-tao WANG. Research on Oil Identification Method Based on Three-Dimensional Fluorescence Spectroscopy Combined With Sparse Principal Component Analysis and Support Vector Machine[J]. Spectroscopy and Spectral Analysis, 2021, 41(11): 3474 Copy Citation Text show less

    Abstract

    The emergence of oil pollution has destroyed the ecological environment. Therefore, the study of oil identification methods is of great significance to the protection of the environment. Petroleum spectrum data can be obtained by fluorescence spectroscopy. At the same time, the spectrum data is preprocessed, and feature information is extracted by dimensionality reduction. Then the pattern recognition algorithm is used for classification, it can realize the qualitative analysis of oil. However, it is vital to study a more efficient way of data dimensionality reduction and recognition algorithms. Based on the three-dimensional fluorescence spectroscopy technology, this paper uses sparse principal component analysis (SPCA) to extract the features of the fluorescence spectrum data measured by the FS920 spectrometer, and the support vector machine (SVM) algorithm applies for classification and recognition, thereby a more efficient oil identification method is obtained. First, seawater and sodium dodecyl sulfate (SDS) was prepared into a micelle solution with a concentration of 0.1 mol·L-1. It was used as a solvent to prepare solutions of 20 different concentrations of 4 kinds of oil: Diesel oil, Jet fuel, Gasoline and Lubricating oil. Then, the three-dimensional fluorescence spectrum was measured by the FS920 spectrometer, and the data schould be preprocessed. Finally, the pre-processed data is extracted using SPCA, and principal component analysis (PCA), and the feature vectors are classified by SVM and K-nearest neighbor (KNN) two pattern recognition algorithms, the classification results of four models PCA-KNN, SPCA-KNN, PCA-SVM and SPCA-SVM are obtained. The research results show that the classification accuracy rates obtained by the four models are 85%, 90%, 90% and 95% respectively. In the same classification algorithm, the classification accuracy obtained by using SPCA is 5% higher than that of PCA. Therefore, SPCA can better highlight the main components in its sparsity, and the sparsity of the load matrix can remove redundant information between variables, achieve the optimization of dimensionality reduction, and provide a better classification for subsequent classification. Effective data feature information; Under the same feature extraction algorithm, the classification accuracy rate obtained by using the SVM algorithm for classification is 5% higher than the accuracy rate obtained by the KNN algorithm, it shows that the SVM algorithm has more advantages in classification. Therefore, this paper uses three-dimensional fluorescence spectroscopy technology combined with SPCA and SVM algorithms to accurately identify petroleum, which provides a new idea for the efficient detection of petroleum pollutants in the future.
    De-ming KONG, Hong-jie CHEN, Xiao-yu CHEN, Rui DONG, Shu-tao WANG. Research on Oil Identification Method Based on Three-Dimensional Fluorescence Spectroscopy Combined With Sparse Principal Component Analysis and Support Vector Machine[J]. Spectroscopy and Spectral Analysis, 2021, 41(11): 3474
    Download Citation