• Spectroscopy and Spectral Analysis
  • Vol. 40, Issue 12, 3772 (2020)
Shan-ke HU1、1、*, Yu-hua QIN1、1, Ru-min DUAN1、1, Li-jun WU1、1, and Hui-li GONG1、1
Author Affiliations
  • 1[in Chinese]
  • 11. College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
  • show less
    DOI: 10.3964/j.issn.1000-0593(2020)12-3772-06 Cite this Article
    Shan-ke HU, Yu-hua QIN, Ru-min DUAN, Li-jun WU, Hui-li GONG. Research on Feature Extraction of Near-Infrared Spectroscopy Based on Joint Matrix Local Preserving Projection[J]. Spectroscopy and Spectral Analysis, 2020, 40(12): 3772 Copy Citation Text show less

    Abstract

    Aiming at the problem that the high-dimensional, high-noise, overlap and nonlinear features of the near-infrared spectrum seriously affect the modeling accuracy, a feature extraction method based on joint matrix local preservation projection (JMLPP) is proposed in this paper. First, the cluster-based spectral feature selection is used for effective features extraction. According to kinds of indicators with a strong correlation of classification, the samples are divided into kinds of different clustering modes. Based on the idea of strong intra-class correlation and great inter-class difference, the intra-class threshold and the inter-class threshold are determined by adjusting the intra-class parameter and the inter-class parameter . The spectral feature regions are selected according to kinds of different clustering modes, and feature matrices are obtained, whereas a joint matrix is generated by the union operation. Cluster-based feature extraction eliminates features with low intra-class correlation and high correlation between classes, and realizes the elimination of noise information in the spectrum. Secondly, the local preservation projection algorithm (LPP) is improved in this paper from two aspects: the geodesic distance is introduced to construct the neighborhood distance matrix, and the topology between the high-dimensional sample data is better expressed than the Euclidean distance. Meanwhile, the edge weight matrix is also improved, which solves the uncertainty caused by sample sparseness and avoids the loss of effective information. Finally, the improved LPP algorithm is used to reduce the dimensionality of the joint matrix, and the optimal spectral feature subset of the low-dimensional mapping is obtained. In order to verify the effectiveness of the JMLPP algorithm, this paper first compares the JMLPP with PCA and LPP from the perspective of spectral projection. The results show that JMLPP has better classification ability, and the tobacco samples in the projection space are clearly classified, and the effect is obviously better than PCA and LPP. In addition, the results of the model classification are also compared. The classification models were established by using the full spectra and dimension reduction features of the PCA, LPP and JMLPP. The experimental results show that the accuracy of the classification model established by JMLPP algorithm is 93.8%. The sensitivity of the five categories of tobacco grading classification are 95.2%, 93.1%, 94.2%, 92.1%, 92.5%, and the specificities are 99.3%, 98.4%, 98.6%, 97.5%, and 97%, respectively. The accuracy, sensitivity and specificity of the model are significantly higher than the other three methods. The JMLPP algorithm effectively extracts useful information of classification based on cluster-based feature extraction and local preserving projection algorithm, and maintains the local linear relationship of the original data. The stability and accuracy of model are desirable.
    Shan-ke HU, Yu-hua QIN, Ru-min DUAN, Li-jun WU, Hui-li GONG. Research on Feature Extraction of Near-Infrared Spectroscopy Based on Joint Matrix Local Preserving Projection[J]. Spectroscopy and Spectral Analysis, 2020, 40(12): 3772
    Download Citation