• Spectroscopy and Spectral Analysis
  • Vol. 41, Issue 11, 3331 (2021)
Yan-kun LI1、1; *;, Ru-nan DONG1、1;, Jin ZHANG2、2;, Ke-nan HUANG3、3;, and Zhi-yi MAO4、4;
Author Affiliations
  • 11. Department of Environmental Science and Engineering, North China Electric Power University, Hebei Key Lab of Power Plant Flue Gas Multi-Pollutants Control, Baoding 071003, China
  • 22. School of Food Science, Guizhou Medical University, Guiyang 550025, China
  • 33. The 82nd Army Group Hospital of the Chinese People’s Liberation Army, Baoding 071000, China
  • 44. Tianjin Building Material Science Research Academy, Tianjin 300110, China
  • show less
    DOI: 10.3964/j.issn.1000-0593(2021)11-3331-08 Cite this Article
    Yan-kun LI, Ru-nan DONG, Jin ZHANG, Ke-nan HUANG, Zhi-yi MAO. Variable Selection Methods in Spectral Data Analysis[J]. Spectroscopy and Spectral Analysis, 2021, 41(11): 3331 Copy Citation Text show less

    Abstract

    How to extract useful information from massive or high-dimensional data is a huge challenge for current data analysis and a hot spot of current research. Variable selection technology can extract feature information variables from numerous and complex measurement data, and achieve the purpose of simplifying multivariate model and even improving the model’s prediction performance. In spectral analysis, the measurement data will inevitably contain interference and irrelevant information variables and the multicollin earity among variables, which will affect the robustness and prediction ability of the model. Therefore, the variable(wavelength) selection methods have progressed greatly in the research and application of spectral analysis. Based on the related pieces of literature and the author’s research experiences, this paper summarizes the proposals, characteristics, developments, categories, comparisons and applications in recent five yearsof methods for selecting variables not only in near-infrared spectra area but also in fields of mid-infrared spectra, Raman spectra and other spectra. The parameters as their criteria or thresholds for evaluating the importance of variables and the strategies or tracks of selecting variables are vital. Moreover, each method has its advantages and limitations. In practice, it is necessary to select the appropriate method according to the characteristics of boththe method and the object. Key contents: (1) Compared the wavelength selection, and wavelength interval selection methods; (2) Summarized the different variable selection methods based on PLS model parameters; (3) Classified and overviewed the variable selection methods according to the strategiesof searching and selection of variables. Finally, we discuss the problems of variable selection methods (such as overfitting and instability etc.) appearing in the actual system and the corresponding solutions. Meantime, there look forward to the research trend, development prospect and application direction of the variable selection methods. Among them, new criteria for evaluating the importance and new selection strategy of variables still require further research. It is expected that this paper will play a positive role in promoting the follow-up researches and applications of variable selection technology.
    Yan-kun LI, Ru-nan DONG, Jin ZHANG, Ke-nan HUANG, Zhi-yi MAO. Variable Selection Methods in Spectral Data Analysis[J]. Spectroscopy and Spectral Analysis, 2021, 41(11): 3331
    Download Citation