• Spectroscopy and Spectral Analysis
  • Vol. 37, Issue 11, 3409 (2017)
PENG Dan, YUE Jin-xia, and BI Yan-lan
Author Affiliations
  • [in Chinese]
  • show less
    DOI: 10.3964/j.issn.1000-0593(2017)11-3409-05 Cite this Article
    PENG Dan, YUE Jin-xia, BI Yan-lan. Information Extraction of Near Infrared Spectra for Complex Samples Based on Wavelet Packet Transform and Entropy Theory[J]. Spectroscopy and Spectral Analysis, 2017, 37(11): 3409 Copy Citation Text show less

    Abstract

    The method of useful information extraction is one of the most important one in the detection application for complex samples using near infrared (NIR) spectroscopy. Due to various noise, baseline drift, peak overlapping and complex background, informative spectra were hidden and cannot be extracted by conventional methods. Thus, a new hybrid algorithm (EWPIE) was proposed for informative spectra extraction based on wavelet packet transform (WPT) and entropy theory in this study. In EWPIE algorithm, WPT algorithm and its reconstruction algorithm were adopted to split the raw spectra into serval subspectra in different frequency bands. To take advantage of the multiscale property of NIR spectroscopy, each subspectra was further processed through entropy-based phase and orthogonal signal correction (OSC)-based phase. In entropy-based phase, the information entropy theory was used as a filter to remove the interference, which are uncorrelated to analyte contents and would increase the uncertainty of whole spectra. According to the variation of entropy value, some subspectra representing basedline in low frequency band and some subspectra representing noise in high frequency band were filtered out. In OSC-based phase, the OSC algorithm was applied to each of the remaining subspectra and the useful spectra were obtained with accumulation of all the OSC-filtered subspectra. To validate this algorithm, a real NIR spectral dataset of milk was prepared to extract the correlated spectra about the content of fat and protein. Using the EWPIE-filtered spectra, a series of PLS prediction model were constructed. Experimental results show that the prediction ability and robustness of PLS_based prediction models developed with EWPIE algorithm are superior to those developed by conventional algorithms. The root mean square errors of the prediction models for fat and protein can reach up to 0.132% and 0.121%, which indicates that the EWPIE algorithm is a promise tool to extract the useful information from NIR spectra and has certain theoretical significance and practical application value for detection in complex systems.
    PENG Dan, YUE Jin-xia, BI Yan-lan. Information Extraction of Near Infrared Spectra for Complex Samples Based on Wavelet Packet Transform and Entropy Theory[J]. Spectroscopy and Spectral Analysis, 2017, 37(11): 3409
    Download Citation