• Spectroscopy and Spectral Analysis
  • Vol. 34, Issue 10, 2868 (2014)
DENG Ling-li1、2、*, Cheng Kian-Kai3, SHEN Gui-ping1, ZHOU Ling1, LIU Xin-zhuo1, DONG Ji-yang1, and CHEN Zhong1
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3Department of Bioprocess Engineering & Institute of Bioproduct Development, Universiti Teknologi Malaysia, Skudai81310, Malaysia
  • show less
    DOI: 10.3964/j.issn.1000-0593(2014)10-2868-05 Cite this Article
    DENG Ling-li, Cheng Kian-Kai, SHEN Gui-ping, ZHOU Ling, LIU Xin-zhuo, DONG Ji-yang, CHEN Zhong. A Novel Metabolomic Data Scaling Method Based on K-L Divergence[J]. Spectroscopy and Spectral Analysis, 2014, 34(10): 2868 Copy Citation Text show less

    Abstract

    A new scaling method in the current study based on Kullback-Leibler (K-L) divergence is proposed for NMR metabolomic data. The proposed method (called K-L scaling) is a supervised scaling method as group information is incorporated in the scaling procedure. Notably, K-L divergence measures the difference between two different datasets by their probability distributions, it can be used for the analysis of data that either follows Gaussian or non-Gaussian distributions. In K-L scaling, all variables were first standardized to unit variance, then their variance was adjusted using Kullback-Leibler divergence to highlight the significant variables. K-L scaling can tell effectively the difference in spectral data points between two experimental groups, and then enhances the weights of biological-relevant variables, and at the same time reduces the weight of noise and uninformative variables. The developed method was applied to a 1H-NMR metabolomic dataset acquired from human urine. Analysis results of the dataset showed that this new scaling method is efficient in suppressing the contribution of noise in the resulting multivariate model. In addition, it can increase the weights of important variables, and improve the interpretability and predictability of subsequent principal component regression (PCR) and partial least squares discriminant analysis (PLS-DA). Furthermore, the scaling method facilitated the identification of metabolic signatures. The current result suggested that the developed K-L scaling method may become a useful alternative for the preprocessing of NMR-based metabolomic data.
    DENG Ling-li, Cheng Kian-Kai, SHEN Gui-ping, ZHOU Ling, LIU Xin-zhuo, DONG Ji-yang, CHEN Zhong. A Novel Metabolomic Data Scaling Method Based on K-L Divergence[J]. Spectroscopy and Spectral Analysis, 2014, 34(10): 2868
    Download Citation