• Spectroscopy and Spectral Analysis
  • Vol. 42, Issue 6, 1721 (2022)
Ming-xuan WANG*, Qiao-yun WANG*;, Fei-fei PIAN, Peng SHAN, Zhi-gang LI, and Zhen-he MA
Author Affiliations
  • College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
  • show less
    DOI: 10.3964/j.issn.1000-0593(2022)06-1721-07 Cite this Article
    Ming-xuan WANG, Qiao-yun WANG, Fei-fei PIAN, Peng SHAN, Zhi-gang LI, Zhen-he MA. Quantitative Analysis of Diabetic Blood Raman Spectroscopy Based on XGBoost[J]. Spectroscopy and Spectral Analysis, 2022, 42(6): 1721 Copy Citation Text show less
    Original Raman spectra of blood samples
    Fig. 1. Original Raman spectra of blood samples
    Schematic diagram of XGBoost algorithm
    Fig. 2. Schematic diagram of XGBoost algorithm
    XGBoost Hyper-parameters tuning
    Fig. 3. XGBoost Hyper-parameters tuning
    Model prediction regression diagram(a): DT model; (b): SVR model; (c): XGBoost model; (d): RF model
    Fig. 4. Model prediction regression diagram
    (a): DT model; (b): SVR model; (c): XGBoost model; (d): RF model
    Clarke grid error analysis diagram(a): DT Clark grid error analysis; (b): SVR Clark grid error analysis; (c): XGBoost Clark grid error analysis; (d): RF Clark grid error analysis
    Fig. 5. Clarke grid error analysis diagram
    (a): DT Clark grid error analysis; (b): SVR Clark grid error analysis; (c): XGBoost Clark grid error analysis; (d): RF Clark grid error analysis
    Algorithm: Exact Greedy Algorithm for Split Finding
    Input: I, instance set of current node
    Input: d, feauure dimension
    gain0GiIgi,HiIhi
    for k=1 to m do
    GL←0, HL←0
    for j in sorted (I, by xjk) do
    GLGL+gj, HLHL+hj
    GRG-GL, HRH-HL
    score←maxscore,GL2HL+λ+GR2HR+λ-G2H+λ
    end
    Output: Split with max score
    Table 1. Schematic diagram of greedy algorithm
    超参数超参数含义数值
    learning_rate在迭代更新中使用步长收缩防止过度拟合0.3
    n_estimators模型中增加树的数量150
    max_depth树的最大深度7
    min_child_weight在树构建过程中进一步划分叶节点5
    subsample每棵树对样本的采样率1
    colsample_bytree列采样率1
    gamma损失函数比较值0
    reg_alphaL1正则化项权重0
    reg_lambdaL2正则化权重项1
    Table 2. Hyper-parameters
    模型R2RMSECRMSEPRPD
    RF0.873 010.841 340.981 952.370 43
    SVR0.821 990.953 561.067 792.805 87
    XGBoost0.999 990.007 490.007 17331.973 18
    DT0.908 570.960 410.878 493.308
    Table 3. Model evaluation
    Ming-xuan WANG, Qiao-yun WANG, Fei-fei PIAN, Peng SHAN, Zhi-gang LI, Zhen-he MA. Quantitative Analysis of Diabetic Blood Raman Spectroscopy Based on XGBoost[J]. Spectroscopy and Spectral Analysis, 2022, 42(6): 1721
    Download Citation