• Laser & Optoelectronics Progress
  • Vol. 59, Issue 12, 1215011 (2022)
Zhaoliang Meng1、2、3, Zetao Zhang1、*, Yuan Yang2, Guofeng Li3, Chongbo Tao3, and Yijiang Niu3
Author Affiliations
  • 1College of Electronic Information, Xi’an Polytechnic University, Xi’an 710600, Shaanxi , China
  • 2International Engineering College, Xi’an University of Technology, Xi’an 710048, Shaanxi , China
  • 3Power Electronics Division of CRRC Yongji Motor Co., Ltd., Xi’an 710000, Shaanxi , China
  • show less
    DOI: 10.3788/LOP202259.1215011 Cite this Article Set citation alerts
    Zhaoliang Meng, Zetao Zhang, Yuan Yang, Guofeng Li, Chongbo Tao, Yijiang Niu. Improved XGBoost Stray Current Prediction and Interpretable Model[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1215011 Copy Citation Text show less

    Abstract

    To address the issue that there are many characteristics influencing the stray current of a subway track, the conventional feature selection method affects the prediction accuracy of the model, and the interpretability of the model results is poor, this paper proposes a stray current prediction model based on optimal feature improved eXtreme Gradient Boosting (XGBoost). Using the flexibility and the strong searchability of the genetic algorithm, we found the first M features that minimizing the mean square error (MSE) of the objective function generation by generation in the set containing the original V features. Simultaneously, the stray current prediction model under the optimal feature selection method (OFS-XGBoost) is established. To address the issue that the prediction results of the OFS-XGBoost are good, however, the machine learning black-box model has an insufficient explanatory ability for the prediction results, an attribution analysis framework based on SHAP theory is proposed to show the influence of feature set on the prediction results of the model in an understandable way based on the marginal contribution of stray current feature samples to improve the inference accuracy. The results show that the prediction error of the proposed model is only 1.684%, which is lower than the prediction models such as random forest and back propagation (BP) neural network under the same optimization strategy. The attribution analysis method based on SHAP value explains the impact of input characteristics on stray current prediction results from a global and individual perspective, helping intelligent subway health management based on improving model interpretability.
    Zhaoliang Meng, Zetao Zhang, Yuan Yang, Guofeng Li, Chongbo Tao, Yijiang Niu. Improved XGBoost Stray Current Prediction and Interpretable Model[J]. Laser & Optoelectronics Progress, 2022, 59(12): 1215011
    Download Citation