• Spectroscopy and Spectral Analysis
  • Vol. 42, Issue 9, 2947 (2022)
Rui LI1、1;, Bo LI1、1; *;, Xue-wen WANG1、1;, Tao LIU1、1;, Lian-jie LI1、1; 2;, and Shu-xiang FAN2、2;
Author Affiliations
  • 11. College of Mechanical and Vehicle Engineering, Taiyuan University of Technology, Taiyuan 030024, China
  • 22. Beijing Research Center of Intelligent Equipment for Agriculture, Beijing 100097, China
  • show less
    DOI: 10.3964/j.issn.1000-0593(2022)09-2947-09 Cite this Article
    Rui LI, Bo LI, Xue-wen WANG, Tao LIU, Lian-jie LI, Shu-xiang FAN. A Classification Method of Coal and Gangue Based on XGBoost and Visible-Near Infrared Spectroscopy[J]. Spectroscopy and Spectral Analysis, 2022, 42(9): 2947 Copy Citation Text show less

    Abstract

    Intelligent recognition of coal and gangue is a new technology that needs to be developed urgently to realize the intelligentization of fully mechanized caving mining. Visible-near infrared spectroscopy technology has many advantages such as environmental friendly and real-time, which meets the requirements of intelligent separation of coal and gangue. The Extreme Gradient Boosting Tree (XGBoost) algorithm which performed well in data science competitions, was introducedto achieve the recognition of coal and gangue based on visible-near infrared spectroscopy. Firstly, a visible-near infrared spectroscopy experimental platform was built to collect the reflectance spectra of lump coal and gangue samples from Shanxi Ximing, Shaanxi Shenmu, and Inner Mongolia Balongtu coal mines in the range of 370~1 049 nm. The collected original spectra were preprocessed through black and white correction, method of removing the start and end bands, Savitzky-Golay (SG) smoothing and standard normal variable transformation (SNV) to reduce the effects of uneven illumination, noise and optical path difference. Secondly, the experimental group and test group were divided according to the difference of reflection spectrum of samples from different mines. The experimental group had a minor difference, which was used to compare the performance of different models and select the best algorithm; the difference of test groups was obvious, which was used to test the performance of the best algorithm in other coal mines and verified the applicability of the algorithm to different coal mines. In the experiment of the experimental group, the coal and gangue classification model was established based on the XGBoost algorithm, and the commonly used machine learning classification algorithms k-nearest neighbor method (KNN), random forest (RF), support vector machine (SVM), which were introduced for comparison. The results showed XGBoost performed best. The average accuracy of 10-fold cross-validation (ACC10), classification accuracy (ACC), and AUC values respectively reached 0.957 2, 0.970 5, and 0.971 6, showing strong stability and classification capabilities. Then in order to reduce the data dimension and calculations, recursive feature elimination (RFE), successive projections algorithm (SPA) and competitive adaptive reweighted sampling (CARS) were used to select the characteristic wavelength and combined with the above four classification algorithms to construct a simplified classification model, respectively. The simplified model of the combination of RFE and XGBoost(RFE-XGB) performed best in the test. The ACC10, ACC, AUC was 0.965 7, 0.980 3, 0.980 3, respectively, and the data dimension reduced to 9. Simplified model improved the stability and classification ability of the model while reducing the data dimension. In the experiment of the test group, the model based on XGBoost and RFE-XGB algorithms can also achieve stable and accurate recognition of coal and gangue in other coal mines, and the simplified model performed better, which was consistent with the results of the experimental group.
    Rui LI, Bo LI, Xue-wen WANG, Tao LIU, Lian-jie LI, Shu-xiang FAN. A Classification Method of Coal and Gangue Based on XGBoost and Visible-Near Infrared Spectroscopy[J]. Spectroscopy and Spectral Analysis, 2022, 42(9): 2947
    Download Citation