• Spectroscopy and Spectral Analysis
  • Vol. 42, Issue 3, 984 (2022)
Rui ZHAO1、1;, Hai-yan SONG1、1; *;, Yao ZHAO2、2;, Qin SU1、1;, Wei LI1、1;, Yi-shu SUN1、1;, and Ying-min CHEN1、1;
Author Affiliations
  • 11. College of Agricultural Engineering, Shanxi Agricultural University, Taigu 030800, China
  • 22. College of Horticulture, Shanxi Agricultural University, Taigu 030800, China
  • show less
    DOI: 10.3964/j.issn.1000-0593(2022)03-0984-06 Cite this Article
    Rui ZHAO, Hai-yan SONG, Yao ZHAO, Qin SU, Wei LI, Yi-shu SUN, Ying-min CHEN. Research on Anti-Moisture Interference Soil Organic Matter Model Based on Characteristic Wavelength Integration Algorithm[J]. Spectroscopy and Spectral Analysis, 2022, 42(3): 984 Copy Citation Text show less

    Abstract

    As an important component in soil, soil organic matter (SOM) is a critical nutrition index in the process of crop growth. Rapid and accurate detection of SOM content is of great significance for the fertilization management. In recent years, NIR has been widely used in the rapid detection of SOM. However, soil moisture is one of the important factors that affect the prediction results of SOM. In this study, 140 soil samples were collected in Shanxi Province, and the spectral information with different water content (0%, 5%, 10%, 15%, 17%) was collected by ASD spectrometer (350~2 500 nm). In order to improve the accuracy of the SOM prediction model, a characteristic wavelength integration algorithm (taking the integral absorbance value at characteristic wavelength as the independent variable) was proposed. The results show that: (1) the statistical parameters of the SOM prediction model established by this algorithm are better than the traditional characteristic wavelength modeling method; (2) the moisture correction model established by this algorithm can eliminate the influence of moisture, and the corrected spectra of wet soil samples are closer to the corresponding dry soil samples; (3) the prediction accuracy of wet soil samples is improved. The RP increased by about 0.09 and RMSEP decreased by about 1.72. The results show that the method can effectively reduce the influence of soil moisture on the spectral characteristics of SOM, improve the prediction accuracy of SOM with different water content, and provide theoretical support for the subsequent instrument development.

    Introduction

    Near-infrared spectroscopy (NIRS) has been widely used in analysing and detecting material components in the agricultural field due to its advantages of high efficiency, rapid measurement, and non-destructivity. The soil organic matter (SOM) content is an important indicator to measure soil fertility. The measurement of SOM by NIRS has become an important direction in soil science research.

    At present, the establishment of the SOM prediction model based on near-infrared spectroscopy technology is mainly divided into full band[1,2,3] and characteristic band[1,4-5]. The fullband modeling method will contain more spectral information. However, with the increase of useful information, the irrelevant information redundancy will also increase[1], This not only makes the model too complex, but also is not conducive to the subsequent instrument development. The characteristic band modeling method can significantly reduce the complexity of the model and improve the operation speed. However, because the NIR spectral band is a combination of multiple and combined bands of several different fundamental frequencies, there are a large number of overlapping spectral peaks and shoulder peaks[6], which makes it very difficult to extract the information of NIR spectrum and accurately determine the attribution of the spectral band. Therefore, modeling a certain region band centered on the characteristic wavelength can effectively alleviate the problems caused by single feature wavelength extraction and modeling, and improve the robustness of the model. In addition, due to the obvious absorption peak of soil moisture in the near infrared region, the spectral characteristics of SOM will be covered up[7], so it is necessary to remove the water interference for the prediction of SOM content. Domestic and foreign scholars have put forward some effective methods for the influence of moisture on the spectral measurement of organic matter, such as EPO (external parameter orthogonalization)[8,9,10], DS (direct standardization)[11], and constructing moisture correction coefficient[12].

    Because of the spectral characteristics of NIR spectroscopy and the interference of moisture on the spectral curve of organic matter, the following two works were carried out: (1) A quantitative SOM prediction model was established by taking the integral absorbance value at characteristic wavelength as the independent variable; (2) A new algorithm of soil moisture correction coefficient is constructed based on the integral absorbance value of soil moisture at the characteristic wavelength.

    1 Materials and methods

    1.1 Sample

    In this study, 140 soil samples were collected in Shanxi Province (Taigu, Guandishan, Youyu, Loufan, Ningwu) at a depth of 5~15 cm above the ground. All soil samples were air-dried and divided into two groups evenly. One group was used for SOM content detection. The other was used to prepare soil samples with a moisture content of 0%, 5%, 10%, 15%, and 17%. The SOM content is tested by the potassium dichromate method(National standard method of China, GB9834—88). The statistic results show thatthe maximum content of organic matteris 63.98, the minimum is 4.02, the average is 19.475, and the standard deviation is 14.043.

    1.2 Spectral data collection

    The Field Spec3 spectrometer developed by the American ASD company was used to collect the spectral data of the soil samples. The main working parameters of the instrument are: spectral scanning range is 350~2 500 nm; data interval is 1 nm. After the spectrometer was preheating and stabilizing, soil sample was placed in a petri dish and scraped flat for spectrum collection. The whole spectral scanning process was carried out at room temperature. MATLAB_R2014b software was used for data analysis.

    1.3 Integration algorithm based on characteristic band

    Extraction of characteristic wavelengths can simplify the model, and more importantly, it can eliminate irrelevant or nonlinear variables. However, the wavelength obtained by different extraction methods always fluctuates within a range, and the space with a small fluctuation range is always the effective characteristic band. Therefore, this paper proposes an algorithm that takes the integral absorbance value at characteristic wavelength as the independent variable and establishes the model. It eliminates the local optimization defect of the characteristic wavelength extraction algorithm to a certain extent. The specific steps are as follows:

    (1) Obtain the sensitive wavelength by using spectral characteristic wavelength extraction algorithm;

    (2) Take the extracted characteristic wavelength value as the center, and calculate the approximate area surrounded by the spectral curve by taking a left and right n nm. The area of the rectangle is obtained by taking the difference of adjacent wavelengths as one side of the approximate rectangle and the corresponding absorbance value as the other side;

    (3) Sum of all areas and take it as an independent variable.

    The integration algorithm can be expressed as follows:

    Aλ=i=-mi=mAλ+i×r(1)

    This formula, Aλ represents the absorbance value of the spectrum at the wavelength λ, i is the integration variable, mis the integration range, and ris the data interval of the spectrum curve measured by the spectrometer, here is 1 nm.

    2 Results and discussion

    2.1 Establishment of soil organic matter prediction model

    ASD company’s software View Spec Pro 6.0 was used for spectral preprocessing, and the spectra were obtained after the mean, splicing correction and spectral reflectance logarithm operation.The spectral data of 140 dry soil samples are shown in Fig.1. It can be seen that all samples have typical soil spectral characteristics, the visible light range (350~780 nm) curve shows a clear downward trend, the near-infrared band (780~2 500 nm) spectral curve is relatively flat, and there are obvious absorption peaks near 1 400, 1 950 and 2 200 nm.

    Spectra of all the 140 soil samples

    Figure 1.Spectra of all the 140 soil samples

    2.1.1 Characteristic Wavelength Extraction

    Successive projections algorithm(SPA) is a common and effective method of extracting characteristic wavelengths. This method uses projection analysis of vectors to select effective wavelengths with minimum redundancy and minimum collinearity[13]. Since the noise in the initial range of the instrument is generally high[1], the absorbance value in the range of 400~2 500 nm is selected as the research object in this paper.

    The root means square error (RMSE) distribution curve obtained by extracting characteristic wavelength with SPA and the extraction result of characteristic wavelength are shown in Fig.2. The red block in Fig.2 indicates the number and value of selected characteristic wavelength respectively. The extracted characteristic wavelengths are 627, 425, and 1 796 nm in order of importance, of which 627 nm is consistent with Wang’s two-dimensional spectral extraction results[11], 425 nm is mainly due to the influence of goethite and hematite in soil components[14,15], and 1 796 nm is affected by the content of lignin in soil [16].

    The RMSE and characteristic wavelengths distribution curves after SPA

    Figure 2.The RMSE and characteristic wavelengths distribution curves after SPA

    2.1.2 SOM prediction model

    According to the extracted characteristic wavelengths, the absorbance value of characteristic wavelength and the integral absorbance values at characteristic wavelength are used as independent variables for modeling. The SPXY (sample set partitioning based on joint x-y distance) algorithm divides 110 dry samples for modeling and 30 dry samples for prediction. Since only three characteristic wavelengths are extracted in this study, multiple linear regression (MLR) is selected as the modeling method. In addition, MLR has more advantages in the later instrument development for its simple calculation.

    The equations fitted by the two methods are as follows, which model (Ⅰ) is used the absorbance value of characteristic wavelength and model (Ⅱ) is used the integral absorbance values at a characteristic wavelength. The statistical parameters are shown in Table 1.

    Table Infomation Is Not Enable

    Where y represents the SOM content. A627, A452, A1 796 respectively represent the absorbance values of soil samples at the wavelengths of 627, 452 and 1 796 nm. A627,A452,A1796are the integral absorbance values of soil samples at the wavelengths of 627, 452, and 1 796 nm respectively.

    It can be seen from the statistical parameter table that RP is increased and RMSEP is reduced when the absorbance integral value at characteristic wavelength is taken as the independent variable. Sousing the integral value as an independent variable can make the model more robust.

    2.2 Analysis of spectral prediction results of wet soil

    The average spectra of 140 wet soil samples with different moisture contents are shown in Fig.3. It can be seen from the figure that with the increase of soil moisture, the absorption peaks that appear near the water-sensitive bands of 1 415, 1 929, and 2 210 nm are quite obvious, which is consistent with the research results of Wang[11].

    Average spectra of wet soil samples with different moisture content

    Figure 3.Average spectra of wet soil samples with different moisture content

    2.2.1 Reconstruction of wet soil spectrum

    As soil moisture has obvious absorption peaks around 1 415, 1 929, and 2 210 nm [11], the absorbance values at these three wavelengths construct the soil moisture correction coefficient. The specific algorithm of the correction coefficient is as follows:

    MCC=A2210A1929+A1415×A1929+A1415A2210(2)

    Where MCC represents the moisture correction coefficient, A2 210, A1 929 and A1 415 respectively represent the absorbance values of the wet soil samples at the wavelengths of 2 210, 1 929, and 1 415 nm, A1929,A1415and A2210are the integral absorbance values of the wet soil samples at the wavelengths of 1 929, 1 415, and 2 210 nm respectively, and the corresponding integer values are obtained by formula (1).

    Reconstruct the spectrum of wet soil using the following formula:

    RWSλ=WSλ-MCC×(WSλ-DSλ)n(3)

    Where RWSλ is absorbance values at wavelength λ of samples with different moisture content after correction; WSλ and DSλ are the absorbance values of wet soil and dry soil under corresponding wavelength λ, andnis the number of samples.

    The reconstructed spectral curves are shown in Fig.4(a)—(d) (taking the organic matter content of 32.57 g·kg-1 as an example). It can be seen that the peak value of the reconstructed spectra in the water-sensitive band decreases significantly and is closer to the spectral curve of dry soil. In addition,when the moisture content is high (such as 15%, 17%), the correction effect of this method is better than Hu[17]. The results show that this method can solve the influence of different moisture content on the change of soil spectral absorbance.

    Spectra of reconstructed soil sample in different moisture content (taking the organic matter content of 32.57 g·kg-1 as an example)(a): 5%; (b): 10%; (c): 15%; (d): 17%

    Figure 4.Spectra of reconstructed soil sample in different moisture content (taking the organic matter content of 32.57 g·kg-1 as an example)
    (a): 5%; (b): 10%; (c): 15%; (d): 17%

    2.2.2 Prediction model and evaluation of wet soil

    In order to verify the prediction effect of the two models for wet soil samples, the measured and predicted values of organic matter in wet soil were fitted and analyzed (Fig.5). Among them, Fig.5(a) is model Ⅰ, Fig.5(b) is model Ⅱ. In Fig.5, wet soil samples are marked red, and the corrected wet soil samples are marked with blue. The closer the fitting result is to 1∶1 (line in the figure), the higher the accuracy of model prediction is, and the related statistical parameters are shown in Table 2.

    Wavelength modeling (a) and integral modeling (b) comparison of SOM measured and SOM predicted values

    Figure 5.Wavelength modeling (a) and integral modeling (b) comparison of SOM measured and SOM predicted values

    Table Infomation Is Not Enable

    It can be seen from Fig.5 that the soil moisture correction algorithm can effectively eliminate the moisture affection on SOM prediction. The RP increased by about 0.08 and RMSEP decreased by about 1.63 g·kg-1 in both model Ⅰ and model Ⅱ. Compared with using the absorbance value at characteristic wavelength as the independent variable, the model established by using the integral value as the independent variable has better correlation (RP) and lower error (RMSEP) for prediction results. It shows that model Ⅱ contains more effective information about the SOM than model Ⅰ.

    3 Conclusions

    In this paper, the integral absorbance value at characteristic wavelength was used as an independent variable for modeling, and a soil moisture correction coefficient constructed by an integration algorithm was proposed. The main conclusions are as follows:

    (1) The spectra of soil samples with different moisture content have clearer moisture absorption peaks than dry soil samples, which will mask the characteristic spectral information of SOM and affect the prediction accuracy of SOM content.

    (2) Compared with the traditional characteristic wavelength modeling method, using the integral absorbance value at a characteristic wavelength can cover more effective bands. After verification, the model can be improved to a certain extent by using the integral absorbance value at characteristic wavelength as the independent variable.

    (3) Moisture correction coefficient algorithm using the integral idea was proposed, which can obviously eliminate moisture’s influence.

    (4) The combination of integrated modeling method and moisture correction coefficient can effectively eliminate the influence of soil moisture on the prediction of organic matter content, which can be used as the theoretical basis for the establishment of SOM anti-moisture interference model and instrument development.

    References

    [1] Xiaoli Chu. Practical Handbook of Near Infrared Spectroscopy. China Machine Press(2016).

    [2] Lihua Zhen, Minzan Li, Xiaofei An et al. Transactions of the Chinese Society of Agricultural Engineering, 26, 81(2016).

    [3] Yongchao Tian, Juanjuan Zhang, Xia Yao et al. Transactions of the Chinese Society of Agricultural Engineering, 28, 145(2012).

    [4] Lingyu Yang, Guanwen Li, Xiaohong Gao et al. Soil Bulletin, 48, 1360(2017).

    [5] Yong Zhou, Lei Yu, Yongsheng Hong et al. Transactions of the Chinese Society of Agricultural Engineering, 32, 95(2016).

    [6] Xiang Luo, Huajun Qin, Boping Liu et al. Advanced Measurement and Laboratory Management, 23(2007).

    [7] J Roberts J, D Cozzolino. NIR News, 28, 3(2017).

    [8] Lei Yu, Yongsheng Hong, Yaxing Zhu et al. Acta Pedologica Sinica, 1068(2017).

    [9] K Wang C, Z Pan X. Journal of Near Infrared Spectroscopy, 24, 231(2016).

    [10] Ya Liu, Changkun Wang, Xianzhang Pan et al. PLOS ONE, 10(2015).

    [11] Shi-fang WANG, Hai-yan SONG, Ping HAN et al. Spectroscopy and Spectral Analysis, 39, 1986(2019).

    [12] Lei Zhang, Rongbiao Zhang. Polish Journal of Environmental, 26, 395(2017).

    [13] Yu Li, Guangfei Wei, Zhitao Zhang et al. Peerj, 8, e9087(2020).

    [14] S Grunwald, M L McDowell, M Knox N et al. Geoderma, 239, 229(2015).

    [15] A Viscarra Rossel R, T Behrens. Geoderma, 158, 46(2010).

    [16] Shan-jun LIU, Ni-sha BAO, Wei ZHAO et al. Spectroscopy and Spectral Analysis, 40, 2188(2020).

    [17] Xu CUI, Xiao-yan HU, Xiao-ping HAN et al. Spectroscopy and Spectral Analysis, 39, 1059(2019).

    Rui ZHAO, Hai-yan SONG, Yao ZHAO, Qin SU, Wei LI, Yi-shu SUN, Ying-min CHEN. Research on Anti-Moisture Interference Soil Organic Matter Model Based on Characteristic Wavelength Integration Algorithm[J]. Spectroscopy and Spectral Analysis, 2022, 42(3): 984
    Download Citation