• Spectroscopy and Spectral Analysis
  • Vol. 43, Issue 5, 1471 (2023)
CHEN Dong-ying11, ZHANG Hao1, ZHANG Zi-long2, YU Mu-xin2, and CHEN Lu3
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • show less
    DOI: 10.3964/j.issn.1000-0593(2023)05-1471-07 Cite this Article
    CHEN Dong-ying1, ZHANG Hao, ZHANG Zi-long, YU Mu-xin, CHEN Lu. Research on the Origin Traceability of Honeysuckle Based on Improved 1D-VD-CNN and Near-Infrared Spectral Data[J]. Spectroscopy and Spectral Analysis, 2023, 43(5): 1471 Copy Citation Text show less

    Abstract

    Honeysuckle is an essential medicine for clearing away heat and detoxifying. However, the sources of honeysuckle on the market are complicated, and the most famous honeysuckle produced in Pingyi, Shandong, is often counterfeited. Most existing identification methods are time-consuming, costly, and complex to operate. Therefore, a fast and efficient way to trace the honeysuckle’s origin is urgently needed. The current one-dimensional convolutional neural network (1D-CNN) identification model based on honeysuckle near-infrared spectroscopy (NIRS) data has the problems of too many parameters and too low model efficiency, high computational complexity and is prone to overfitting. This paper improves the traditional 1D-CNN structure. We use the more efficient VD (Very Deep) structure to replace the hidden layer structure in the conventional 1D-CNN and make adaptive improvements for NIRS data so that the model can be directly applied to one-dimensional NIRS data. The improvement method is divided into three steps: firstly, the design of the feature layer is converted into two constraints optimization design: the first constraint is to set the C value of each convolution layer (the ratio of the size of the convolution kernel and the receptive field) to 1/6, which can improve the efficiency of the network model; the second constraint is to take the size of the top-level sensory domain as the size of the data vector, which can achieve feature extraction of deeper data and reduce overfitting. Secondly, this design minimizes the output feature vector of the feature layer to a smaller size through the downsampling operation. Finally, use two convolutional layers of size 1×5 and a pooling layer with dropout to downsample the data size to a vector of only one vector instead of a fully connected layer for classification, thereby reducing the number of parameters. In the experiment, 500 honeysuckles samples were collected from the main producing areas of Henan, Shandong, Hebei, and Chongqing. The spectral range used in the test is 908~1 676 nm. The sample set was preprocessed by the KS algorithm, and the training set, validation set and test set were divided by the shuffle algorithm. At last, a honeysuckle origin identification model based on improved 1D-VD-CNN and near-infrared spectroscopy was constructed. The results show that the 1D-VD-CNN training set and test set’s accuracy reach 100%, and the loss value converges around 0.001. Compared with the traditional 1D-CNN model, the training set and test set accuracy of the 1D-VD-CNN model are improved by about 0.5% and 1.4%, respectively, and the number of parameters and FLOPs are reduced by nearly 1 M and 20 M, respectively. At the same time, compared with the original spectral data analysis method and the PLS-DA method, it shows that the 1D-VD-CNN model has higher efficiency and better recognition performance for honeysuckle near-infrared spectral classification.
    CHEN Dong-ying1, ZHANG Hao, ZHANG Zi-long, YU Mu-xin, CHEN Lu. Research on the Origin Traceability of Honeysuckle Based on Improved 1D-VD-CNN and Near-Infrared Spectral Data[J]. Spectroscopy and Spectral Analysis, 2023, 43(5): 1471
    Download Citation