• Spectroscopy and Spectral Analysis
  • Vol. 43, Issue 2, 638 (2023)
LI Xiao1, CHEN Yong2, MEI Wu-jun3, WU Xiao-hong2, FENG Ya-jie1, and WU Bin4
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • 4[in Chinese]
  • show less
    DOI: 10.3964/j.issn.1000-0593(2023)02-0638-06 Cite this Article
    LI Xiao, CHEN Yong, MEI Wu-jun, WU Xiao-hong, FENG Ya-jie, WU Bin. Classification of Tea Varieties Using Fuzzy Covariance Learning Vector Quantization[J]. Spectroscopy and Spectral Analysis, 2023, 43(2): 638 Copy Citation Text show less

    Abstract

    Tea, with much nutrition, is one of the most popular drinks in the world. Good and bad tea are mixed at the market, so it is difficult to make a classification among them. Therefore, using a fast and accurate method to identify tea varieties is meaningful. Most chemical compound’s fundamental frequency absorption bands are within the wavelength range of 2 500~25 000 nm (Mid-infrared region). Large amounts of feature discriminant information in the mid-infrared spectra of tea can be applied to classify tea varieties. This paper proposed a fuzzy covariance the learning vector quantization (FCLVQ) based on the Gustafson-Kessel (GK) clustering. It introduces learning rate of learning vector quantization (LVQ) to control the update rate of cluster centers. Combined with mid-infrared spectroscopy, FCLVQ realizes fast and accurate identification of tea varieties by iteratively calculating the fuzzy membership values and fuzzy clustering centers of samples. Three different kinds of tea(i. e. Emeishan tea, high-quality bamboo-leaf-green tea and low-quality bamboo-leaf-green tea) were selected as 96 samples in total at the market. Each variety corresponds to one group, which consists of 32 samples. The Fourier mid-infrared spectra were collected using an FTIR-7600 spectrometer, and the average spectral data were computed as the final experimental spectra. Firstly, the original spectral data contained noise data, so they were pretreated with multiplicative scattering correction(MSC) to reduce noise. Secondly, principal component analysis(PCA) was employed to reduce the dimensionality of data from 1 868 to 14, and the cumulative contribution of the 14 principal components was 99.74%.Thirdly, the dimensionality of the processed data was reduced to 2, and the discriminant information was extracted by linear discriminant analysis(LDA). Finally, fuzzy C-means clustering(FCM) was run to get initial cluster centers for FCLVQ. The experimental results showed that when the weight index m=2, the accuracy rate of FCLVQ was 95.25%. On the condition of m=2, for the same spectra, the classification accuracy rates of FCM, GK and fuzzy Kohonen clustering(FKCN) were 90.91%, 92.41% and 90.91% respectively. The experimental results showed that compared with the other three algorithms, FCLVQ had a better classification accuracy when m=2 and the number of principal components were 14. Thus, it can be used to classify different tea varieties.
    LI Xiao, CHEN Yong, MEI Wu-jun, WU Xiao-hong, FENG Ya-jie, WU Bin. Classification of Tea Varieties Using Fuzzy Covariance Learning Vector Quantization[J]. Spectroscopy and Spectral Analysis, 2023, 43(2): 638
    Download Citation