• Laser & Optoelectronics Progress
  • Vol. 60, Issue 2, 0210013 (2023)
Zhansheng Tian and Libo Liu*
Author Affiliations
  • School of Information Engineering, Ningxia University, Yinchuan 750021, Ningxia , China
  • show less
    DOI: 10.3788/LOP220453 Cite this Article Set citation alerts
    Zhansheng Tian, Libo Liu. Fine-Grained Image Classification Model Based on Improved Transformer[J]. Laser & Optoelectronics Progress, 2023, 60(2): 0210013 Copy Citation Text show less

    Abstract

    For the characteristics of subtle differences between various subclasses and large differences between same subclasses in a fine-grained image, the existing neural network models have some challenges in processing, including insufficient feature extraction ability, redundant feature representation, and weak inductive bias ability; therefore, an enhanced Transformer image classification model is proposed in this study. First, an external attention is employed to replace the self-attention in the original Transformer model, and the model's feature extraction ability is enhanced by capturing the correlation between samples. Second, the feature selection module is introduced to filter differentiating features and eliminate redundant information to improve feature representation capability. Finally, the multivariate loss is added to improve the model's ability to induce bias, differentiate various subclasses, and fuse the same subclasses. The experimental findings demonstrate that the proposed method's classification accuracy on three fine-grained image datasets of CUB-200-2011, Stanford Dogs, and Stanford Cars reaches 89.8%, 90.2%, and 94.7%, respectively; it is better than that of numerous mainstream fine-grained image classification approaches.
    Zhansheng Tian, Libo Liu. Fine-Grained Image Classification Model Based on Improved Transformer[J]. Laser & Optoelectronics Progress, 2023, 60(2): 0210013
    Download Citation