• Journal of Innovative Optical Health Sciences
  • Vol. 16, Issue 4, 2243001 (2023)
Guoping Xu1, Xuan Zhang1, Wentao Liao1, Shangbin Chen2, and Xinglong Wu1,2,*
Author Affiliations
  • 1School of Computer Science & Engineering, Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, Hubei 430205, P. R. China
  • 2Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
    DOI: 10.1142/S1793545822430015
    Guoping Xu, Xuan Zhang, Wentao Liao, Shangbin Chen, Xinglong Wu. LGNet: Local and global representation learning for fast biomedical image segmentation[J]. Journal of Innovative Optical Health Sciences, 2023, 16(4): 2243001

    Abstract

    Medical image segmentation plays a crucial role in clinical diagnosis and therapy systems, yet it still faces many challenges. Building on convolutional neural networks (CNNs), medical image segmentation has achieved tremendous progress. However, owing to the locality of convolution operations, CNNs have an inherent limitation in learning global context. To address this limitation, in this paper we propose LGNet, a semantic segmentation network that learns both local and global features for fast and accurate medical image segmentation. Specifically, we employ a two-branch architecture in which convolution layers in one branch learn local features and transformer layers in the other branch learn global features. LGNet rests on two key ideas: (1) we bridge the two branches so that local and global features are learned interactively; (2) we present a novel multi-feature fusion model (MSFFM) that leverages the global contextual information from the transformer and the local representational features from the convolutions. Our method achieves a state-of-the-art trade-off between accuracy and efficiency on several medical image segmentation benchmarks, including Synapse, ACDC and MOST. Specifically, LGNet achieves state-of-the-art performance with Dice indices of 80.15% on Synapse, 91.70% on ACDC, and 95.56% on MOST. Meanwhile, the inference speed reaches 172 frames per second at 224×224 input resolution. Extensive experiments demonstrate the effectiveness of the proposed LGNet for fast and accurate medical image segmentation.
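    For readers unfamiliar with the Dice index used to report accuracy above, the following is a minimal sketch of the standard definition, 2|A∩B| / (|A| + |B|), for two binary masks. It is not the authors' evaluation code: the function name `dice_index`, the nested-list mask format, and the convention of returning 1.0 for two empty masks are all assumptions made for illustration.

```python
def dice_index(pred, target):
    """Dice index 2|A∩B| / (|A| + |B|) for two binary masks.

    Masks are nested lists (rows of 0/1 values); this format is an
    illustrative assumption. Returns 1.0 when both masks are empty,
    which is a common convention, not something stated in the paper.
    """
    flat_pred = [int(bool(v)) for row in pred for v in row]
    flat_target = [int(bool(v)) for row in target for v in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must have the same number of pixels")
    # Pixels labelled foreground in both masks.
    intersection = sum(p & t for p, t in zip(flat_pred, flat_target))
    total = sum(flat_pred) + sum(flat_target)
    return 1.0 if total == 0 else 2.0 * intersection / total


# Toy 2x3 masks (hypothetical data): 2 overlapping pixels, 3 foreground each.
pred = [[1, 1, 0], [0, 1, 0]]
target = [[1, 0, 0], [0, 1, 1]]
print(f"Dice = {dice_index(pred, target):.2%}")
```

    On multi-class benchmarks, a score like this is typically computed per class and then averaged; how the reported figures were aggregated is not stated in the abstract.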