• Optical Instruments
  • Vol. 45, Issue 2, 46 (2023)
Bing WANG, Qi HU*, and Yalin BIAN
Author Affiliations
  • School of Optical-Electrical and Computer Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
  • show less
    DOI: 10.3969/j.issn.1005-5630.2023.002.006 Cite this Article
    Bing WANG, Qi HU, Yalin BIAN. An image semantic segmentation algorithm with a two-branch structure[J]. Optical Instruments, 2023, 45(2): 46 Copy Citation Text show less

    Abstract

    Image semantic segmentation requires fine detail information and rich semantic information, but in the stage of feature extraction, continuous down-sampling operation will lead to the loss of spatial details of objects in the image. To solve this problem, a semantic segmentation algorithm based on double-branch structure is proposed, which can obtain rich semantic information effectively and reduce the loss of object details in feature extraction stage. One branch of the algorithm uses shallow network to retain high-resolution detail information which is helpful for object edge segmentation, and the other branch uses deep network for downsampling to obtain semantic information which is helpful for object category recognition, and then the effective fusion of the two kinds of information can generate accurate pixel prediction. Experimental results on Cityscapes and CamVid datasets show that the proposed algorithm achieves better segmentation performance under fewer parameters than existing semantic segmentation algorithms.
    $ \alpha =\sigma \left({f}^{1\times 1}\left(X\right)\right) $(1)

    View in Article

    $ {X}_{\mathrm{S}\mathrm{A}}={f}_{\mathrm{S}\mathrm{A}}\left(X,\alpha \right) $(2)

    View in Article

    $ {\beta }_{\rm{avg}}=\sigma \left({W}_{2}\delta \left({W}_{1}\left(GlobalAvgPool\left(X\right)\right)\right)\right) $(3)

    View in Article

    $ {X}_{\rm{avg}}={f}_{\mathrm{C}\mathrm{A}}\left(X,{\beta }_{\rm{avg}}\right) $(4)

    View in Article

    $ {\beta }_{\rm{max}}=\mathrm{\sigma }\left({W}_{2}\mathrm{\delta }\left({W}_{1}\left(GlobalMaxPool\left({X}\right)\right)\right)\right) $(5)

    View in Article

    $ {X}_{\rm{max}}={f}_{\mathrm{C}\mathrm{A}}\left(X,{\beta }_{\rm{max}}\right) $(6)

    View in Article

    $ lr=baselr\times {\left(1-\frac{iter}{max\_iter}\right)}^{power} $(7)

    View in Article

    $ mIoU=\frac{1}{k+1}{\sum }_{i=0}^{k}\frac{{p}_{ii}}{{\displaystyle\sum }_{j=0}^{k}{p}_{ij}+{\displaystyle\sum }_{j=0}^{k}{p}_{ji}-{p}_{ii}} $(8)

    View in Article