• Optics and Precision Engineering
  • Vol. 31, Issue 11, 1700 (2023)
Yunzuo ZHANG1,2,*, Wei GUO1, and Cunyu WU1
Author Affiliations
  • 1School of Information Science and Technology, Shijiazhuang Tiedao University, Shijiazhuang050043, China
  • 2Hebei Key Laboratory of Electromagnetic Environmental Effects and Information Processing, Shijiazhuang Tiedao University, Shijiazhuang050043, China
  • show less
    DOI: 10.37188/OPE.20233111.1700 Cite this Article
    Yunzuo ZHANG, Wei GUO, Cunyu WU. Fast extraction of buildings from remote sensing images by fusion of CNN and Transformer[J]. Optics and Precision Engineering, 2023, 31(11): 1700 Copy Citation Text show less
    References

    [1] 1徐胜军, 欧阳朴衍, 郭学源, 等. 多尺度特征融合空洞卷积ResNet遥感图像建筑物分割[J]. 光学 精密工程, 2020, 28(7):1588-1599. doi: 10.37188/OPE.20202807.1588XUSH J, OUYANGP Y, GUOX Y, et al. Building segmentation in remote sensing image based on multiscale-feature fusion dilated convolution resnet[J]. Optics and Precision Engineering, 2020, 28(7):1588-1599.(in Chinese). doi: 10.37188/OPE.20202807.1588

    [2] 2王舒洋, 慕晓冬, 杨东方, 等. 融合高阶信息的遥感影像建筑物自动提取[J]. 光学 精密工程, 2019, 27(11): 2474-2483. doi: 10.3788/ope.20192711.2474WANGS Y, MUX D, YANGD F, et al. High-order statistics integration method for automatic building extraction of remote sensing images[J]. Optics and Precision Engineering, 2019, 27(11): 2474-2483.(in Chinese). doi: 10.3788/ope.20192711.2474

    [3] Z X ZHANG, Y H WANG. JointNet: a common neural network for road and building extraction. Remote Sensing, 11, 696(2019).

    [4] X R PAN, F YANG, L R GAO et al. Building extraction from high-resolution aerial imagery using a generative adversarial network with spatial and channel attention mechanisms. Remote Sensing, 11, 917(2019).

    [5] P LUC, C COUPRIE, S CHINTALA et al. Semantic segmentation using adversarial networks. arXiv, 1611-08408(2016). https://arxiv.org/abs/1611.08408

    [6] X Q ZHANG, Z H XIAO, D Y LI et al. Semantic segmentation of remote sensing images using multiscale decoding network. IEEE Geoscience and Remote Sensing Letters, 16, 1492-1496(2019).

    [7] P H LIU, X P LIU, M X LIU et al. Building footprint extraction from high-resolution images via spatial residual inception convolutional neural network. Remote Sensing, 11, 830-848(2019).

    [8] N J HE, L Y FANG, A PLAZA. Hybrid first and second order attention Unet for building segmentation in remote sensing images. Science China Information Sciences, 63, 1-12(2020).

    [9] S X ZHENG, J C LU, H S ZHAO et al. Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers, 6877-6886(2021).

    [10] X ZHAO, J Y GUO, Y T ZHANG et al. Memory-augmented transformer for remote sensing image semantic segmentation. Remote Sensing, 13, 4518(2021).

    [11] Z Y XU, W C ZHANG, T X ZHANG et al. Efficient transformer for remote sensing image segmentation. Remote Sensing, 13, 3585(2021).

    [12] W YUAN, W B XU. MSST-net: a multi-scale adaptive network for building extraction from remote sensing images based on swin transformer. Remote Sensing, 13, 4743(2021).

    [13] K Y CHEN, Z X ZOU, Z W SHI. Building extraction from remote sensing images with sparse token transformers. Remote Sensing, 13, 4441-4462(2021).

    [14] D LI, A B YAO, Q F CHEN. PSConv Squeezing Feature Pyramid into one Compact Poly-scale Convolutional Layer. Computer Vision - ECCV 2020, 615-632(2020).

    [15] E MAGGIORI, Y TARABALKA, G CHARPIAT et al. Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark, 3226-3229(2017).

    [16] A KHALEL, M EL-SABAN. Automatic pixelwise object labeling for aerial imagery using stacked U-nets. arXiv, 1803-04953(2018). https://arxiv.org/abs/1803.04953

    [17] X LI, X J YAO, Y FANG. Building-A-nets: robust building extraction from high-resolution remote sensing images with adversarial networks. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 11, 3680-3687(2018).

    [18] J J MA, L L WU, X TANG et al. Building extraction of aerial images by a global and multi-scale encoder-decoder network. Remote Sensing, 12, 2350(2020).

    Yunzuo ZHANG, Wei GUO, Cunyu WU. Fast extraction of buildings from remote sensing images by fusion of CNN and Transformer[J]. Optics and Precision Engineering, 2023, 31(11): 1700
    Download Citation