• Laser & Optoelectronics Progress
  • Vol. 56, Issue 13, 131007 (2019)
Yongfeng Dong1、2, Yuxin Yang1, and Liqin Wang1、2、*
Author Affiliations
  • 1 School of Artificial Intelligence, Hebei University of Technology, Tianjin 300401, China
  • 2 Hebei Provincial Key Laboratory of Big Data Computing, Tianjin 300401, China
  • show less
    DOI: 10.3788/LOP56.131007 Cite this Article Set citation alerts
    Yongfeng Dong, Yuxin Yang, Liqin Wang. Image Semantic Segmentation Based on Multi-Scale Feature Extraction and Fully Connected Conditional Random Fields[J]. Laser & Optoelectronics Progress, 2019, 56(13): 131007 Copy Citation Text show less
    Single-branch network structure before feature fusion
    Fig. 1. Single-branch network structure before feature fusion
    Multi-scale feature fusion
    Fig. 2. Multi-scale feature fusion
    FullCRF optimization semantic rough segmentation result
    Fig. 3. FullCRF optimization semantic rough segmentation result
    Comparison of segmentation results
    Fig. 4. Comparison of segmentation results
    RGB encoderDepth encoder
    Conv block1:3×3 Conv 643×3 Conv 642×2 maxpoolingConv block2:3×3 Conv 1283×3 Conv 1282×2 maxpoolingConv block3:3×3 Conv 2563×3 Conv 2562×2 maxpoolingConv block1:3×3 Conv 643×3 Conv 642×2 maxpoolingConv block2:3×3 Conv 1283×3 Conv 1282×2 maxpoolingConv block3:3×3 Conv 2563×3 Conv 2562×2 maxpooling
    Conv block4:3×3 Conv 5123×3 Conv 5123×3 Conv 5122×2 maxpoolingConv block5:3×3 Conv 5123×3 Conv 5123×3 Conv 5122×2 maxpoolingConv block4:3×3 Conv 5123×3 Conv 5123×3 Conv 5122×2 maxpoolingConv block5:3×3 Conv 5123×3 Conv 5123×3 Conv 512
    Table 1. Parameter setting table of single branch encoder before feature fusion
    MethodInputdata typePA /%MA /%MIoU /%
    Method in Ref. [6]RGB60.042.229.2
    Method in Ref. [6]Depth57.135.224.2
    Method in Ref. [25]RGB-depth60.3-28.6
    Method in Ref. [26]RGB-depth63.831.5-
    Method in Ref. [6]RGB-depth61.542.430.5
    Method in Ref. [22]RGB-depth65.642.227.8
    MSF-CRFRGB-depth66.944.230.2
    Table 2. Results of different networks on NYUv2 dataset
    DatasetWallFloorCabinetBedChairSofaTableDoor
    FuseNet89.295.767.975.774.671.049.334.8
    MSF-CRF91.896.571.073.773.583.149.527.1
    DatasetWindowBookshelfPictureCounterBlindsDeskShelfCurtain
    FuseNet52.948.068.156.467.215.112.656.5
    MSF-CRF53.860.766.663.545.626.017.358.5
    DatasetDresserPillowMirrorFloormatClothesCeilingBooksFridge
    FuseNet28.444.330.738.822.975.521.211.9
    MSF-CRF45.349.354.919.015.969.210.721.0
    DatasetTVPaperTowelShowerBoxWhite boardPersonNightstand
    FuseNet39.15.723.034.9732.523.235.1
    MSF-CRF50.84.329.630.63.324.349.454.0
    DatasetToiletSinkLampBathtubBagOther structOther furnitureOther prop
    FuseNet75.032.440.151.91.619.810.845.7
    MSF-CRF78.732.940.250.11.09.318.746.8
    Table 3. Comparison of classification accuracy of 40 categories
    DatasetWallFloorCabinetBedChairSofaTableDoor
    FuseNet59.570.844.759.341.247.531.819.6
    MSF-CRF57.270.445.063.743.850.235.415.4
    DatasetWindowBookshelfPictureCounterBlindsDeskShelfCurtain
    FuseNet27.530.044.134.442.511.35.834.8
    MSF-CRF32.730.848.038.536.317.06.143.1
    DatasetDresserPillowMirrorFloormatClothesCeilingBooksFridge
    FuseNet23.729.624.329.58.542.314.88.9
    MSF-CRF32.134.342.517.09.439.89.514.0
    DatasetTVPaperTowelShowerBoxWhite boardPersonNightstand
    FuseNet31.53.818.520.3422.414.826.6
    MSF-CRF39.13.721.826.12.420.732.940.1
    DatasetToiletSinkLampBathtubBagOther structOther furnitureOther prop
    FuseNet49.124.328.841.11.111.17.921.9
    MSF-CRF50.121.231.239.80.97.313.425.0
    Table 4. Comparison of IoU of 40 categories
    Yongfeng Dong, Yuxin Yang, Liqin Wang. Image Semantic Segmentation Based on Multi-Scale Feature Extraction and Fully Connected Conditional Random Fields[J]. Laser & Optoelectronics Progress, 2019, 56(13): 131007
    Download Citation