• Laser & Optoelectronics Progress
  • Vol. 59, Issue 18, 1811003 (2022)
Rongfen Zhang*, Wenhao Yuan, Jin Lu, and Yuhong Liu
Author Affiliations
  • College of Big Data and Information Engineering, Guizhou University, Guiyang 550025, Guizhou , China
  • show less
    DOI: 10.3788/LOP202259.1811003 Cite this Article Set citation alerts
    Rongfen Zhang, Wenhao Yuan, Jin Lu, Yuhong Liu. Visual Simultaneous Localization and Mapping Method of Semantic Octree Map Toward Indoor Dynamic Scenes[J]. Laser & Optoelectronics Progress, 2022, 59(18): 1811003 Copy Citation Text show less
    Structure of proposed vSLAM system
    Fig. 1. Structure of proposed vSLAM system
    Network structure of FAST-SCNN[10]
    Fig. 2. Network structure of FAST-SCNN[10]
    Result of semantic segmentation. (a) Original drawing; (b) segmented output
    Fig. 3. Result of semantic segmentation. (a) Original drawing; (b) segmented output
    Outliers removing. (a) All feature points; (b) effect after removing moving points
    Fig. 4. Outliers removing. (a) All feature points; (b) effect after removing moving points
    Diagram of octree map
    Fig. 5. Diagram of octree map
    Semantic octree map building process
    Fig. 6. Semantic octree map building process
    Dining Rooms image sequence and its semantic segmentation results
    Fig. 7. Dining Rooms image sequence and its semantic segmentation results
    Point cloud. (a) Original point cloud; (b) filtered point cloud
    Fig. 8. Point cloud. (a) Original point cloud; (b) filtered point cloud
    Semantic point cloud and semantic octree map. (a) Semantic point cloud; (b) semantic octree map
    Fig. 9. Semantic point cloud and semantic octree map. (a) Semantic point cloud; (b) semantic octree map
    Comparison of ATE. (a) ORB-SLAM2; (b) proposed system
    Fig. 10. Comparison of ATE. (a) ORB-SLAM2; (b) proposed system
    Comparison of relative translation error. (a) ORB-SLAM2; (b) proposed system
    Fig. 11. Comparison of relative translation error. (a) ORB-SLAM2; (b) proposed system
    Semantic octree map building effect. (a) Original frame sequence; (b) semantic octree map
    Fig. 12. Semantic octree map building effect. (a) Original frame sequence; (b) semantic octree map
    Feature points extraction of proposed system in two scenes
    Fig. 13. Feature points extraction of proposed system in two scenes
    Semantic segmentation results of proposed system in two scenes
    Fig. 14. Semantic segmentation results of proposed system in two scenes
    Moving point removing effect of proposed system in two scenes
    Fig. 15. Moving point removing effect of proposed system in two scenes
    Map building effect of proposed system
    Fig. 16. Map building effect of proposed system

    Algorithm Multi-stage RANSAC Algorithm

    Input: Previous Frame, F1; KeyPoints of F1K; Current Frame, F2;The minimum number of points required, m

    Output: The outliers, O; The inliers, I

    1: assign K to I

    2: for the current stage number i is less than n

    3: compute the maximum number required for iterations, N

    4: select randomly m points from I

    5: while the number of iterations is less than N

    6: solve for F

    7: determine the portion of inliers among K using F

    8: if the portion of inliers is larger than τi

    9: leave the loop

    10: end if

    11: end while

    12: assign new inliers to I

    13: end for

    Table 1. Multi-stage RANSAC algorithm
    τIns.Outs.Time /ms
    0.116868.22814
    0.2228121.02447
    0.3260130.60222
    Table 2. Test results of RANSAC algorithm
    τ1τ2Ins.1Ins.2Outs.Time /ms
    10.129415954.99953
    10.229421040.83729
    10.3294238110.48140
    20.132715979.83135
    20.232721191.63077
    20.3327238110.69094
    Table 3. Test results of Multi-stage RANSAC algorithm
    Dynamic Points Removing Algorithm
    Input: Dynamic Points, O; Semantic mask of most likely moving objects, M; KeyPoints, K
    Output: The set of inliers, I
    1: if M not empty then
    2: for point o m in O M do
    3: if o = m then
    4: remove M from K
    5: leave the loop
    6: end if
    7: end for
    8: end if
    Table 4. Moving point removing algorithm
    Seq.ORB-SLAM2Proposed systemImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.8259810.4494780.0199910.01037497.5897.69
    WH0.5023630.2996150.0259550.01203694.8395.98
    WR1.2122790.6762050.0621670.04196794.8793.79
    WS0.5852810.4237430.0090440.00409798.4599.03
    SX0.0136020.0066880.0126490.0061407.018.19
    SH0.0407320.0232050.0176600.00852156.6463.28
    SR0.0308980.0180590.0211740.01104831.4738.82
    SS0.0120070.0055700.0072370.00372039.7333.21
    Table 5. Typical value of translation
    Seq.ORB-SLAM2Proposed systemImprovement
    RMSE /degS.D /degRMSE /degS.D /degRMSE /%S.D /%
    WX14.8129308.0861170.6186510.37631495.8295.35
    WH13.3791707.2778470.7490630.35454894.4095.13
    WR22.02147212.8580641.2017340.78777494.5493.87
    WS10.3347877.5237540.2561800.11081697.5298.53
    SX0.5780520.2991330.5038810.27463612.838.19
    SH1.0307260.4566380.6524510.33093036.7027.53
    SR0.8821690.4341880.7487900.36218015.1216.58
    SS0.3362920.1448340.2597070.11606622.7719.86
    Table 6. Typical value of rotation
    Seq.ORB-SLAM2Proposed systemImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.5655050.2006910.0149320.00796997.3696.03
    WH0.3279890.1772250.0253570.01316992.2792.57
    WR0.8178790.4302060.0476720.03318094.1792.29
    WS0.4092680.1751140.0069570.00324498.3098.15
    SX0.0092750.0047960.0106740.004922-15.08-2.63
    SH0.0278820.0136920.0140910.00676749.4650.58
    SR0.0215130.0141810.0160520.00960725.3832.25
    SS0.0076980.0036550.0055680.00304727.6716.63
    Table 7. Typical value of ATE
    Seq.ORB-SLAM2Proposed systemImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.0302830.0199770.0199910.01037433.9948.07
    WH0.0318930.0164580.0259550.01203618.6226.87
    WR0.1521890.1169610.0621670.04196759.1564.12
    WS0.0093470.0043580.0090440.0040973.245.99
    SX0.0126920.0066640.0126490.0061400.347.86
    SH0.0174030.0078080.0176600.008221-1.48-5.29
    SR0.0257570.0147380.0211740.01104817.7925.04
    SS0.0073270.0035640.0072370.0035201.291.23
    Table 8. Typical value of translation
    Seq.ORB-SLAM2Proposed systemImprovement
    RMSE /degS.D /degRMSE /degS.D /degRMSE /%S.D /%
    WX0.7662830.5293800.6186510.37631419.2728.91
    WH0.8462960.4220840.7490630.35454811.4916.00
    WR3.0436192.3352321.2017340.78777460.5266.27
    WS0.2550130.1069530.2561800.110816-0.46-3.61
    SX0.4953920.2701480.4938810.2746360.31-1.66
    SH0.6307550.3006570.6284510.2909300.373.24
    SR0.8430540.4680680.7487900.36218011.1822.62
    SS0.2621470.1171040.2597070.1160660.930.89
    Table 9. Typical value of rotation
    Seq.ORB-SLAM2Proposed systemImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.0221800.0144020.0149320.00796932.6844.67
    WH0.0320830.0177150.0253570.01316920.9625.66
    WR0.4338200.2282520.0476720.03318089.0185.46
    WS0.0077090.0032750.0069570.0032449.750.95
    SX0.0103390.0053770.0106740.004922-3.248.46
    SH0.0148160.0066720.0140910.0067674.89-1.42
    SR0.0202420.0126800.0160520.00960720.7024.24
    SS0.0062730.0030850.0055680.00304711.241.23
    Table 10. Typical value of ATE
    Seq.DS-SLAMMultiImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.0302830.0199770.0246520.01357618.5932.04
    WH0.0318930.0164580.0293210.0160078.062.74
    WR0.1521890.1169610.1381370.1105489.235.48
    WS0.0093470.0043580.0095990.004705-2.70-7.96
    SX0.0126920.0066640.0126150.0064420.613.33
    SH0.0174030.0078080.0172830.0072270.697.44
    SR0.0257570.0147380.0212120.0112221.7624.1
    SS0.0073270.0035640.0073150.0038710.16-0.03
    Table 11. Typical value of translation
    Seq.DS-SLAMMultiImprovement
    RMSE /degS.D /degRMSE /degS.D /degRMSE /%S.D /%
    WX0.7662830.5293800.6831580.44073010.8516.75
    WH0.8462960.4220840.8246580.4474382.56-6.01
    WR3.0436192.3352322.7974772.1890818.096.26
    WS0.2550130.1069530.2619050.122951-2.70-14.96
    SX0.4953920.2701480.4886890.2597260.143.86
    SH0.6307550.3006570.6180150.2895670.203.69
    SR0.8430540.4680680.6815210.33369519.1628.71
    SS0.2621470.1171040.2651800.110150-1.165.94
    Table 12. Typical value of rotation
    Seq.DS-SLAMMultiImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.0221800.0144020.0191480.01068213.6725.83
    WH0.0320830.0177150.0288450.01573010.0911.21
    WR0.4338200.2282520.4077810.2052476.0010.08
    WS0.0077090.0032750.0073020.0034595.28-5.62
    SX0.0103390.0053770.0099620.0051293.654.61
    SH0.0148160.0066720.0145890.0066021.531.10
    SR0.0202420.0126800.0165310.01026818.3319.02
    SS0.0062730.0030850.0061420.0032162.09-4.25
    Table 13. Typical value of ATE
    Seq.DS-SLAMSemanticImprovement
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.0302830.0199770.0203900.01027532.6748.57
    WH0.0318930.0164580.0275000.01411913.7714.21
    WR0.1521890.1169610.0729580.05113852.0656.28
    WS0.0093470.0043580.0091720.0043831.87-0.57
    SX0.0126920.0066640.0125700.0063060.965.37
    SH0.0174030.0078080.0176590.007570-1.473.05
    SR0.0257570.0147380.0200020.01054522.3428.45
    SS0.0073270.0035640.0071210.0035182.811.29
    Table 14. Typical value of translation
    Seq.DS-SLAMSemanticImprovements
    RMSE /degS.D /degRMSE /degS.D /degRMSE /%S.D /%
    WX0.7662830.5293800.6149500.37995719.7528.23
    WH0.8462960.4220840.7990370.4027925.584.57
    WR3.0436192.3352321.4715571.02162251.6556.25
    WS0.2550130.1069530.2603830.114594-2.11-7.14
    SX0.4953920.2701480.4861030.2631591.882.59
    SH0.6307550.3006570.6495840.314669-2.99-4.66
    SR0.8430540.4680680.6672440.32338220.8530.91
    SS0.2621470.1171040.2582820.1132051.473.33
    Table 15. Typical value of rotation
    Seq.DS-SLAMSemanticImprovements
    RMSE /mS.D /mRMSE /mS.D /mRMSE /%S.D /%
    WX0.0221800.0144020.0158590.00828928.5042.45
    WH0.0320830.0177150.0261380.01356518.5323.43
    WR0.4338200.2282520.0573950.04152186.7781.81
    WS0.0077090.0032750.0070950.0032767.96-0.03
    SX0.0103390.0053770.0102580.0048920.789.02
    SH0.0148160.0066720.0142990.0063343.495.07
    SR0.0202420.0126800.0161590.00981620.1722.59
    SS0.0062730.0030850.0059030.0029465.904.51
    Table 16. Typical value of ATE
    vSLAMORB extractMoving checkSegmentation
    Proposed0.0081180.0129130.020810
    DS-SLAM0.0081180.0172990.027627
    Table 17. Time comparison of feature point extraction, motion consistency judgment, and semantic segmentation
    Rongfen Zhang, Wenhao Yuan, Jin Lu, Yuhong Liu. Visual Simultaneous Localization and Mapping Method of Semantic Octree Map Toward Indoor Dynamic Scenes[J]. Laser & Optoelectronics Progress, 2022, 59(18): 1811003
    Download Citation