• Acta Optica Sinica
  • Vol. 42, Issue 19, 1911001 (2022)
Hao Sha, Yue Liu*, Yongtian Wang, Chenguang Lu, and Mengze Zhao
Author Affiliations
  • Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing 100081, China
  • show less
    DOI: 10.3788/AOS202242.1911001 Cite this Article Set citation alerts
    Hao Sha, Yue Liu, Yongtian Wang, Chenguang Lu, Mengze Zhao. Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry[J]. Acta Optica Sinica, 2022, 42(19): 1911001 Copy Citation Text show less
    Principle of calculating the normal of nearest neighbor point sampling method
    Fig. 1. Principle of calculating the normal of nearest neighbor point sampling method
    Feature connection module based on depth channel attention mechanism
    Fig. 2. Feature connection module based on depth channel attention mechanism
    Overall architecture of monocular depth estimation method
    Fig. 3. Overall architecture of monocular depth estimation method
    Architecture of encoder and decoder sub-networks. (a) Sub-network structure of encoder; (b)-(d) subnetwork structures of decoder
    Fig. 4. Architecture of encoder and decoder sub-networks. (a) Sub-network structure of encoder; (b)-(d) subnetwork structures of decoder
    Depth prediction results of different methods on NYU Depth v2 dataset
    Fig. 5. Depth prediction results of different methods on NYU Depth v2 dataset
    3D reconstruction results based on monocular depth
    Fig. 6. 3D reconstruction results based on monocular depth
    Qualitative results of ablation experiments based on network architecture
    Fig. 7. Qualitative results of ablation experiments based on network architecture
    Qualitative results of ablation experiments based on constraints
    Fig. 8. Qualitative results of ablation experiments based on constraints
    Quantitative results of test set in range of different depth values
    Fig. 9. Quantitative results of test set in range of different depth values
    Quantitative results of selected images in range of different depth values. (a) 10 images with worst RMSE; (b) 10 images with worst REL; (c) 10 images with worst TH1
    Fig. 10. Quantitative results of selected images in range of different depth values. (a) 10 images with worst RMSE; (b) 10 images with worst REL; (c) 10 images with worst TH1
    MethodRMSERELδ<1.25δ<1.252δ<1.253
    Ref. [180.9070.2150.6110.8870.971
    Ref. [370.8240.2300.6140.8830.971
    Ref. [380.6200.1490.8060.8830.987
    Ref. [390.6350.1430.7880.9580.991
    Ref. [400.8190.2320.6460.8920.968
    Ref. [240.6410.1580.7690.9500.988
    Ref. [190.5730.1270.8110.9530.988
    Ref. [220.5860.1210.8110.9540.987
    Ref. [260.6000.1440.7910.9600.991
    Ref. [410.5720.1390.8150.9630.991
    Ref. [270.5990.1590.7720.9420.984
    Ref. [370.5550.1260.8430.9680.991
    This paper0.5520.1640.7680.9400.984
    Table 1. Quantitative comparison between proposed method and other different methods on NYU Depth v2 dataset
    MethodRuning time /msFrame rate /(frame·s-1RMSE
    Ref. [1823430.907
    Ref. [19237100.604
    Ref. [249660.753
    This paper58170.552
    Table 2. Comparison of running speeds of different methods
    MethodRMSERELδ<1.25δ<1.252δ<1.253
    Without skip connect0.7270.2220.6310.8850.969
    Without SE_Concat_Block0.6040.1770.7310.9220.976
    Baseline0.5860.1780.7380.9320.982
    U-net0.6470.2020.6810.9150.978
    Resnet-1010.6280.1890.7040.9210.981
    Table 3. Quantitative results of ablation experiments based on network architecture
    MethodRMSERELδ<1.25δ<1.252δ<1.253
    Baseline0.5940.1770.7400.9260.980
    With L2D0.5860.1780.7380.9320.982
    With L2D and LG0.5610.1650.7610.9350.983
    With L2D,LG, and LL0.5520.1640.7680.9400.984
    Table 4. Quantitative results of ablation experiments based on constraints
    Hao Sha, Yue Liu, Yongtian Wang, Chenguang Lu, Mengze Zhao. Monocular Indoor Depth Estimation Method Based on Neural Networks with Constraints on Two-Dimensional Images and Three-Dimensional Geometry[J]. Acta Optica Sinica, 2022, 42(19): 1911001
    Download Citation