• Electronics Optics & Control
  • Vol. 27, Issue 12, 84 (2020)
YANG Shengjie1, QIU Zhenan2, GAO Xiaoning3, and LI Jianxun3
Author Affiliations
  • 1[in Chinese]
  • 2[in Chinese]
  • 3[in Chinese]
  • show less
    DOI: 10.3969/j.issn.1671-637x.2020.12.018 Cite this Article
    YANG Shengjie, QIU Zhenan, GAO Xiaoning, LI Jianxun. RGBD Semantic Segmentation Based on Depth-Sensitive Spatial Pyramid Pooling[J]. Electronics Optics & Control, 2020, 27(12): 84 Copy Citation Text show less

    Abstract

    The RGBD semantic segmentation model based on the standard 2D convolution kernel mostly takes the depth map as a single channel.Due to the limitation of convolution kernel characteristics, the geometric structure information brought by the depth information cannot be fully exploited.To overcome this defect, this paper constructs depth-sensitive convolution kernels and a pooling layer to make rich mining of depth information,and uses depth-sensitive spatial pyramid pooling to extract multi-scale information, so as to realize the segmentation of objects of different scales.Results of experiment on NYU v2 and SUN RGB-D datasets show that this method effectively improves the overall semantic segmentation accuracy.
    YANG Shengjie, QIU Zhenan, GAO Xiaoning, LI Jianxun. RGBD Semantic Segmentation Based on Depth-Sensitive Spatial Pyramid Pooling[J]. Electronics Optics & Control, 2020, 27(12): 84
    Download Citation