• Opto-Electronic Engineering
  • Vol. 48, Issue 5, 200418 (2021)
Zhang Jiesong, Huang Yingping*, and Zhang Rui
Author Affiliations
  • [in Chinese]
  • show less
    DOI: 10.12086/oee.2021.200418 Cite this Article
    Zhang Jiesong, Huang Yingping, Zhang Rui. Fusing point cloud with image for object detection using convolutional neural networks[J]. Opto-Electronic Engineering, 2021, 48(5): 200418 Copy Citation Text show less

    Abstract

    Addressing on the issues like varying object scale, complicated illumination conditions, and lack of reliable distance information in driverless applications, this paper proposes a multi-modal fusion method for object detection by using convolutional neural networks. The depth map is generated by mapping LiDAR point cloud onto the image plane and taken as input data together with the RGB image. The input data is also processed by the sliding window to reduce information loss. Two feature extracting networks are used to extract features of the image and the depth map respectively. The generated feature maps are fused through a connection layer. The objects are detected by processing the fused feature map through position regression and object classification. Non-maximal suppression is used to optimize the detection results. The experimental results on the KITTI dataset show that the proposed method is robust in various illumination conditions and especially effective on detecting small objects. Compared with other methods, the proposed method exhibits integrated advantages in terms of detection accuracy and speed.
    Zhang Jiesong, Huang Yingping, Zhang Rui. Fusing point cloud with image for object detection using convolutional neural networks[J]. Opto-Electronic Engineering, 2021, 48(5): 200418
    Download Citation