• Laser & Optoelectronics Progress
  • Vol. 57, Issue 20, 201508 (2020)
Xunhua Liu1,2,*, Shaoyuan Sun1,2, Lipeng Gu1,2, and Xiang Li1,2
Author Affiliations
  • 1College of Information Science and Technology, Donghua University, Shanghai 201620, China
  • 2Engineering Research Center of Digitized Textile & Fashion Technology, Ministry of Education, Donghua University, Shanghai 201620, China;
DOI: 10.3788/LOP57.201508
Xunhua Liu, Shaoyuan Sun, Lipeng Gu, Xiang Li. 3D Object Detection Based on Improved Frustum PointNet[J]. Laser & Optoelectronics Progress, 2020, 57(20): 201508
Fig. 1. Improved F-PointNet structure
Fig. 2. Network structure for extracting candidate regions of frustum point cloud
Fig. 3. Registration results of 2D images and 3D point clouds. (a) RGB image; (b) 3D point cloud data; (c) registration result of (a) and (b)
Fig. 4. Initially obtained 3D target frustum candidate region
Fig. 5. Schematic of viewing frustum orientation adjustment
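Fig. 5 depicts the standard F-PointNet step of rotating each extracted frustum so that its center axis coincides with the camera's forward (z) axis before segmentation and box regression. The sketch below is a minimal illustration of that normalization, assuming camera coordinates with x right, y down, z forward; the function name and argument layout are illustrative, not taken from the paper.

```python
import numpy as np

def rotate_frustum_to_center(points, frustum_center_ray):
    """Rotate a frustum point cloud about the camera y axis so that the
    frustum's center ray coincides with the camera z (forward) axis.

    points: (N, 3) array in camera coordinates (x right, y down, z forward).
    frustum_center_ray: (3,) direction from the camera origin through the
        center of the 2D detection box.
    """
    # Heading of the frustum center ray in the x-z plane.
    angle = np.arctan2(frustum_center_ray[0], frustum_center_ray[2])
    cos_a, sin_a = np.cos(-angle), np.sin(-angle)
    # Rotation about the y axis by -angle brings the center ray onto the z axis.
    rot_y = np.array([[cos_a, 0.0, sin_a],
                      [0.0,   1.0, 0.0],
                      [-sin_a, 0.0, cos_a]])
    return points @ rot_y.T
```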
Fig. 6. 3D target mask prediction network
Fig. 7. Attention mechanism implementation process
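The attention mechanism referred to in Fig. 7 is defined by the figure itself; as a rough orientation only, the sketch below shows one common channel-attention form (squeeze-and-excitation style re-weighting of per-point feature channels). The module name, reduction ratio, and tensor layout are assumptions for illustration and are not claimed to match the paper's design.

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """SE-style channel attention over per-point features (illustrative only)."""

    def __init__(self, channels: int, reduction: int = 4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, C, N) per-point features; average over points to get (B, C).
        weights = self.fc(x.mean(dim=2))      # channel weights in (0, 1)
        return x * weights.unsqueeze(-1)      # re-weight each feature channel
```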
Fig. 8. 3D target bounding box prediction network
Fig. 9. Coordinate transformation of target instance point cloud
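Fig. 9 shows the transformation of the segmented instance points into an object-centred coordinate frame before box regression, as in the original F-PointNet pipeline. The sketch below assumes the usual step of subtracting the centroid of the predicted foreground points; the helper name and return values are illustrative.

```python
import numpy as np

def to_instance_frame(points, mask):
    """Translate masked foreground points into an object-centred frame.

    points: (N, 3) frustum points (already rotated to the frustum axis).
    mask: (N,) boolean foreground mask from the segmentation network.
    Returns the centred foreground points and the centroid that was removed.
    """
    foreground = points[mask]
    centroid = foreground.mean(axis=0) if len(foreground) else np.zeros(3)
    return foreground - centroid, centroid
```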
Fig. 10. Visualized 3D target bounding box prediction results. (a) 2D target detection result; (b) 3D target detection result
Item        CPU             Computing memory   GPU                System         CUDA
Content     Intel i5-6600   8 GB               NVIDIA GTX 1070    Ubuntu 16.04   CUDA 9.0
Table 1. Experimental configuration
xmargin   Car                        Pedestrian                 Cyclist
          Easy    Moderate  Hard     Easy    Moderate  Hard     Easy    Moderate  Hard
0         82.05   68.46     62.42    65.94   58.35     50.87    74.10   55.54     52.09
0.1       82.39   69.53     62.52    61.90   55.20     49.02    73.45   55.46     52.26
0.2       82.79   70.85     63.49    67.05   59.16     51.82    76.04   57.09     53.33
0.3       83.19   70.59     63.13    65.06   57.53     50.59    73.55   55.76     52.73
Table 2. AP values of 3D target detection under each threshold (unit: %)
Part                                                                     AP /%
Wide-threshold mask (xmargin=0.2)   Attention mechanism   Focal Loss     Easy    Moderate   Hard
-                                   -                     -              82.05   68.46      62.42
√                                   -                     -              82.79   70.85      63.49
-                                   √                     -              81.89   69.23      62.54
-                                   -                     √              82.73   69.89      63.27
√                                   √                     √              83.04   71.25      63.82
Table 3. Influence of each processing part on AP values (unit: %)
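Table 3 lists Focal Loss as one of the evaluated components. For reference, the standard focal loss of Lin et al., FL(p_t) = -α_t (1 - p_t)^γ log(p_t), can be sketched as below; the α and γ values shown are the common defaults from the original focal-loss paper, not necessarily the settings used here.

```python
import torch
import torch.nn.functional as F

def focal_loss(logits, targets, alpha: float = 0.25, gamma: float = 2.0):
    """Binary focal loss FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t).

    logits, targets: float tensors of the same shape; targets in {0, 1}.
    alpha, gamma: common defaults from Lin et al. (2017), shown for reference.
    """
    ce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
    p = torch.sigmoid(logits)
    p_t = p * targets + (1 - p) * (1 - targets)          # prob. of the true class
    alpha_t = alpha * targets + (1 - alpha) * (1 - targets)
    return (alpha_t * (1 - p_t) ** gamma * ce).mean()
```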
Method                  AP /%
                        Easy    Moderate   Hard
MV3D[4]                 71.29   62.28      56.56
F-PointNet[5]           82.05   68.46      62.42
UberATG-ContFuse[14]    82.54   66.22      64.04
MLOD[15]                72.24   64.20      57.20
Proposed                83.04   71.25      63.82
Table 4. Comparison of AP values of different models (unit: %)