Object detection algorithm based on image and point cloud fusion with N3D_DIOU

Bao-qing GUO; Guang-fei XIE

doi:10.37188/OPE.20212911.2703

Journals >Optics and Precision Engineering >Volume 29 >Issue 11 >Page 2703 > Article

Optics and Precision Engineering
Vol. 29, Issue 11, 2703 (2021)

Object detection algorithm based on image and point cloud fusion with N3D_DIOU

Bao-qing GUO^1,2,* and Guang-fei XIE¹

Author Affiliations

¹School of Mechanical， Electronic and Control Engineering， Beijing Jiaotong University， Beijing00044， China

²Frontiers Science Center for Smart High-speed Railway System， Beijing Jiaotong University， Beijing100044， China

show less

DOI: 10.37188/OPE.20212911.2703 Cite this Article

Bao-qing GUO, Guang-fei XIE. Object detection algorithm based on image and point cloud fusion with N3D_DIOU[J]. Optics and Precision Engineering, 2021, 29(11): 2703 Copy Citation Text

show less

Fig. 1. Detection network framework

Download full size | View in the Article

Fig. 2. Vote model network

Download full size | View in the Article

Fig. 3. Structure of FCN network

Download full size | View in the Article

Fig. 4. Relationship between three detection boxes and target boxes

Download full size | View in the Article

Fig. 5. Target box and detection box with angle deviation

Download full size | View in the Article

Fig. 6. 3D detection AP and recall curves for cars， pedestrians and cyclists

Download full size | View in the Article

Fig. 7. Visualization results of cars

Download full size | View in the Article

Fig. 8. Visualization results of pedestrians and cyclists

Download full size | View in the Article

算法1：N3D_DIOU_loss

输入：检测框 B^p、目标框B^g 、预测中心 C^p 和目标框中心 C^g：

B^p=（ $x_{1}^{p}$ ， $y_{1}^{p}$ ， $z_{1}^{p}$ ， $x_{2}^{p}$ ， $y_{2}^{p}$ ， $z_{2}^{p}$ ）

B^g=（ $x_{1}^{g}$ ， $y_{1}^{g}$ ， $z_{1}^{g}$ ， $x_{2}^{g}$ ， $y_{2}^{g}$ ， $z_{2}^{g}$ ）

C^p=（ $x_{c}^{p}$ ， $y_{c}^{p}$ ， $z_{c}^{p}$ ）

C^g=（ $x_{c}^{g}$ ， $y_{c}^{g}$ ， $z_{c}^{g}$ ）

输出：由于目标框与检测框事先与坐标轴对齐，可以确保： $x_{2}^{p}$ > $x_{1}^{p}$ ， $y_{2}^{p}$ > $y_{1}^{p}$ ， $z_{2}^{p}$ > $z_{1}^{p}$ ， $x_{2}^{g}$ > $x_{1}^{g}$ ， $y_{2}^{g}$ > $y_{1}^{g}$ ， $z_{2}^{g}$ > $z_{1}^{g}$

1. 计算B^g的体积：V^g=（ $x_{2}^{g}$ - $x_{1}^{g}$ ）·（ $y_{2}^{g}$ - $y_{1}^{g}$ ）·（ $z_{2}^{g}$ - $z_{1}^{g}$ ）

2. 计算B^p的体积：V^p=（ $x_{2}^{p}$ - $x_{1}^{p}$ ）·（ $y_{2}^{p}$ - $y_{1}^{p}$ ）·（ $z_{2}^{p}$ - $z_{1}^{p}$ ）

3. 计算两框交集的体积（Vⁱ）：

$x_{1}^{i}$ =max（ $x_{1}^{p}$ ， $x_{1}^{g}$ ）， $x_{2}^{i}$ =min（ $x_{2}^{p}$ ， $x_{2}^{g}$ ）

$y_{1}^{i}$ =max（ $y_{1}^{p}$ ， $y_{1}^{g}$ ）， $y_{2}^{i}$ =min（ $y_{2}^{p}$ ， $y_{2}^{g}$ ）

$z_{1}^{i}$ =max（ $z_{1}^{p}$ ， $z_{1}^{g}$ ）， $z_{2}^{i}$ =min（ $z_{2}^{p}$ ， $z_{2}^{g}$ ）

If $x_{2}^{i}$ > $x_{1}^{i}$ ， $y_{2}^{i}$ > $y_{1}^{i}$ ， $z_{2}^{i}$ > $z_{1}^{i}$ ：

$V^{i} = (x_{2}^{i} - x_{1}^{i}) \cdot (y_{2}^{i} - y_{1}^{i}) \cdot (z_{2}^{i} - z_{1}^{i})$

Otherwise： Vⁱ=0

4. 计算两框最小包围边界框的体积（V^c）：

$x_{1}^{c}$ =min（ $x_{1}^{p}$ ， $x_{1}^{g}$ ）， $x_{2}^{c}$ =max（ $x_{2}^{p}$ ， $x_{2}^{g}$ ）

$y_{1}^{c}$ =min（ $y_{1}^{p}$ ， $y_{1}^{g}$ ）， $y_{2}^{c}$ =max（ $y_{2}^{p}$ ， $y_{2}^{g}$ ）

$z_{1}^{c}$ =min（ $z_{1}^{p}$ ， $z_{1}^{g}$ ）， $z_{2}^{c}$ =max（ $z_{2}^{p}$ ， $z_{2}^{g}$ ）

$V^{c} = (x_{2}^{c} - x_{1}^{c}) \cdot (y_{2}^{c} - y_{1}^{c}) \cdot (z_{2}^{c} - z_{1}^{c})$

5. 计算目标框和检测框的中心之间的距离ρ，以及最小边界框的对角线距离c：

ρ²=（ $x_{c}^{p}$ - $x_{c}^{g}$ ）²+（ $y_{c}^{p}$ - $y_{c}^{g}$ ）²+（ $z_{c}^{p}$ - $z_{c}^{g}$ ）²

c²=（ $x_{2}^{c}$ - $x_{1}^{c}$ ）²+（ $y_{2}^{c}$ - $y_{1}^{c}$ ）²+（ $z_{2}^{c}$ - $z_{1}^{c}$ ）²

6. IOU_3D = $\frac{V^{i}}{V^{u}}$ ，其中 V^u=V^p+V^g-Vⁱ

7. $D I O U_3 D = I O U_3 D - \frac{ρ^{2} (b, b^{g t})}{c^{2}}$

8. DIOU_3D_loss=1 - DIOU_3D

9. N3D_DIOU_loss=

ω₃·DIOU_3D_loss+ω₄·L1_angle_loss

（ω₃和ω₄为权重系数，本文中分别设为0.5与0.03，L1_angle_loss是L1损失函数，用于监督角度偏差。）

Table 1.

Algorithm 1

Pseudo Code of N3D_DIOU_loss

View in the Article

算法

汽车

行人

骑车者

简单中等困难

MV3D^［1］

ContFusion^［28］

VoxelNet^［15］

F-PointNet^［31］

F-ConvNet^［24］

IPOD^［29］

PointPillars^［30］

71.29 62.68 56.56

86.32 73.25 67.81

81.97 65.46 62.85

83.76 70.92 63.65

89.02 78.80 77.09

84.10 76.40 75.30

79.05 74.99 68.30

N/A N/A N/A

57.86 53.42 48.87

70.00 61.32 53.59

N/A N/A N/A

69.60 62.30 54.60

52.08 43.53 41.49

N/A N/A N/A

67.17 47.65 45.11

77.15 56.49 53.37

N/A N/A N/A

81.90 57.10 54.60

75.78 59.07 52.92

本文算法

89.73 79.43 77.79

70.37 58.70 51.75

80.88 60.4356.93

Table 1. 3D detection AP （%） of cars， pedestrians and cyclists on KITTI val set

View in the Article

算法

汽车

行人

骑车者

简单中等困难

MV3D^［1］

ContFusion^［28］

VoxelNet^［15］

F-PointNet^［31］

F-ConvNet^［24］

IPOD^［29］

PointPillars^［30］

86.55 78.10 76.67

95.44 87.34 82.43

89.60 84.81 78.57

88.16 84.92 76.44

90.23 88.79 86.84

88.30 86.40 84.60

88.35 86.10 79.83

N/A N/A N/A

65.95 61.05 56.98

72.38 66.39 59.57

N/A N/A N/A

72.40 67.8059.70

58.66 50.23 47.19

N/A N/A N/A

74.41 52.18 50.49

81.82 60.03 56.32

N/A N/A N/A

84.30 61.80 57.70

79.14 62.25 56.00

本文算法

97.51 89.0586.99

72.59 63.57 59.21

86.21 65.6660.58

Table 2. BEV detection AP（%） of cars， pedestrians and cyclists on KITTI val set

View in the Article

算法

检测精度

简单中等困难

F-ConvNet

F-ConvNet+投票模型

F-ConvNet+N3D-DIOU_loss

F-ConvNet+投票模型+N3D-DIOU_loss

89.02 78.80 77.09

89.23 79.06 77.42

89.34 79.21 77.63

89.73 79.43 77.79

BEV

F-ConvNet

F-ConvNet+投票模型

F-ConvNet+N3D-DIOU_loss

F-ConvNet+投票模型+N3D-DIOU_loss

90.23 88.79 86.86

90.53 89.13 86.92

90.31 88.98 86.63

97.51 89.05 86.99

Table 3. 3D and BEV detection performance

View in the Article

算法

检测精度

简单中等困难

无

微调

F-ConvNet

F-ConvNet+投票模型

F-ConvNet+N3D-DIOU_loss

F-ConvNet+投票模型+N3D-DIOU_loss

86.51 76.57 68.17

87.73 77.00 68.42

88.06 77.49 68.76

88.47 77.83 69.04

参数

微调

F-ConvNet

F-ConvNet+投票模型

F-ConvNet+N3D-DIOU_loss

F-ConvNet+投票模型+N3D-DIOU_loss

89.02 78.80 77.09

89.23 79.06 77.42

89.34 79.21 77.63

89.73 79.43 77.79

Table 4. Comparison of parameter tuning experiments

Bao-qing GUO, Guang-fei XIE. Object detection algorithm based on image and point cloud fusion with N3D_DIOU[J]. Optics and Precision Engineering, 2021, 29(11): 2703

Download Citation

Tools

Save the article for my favorites

Paper Information

微信扫一扫：分享

微信扫一扫：分享