• Laser & Optoelectronics Progress
  • Vol. 58, Issue 16, 1600003 (2021)
Pei Wen1、2, Yinglei Cheng1、*, and Wangsheng Yu1
Author Affiliations
  • 1Information and Navigation College, Air Force Engineering University, Xi'an, Shaanxi 710077, China
  • 2The 93575 Unit of PLA, Chengde, Hebei 067000, China
  • show less
    DOI: 10.3788/LOP202158.1600003 Cite this Article Set citation alerts
    Pei Wen, Yinglei Cheng, Wangsheng Yu. Point Cloud Classification Methods Based on Deep Learning: A Review[J]. Laser & Optoelectronics Progress, 2021, 58(16): 1600003 Copy Citation Text show less
    Architecture of MVCNN for point cloud classification and segmentation
    Fig. 1. Architecture of MVCNN for point cloud classification and segmentation
    Architecture of GVCNN for point cloud classification and segmentation
    Fig. 2. Architecture of GVCNN for point cloud classification and segmentation
    Architecture of MHBN for point cloud classification and segmentation
    Fig. 3. Architecture of MHBN for point cloud classification and segmentation
    Architecture of 3D ContextNet for point cloud classification and segmentation
    Fig. 4. Architecture of 3D ContextNet for point cloud classification and segmentation
    Architecture of SEGCloud for point cloud semantic segmentation[57]
    Fig. 5. Architecture of SEGCloud for point cloud semantic segmentation[57]
    Architecture of PointNet++ for point cloud classification and segmentation[60]
    Fig. 6. Architecture of PointNet++ for point cloud classification and segmentation[60]
    Architecture of dense-resolution network for point cloud classification and segmentation[65]
    Fig. 7. Architecture of dense-resolution network for point cloud classification and segmentation[65]
    Architecture of RandLA-Net for point cloud semantic segmentation[66]
    Fig. 8. Architecture of RandLA-Net for point cloud semantic segmentation[66]
    Schematic diagram of a graph-based network[31]
    Fig. 9. Schematic diagram of a graph-based network[31]
    Architecture of SpecGCN for point cloud classification and segmentation[12]
    Fig. 10. Architecture of SpecGCN for point cloud classification and segmentation[12]
    Architecture of LKPO-GNN for point cloud classification and segmentation[81]
    Fig. 11. Architecture of LKPO-GNN for point cloud classification and segmentation[81]
    An illustration of the continuous and discrete convolutions for local neighborhoods of a point[31]. (a) Local neighborhoods of a point; (b) 3D continuous convolution; (c) 3D discrete convolution
    Fig. 12. An illustration of the continuous and discrete convolutions for local neighborhoods of a point[31]. (a) Local neighborhoods of a point; (b) 3D continuous convolution; (c) 3D discrete convolution
    Schematics of several typical convolutions. (a) Pointwise Conv[13]; (b) GeoConv[84]; (c) RIConv[88]; (d) SPHConv[99]; (e) convolutional layer of Convpoint[100]; (f) RS-Conv[96]
    Fig. 13. Schematics of several typical convolutions. (a) Pointwise Conv[13]; (b) GeoConv[84]; (c) RIConv[88]; (d) SPHConv[99]; (e) convolutional layer of Convpoint[100]; (f) RS-Conv[96]
    Principle of attention coefficients generation
    Fig. 14. Principle of attention coefficients generation
    Architecture of PATNet for point cloud classification and segmentation
    Fig. 15. Architecture of PATNet for point cloud classification and segmentation
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Multi-view based methodMVCNN[37]2015Learning to recognize 3D shapes from a collection of their rendered views on 2D images3D shape recognitionModelNet4090.10
    RCPCNN[39]2017Introducing a view clustering and pooling layer based on dominant sets3D object recognitionModelNet4093.80
    SnapNet[42]2017Transferring the very impressive results of 2D deep segmentation networks to 3D3D semantic segmentationSemantic 3D88.6070.8059.10
    SUN RGB-D67.40
    SnapNet-R[43]2017Using 3D-coherent synthesis of scene observations and mixing them in a multi-view framework for 3D labelingSemantic labeling of the scene perceived by a robotSUN RGB-D78.0439.61
    GVCNN[38]2018Using a grouping strategy3D shape classification and retrievalModelNet4093.10
    MHBN[40]2018Aggregating local convolutional features through bilinear pooling3D object recognitionModelNet4094.91
    ModelNet1092.23
    In the Ref. [44]2019Combining CNNs with LSTM to exploit the correlative information from multiple views3D shape recognition and 3D shape retrievalModelNet4091.05
    ModelNet1095.29
    LU-Net[45]2019Embedding 3D local features in 2D range-images; using a U-NetSolving the image processing problem of 3D LiDAR point cloudKITTI55.40
    3D-MiniNet[46]2020Combining 3D and 2D learning layers; learning the 2D representation through a novel projectionFast and efficient for 3D LIDAR point cloudSemantic-KITTI55.80
    KITTI58.00
    Volumetric methodVoxNet[47]2015Integrating a volumetric occupancy grid representation with a supervised 3D CNNReal-time object recognitionModelNet4085.9083.00
    ModelNet1092.00
    3D ShapeNet[48]2015Using aconvolutional deep belief network to represent a geometric 3D shape as a probability distribution of binary variables on a 3D voxel gridJoint object recognition and shape completion from 2.5D depth mapsModelNet4084.7077.30
    ModelNet1083.50
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Volumetric methodOctNet[49]2016Using a set of unbalanced octrees to exploit the sparsity in the input data to hierarchically partition the space3D object classification, orientation estimation and point cloud labelingModelNet4086.50
    ModelNet1090.90
    FPNN[55]2016Representing 3D spaces as volumetric fields, using field probing filters to extract features3D object recognitionModelNet4087.50
    FusionNet[59]2016Using both voxel and pixel representations for training relatively weak classifiers3D CAD models classificationModelNet4090.80
    ModelNet1093.11
    O-CNN[50]2017Storing the octant information and CNN features into the graphics memory and executing the entire O-CNN training and evaluation on the GPU3D object classification, shape retrieval, and shape segmentationModelNet4090.6085.90
    Kd-Net[52]2017Performing multiplicative transformations and sharing parameters according to the subdivisions of the point clouds imposed onto them by kdtrees3D shape classification, shape retrieval, and shape part segmentationModelNet4091.80
    ModelNet1094.0077.20
    3D ContextNet[53]2017Exploiting the local and global contextual cues imposed by the implicit space partition of the K-d tree for feature learning3D object classification and part segmentationS3DIS84.9074.5055.60
    SEGCloud[57]2017Combining 3D-FCNN, trilinear interpolation(TI), and fully connected conditional random fields (FC-CRF).3D semantic segmentation (indoor and outdoor scenes)Semantic 3D73.0860.30
    S3DIS57.3548.92
    NYUv266.8256.4343.45
    KITTI49.4636.78
    MSNet[54]2018Multi-scale voxelizationAdaptive and robust point cloud classificationMLS83.18
    TLS98.24
    ALS97.02
    PointGrid[56]2018Incorporating a constant number of points within each grid cell3D visual recognitionModelNet4092.0088.90
    ShapeNet86.1080.5086.40
    VV-net[58]2018Using a kernel-based interpolated variational autoencoder (VAE) architecture to encode the local geometry within each voxel3D object segmentation into parts and scenes segmentation into individual objects; normal estimationShapeNet87.40
    S3DIS87.7878.22
    Table 1. Comparison for methods based on regular representation
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Neighboring feature poolingPointNet[11]2017Using a single symmetric function, max pooling3D object classification, part segmentation, scene semantic parsingModelNet4089.286.2
    ShapeNet83.7
    S3DIS78.6247.71
    Point-Net++[60]2017Processing a set of points sampled in a metric space using a hierarchical fashionProcessing point sets sampled in a metric spaceModelNet4091.9
    ShapeNet85.1
    PointSIFT[61]2018Stacking several orientation-encoding units to achieve multi-scale representationImproving 3D shape representationS3DIS88.7270.23
    SO-Net[62]2018Building a self-organizing map (SOM) to model the spatial distribution of point cloudPoint cloud reconstruction, classification, object part segmentation and shape retrievalModelNet4093.4
    ShapeNet84.6
    3DMAX-Net[63]2018Multi-scale contextual feature learning, local and global feature aggregation3D semantic segmentation on large-scale point cloudsS3DIS79.547.5
    PointWeb[64]2019Using adaptive feature adjustment (AFA) module to find the interaction between points3D point cloud segmentation and classificationModelNet4092.389.4
    S3DIS86.9766.6460.28
    In Ref. [65]2020Learning local point features from point cloud in different resolutionsPoint cloud analysisModelNet4093.1
    ShapeNet86.4
    ScanObjectNN80.3
    RandLA-Net[66]2020Using random point sampling instead of more complex point selection approaches3D semantic segmentation on large-scale point cloudsSemantic 3D94.877.4
    KITTI53.9
    Graph-based methodsIn Ref. [70]2017Performing convolutions over local graph neighborhoods exploiting edge labelsGraph classificationModelNet4087.4
    ModelNet1090.8
    SPG[71]2018Capturing the organization of 3D point clouds by superpoint graph (SPG)3D semantic segmentation on large-scale point cloudsSemantic3D92.976.2
    S3DIS85.573.062.1
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Graph-based methodsSpecGCN[12]2018Leveraging the power of spectral graph CNNs in the PointNet++ framework while adopting a different pooling strategy3D point cloud segmentation and classificationModelNet4091.5
    ShapeNet84.6
    RGCNN[74]2018Adding graph-signal smoothness a prior in the loss function3D point cloud segmentation and classificationModelNet4090.587.3
    ShapeNet84.3
    DGCNN[75]2018Using EdgeConv to capture and exploit fine-grained geometric properties of point clouds3D point cloud segmentation and classificationModelNet4092.290.2
    ShapeNet85.1
    LDGCNN[76]2019Removing the transformation network; linking hierarchical features from different dynamic graphs3D point cloud segmentation and classificationModelNet4092.990.3
    ShapeNet85.1
    In Ref. [72]2019Using a simple point embedding network and a new graph-structured loss function3D semantic segmentation on large-scale point cloudsS3DIS87.978.368.4
    vKITTI84.367.352.0
    In Ref. [77]2019Stacking DPAM module to gradually agglomerate points3D point cloud segmentation and classificationModelNet4091.9
    ModelNet1094.6
    ShapeNet86.1
    S3DIS64.5
    HDGCN[79]2019Combining the hierarchical structure and the DGConv block to extract both local and global features of point clouds hierarchically3D semantic segmentation (indoor and outdoor scenes)S3DIS76.1166.85
    Paris-Lille-3D68.30
    PointNGCNN[80]2020Using the Chebyshev polynomials as the graph filters to extract features in the neighborhood of each pointCapturing the potential geometric information of 3D objectsModelNet4092.8
    ShapeNet85.6
    S3DIS87.3
    ScanNet84.9
    LKPO-GNN[81]2020Using LKPO-GNN to select multi-directional k-NNs to form the local topological structure of a centroidObtaining deeper feature representationModelNet4091.488.9
    ShapeNet85.6
    S3DIS85.864.6
    ScanNet85.358.4
    CPL-Net[82]2020Using critical points layer (CPL) to reduce the number of points in an unordered point cloud and retain the important (critical) ones3D object classificationModelNet4092.4190.53
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Kernel-based convolutionPointwise CNN[13]2017Pointwise convolution which can be applied at each point in a point cloud to learn point-wise features3D semantic segmentation and object recognitionS3DIS74.1
    PointCNN[83]2018X-Conv; weighting and permuting input points and features before processed by a typical convolutionLeveraging spatially-local correlation from data represented in point cloudModelNet4092.588.8
    S3DIS65.39
    ShapeNet84.6
    ScanNet79.755.7
    PCCN[91]2018Exploiting parameterized kernel functions which span the full continuous vector spacePoint cloud segmentation (indoor and outdoor scenes), lidar motion estimation of driving scenesStanford Large-Scale 3D Indoor Scene Dataset67.0158.27
    Driving Scenes Dataset95.4558.06
    SpiderCNN[92]2018SpiderConv; extending convolutional operations from regular grids to irregular point sets3D point cloud segmentation and classificationModelNet4092.4
    ShapeNet85.3
    GeoCNN[84]2019GeoConv; modeling the geometric structure of points by a decomposition and aggregation method based on vector decomposition3D shape classification, segmentation and object detectionModelNet4093.991.6
    InterpCNN[85]2019Interp Conv; using discrete convolutional kernels and an interpolation function to explicitly measure geometric relations between input point clouds and kernel-weight coordinates3D shape classification, object part segmentation and indoor scene semantic parsingModelNet4093.0
    S3DIS88.766.7
    ShapeNet86.3
    A-CNN[86]2019Capturing the local neighborhood geometry of each point by specifying the (regular and dilated) ring-shaped structures and directions in the computationObject classification, part segmentation, and semantic segmentation in large-scale scenesModelNet4092.690.3
    ModelNet1095.595.3
    S3DIS87.3
    ShapeNet86.1
    ScanNet85.4
    In Ref. [88]2019RIConv; using low-level rotation invariant geometric features such as distances and angles to design a convolution operator for point cloud learning3D object classification and segmentationModelNet4086.5
    ShapeNet75.5
    Driving Scenes Dataset95.4558.06
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Kernel-based convolutionIn Ref. [93]2019PointConv; taking the positions of point clouds as input and learning an MLP to approximate a weight function, then applying a inverse density scale on the learned weights to compensate the non-uniform sampling3D semantic segmentation; convolutional networks in 2D images of a similar structureModelNet4092.5
    ShapeNet85.7
    ScanNet55.6
    In Ref. [95]2019KPConv; using a set of kernel points to define the area where each kernel weight is appliedAdapting to the geometry of the scene objectsModelNet4092.9
    ShapeNet86.4
    SPHNet[99]2019SPHConv; employing a spherical harmonics based kernel at different layers of the network3D shape deep learning tasksModelNet4087.7
    ConvPoint[100]2019Using continuous convolution and a hierarchical data representation structure based on a search treeLarge scale indoor and outdoor semantic segmentationModelNet4092.589.6
    ShapeNet85.8
    RS-CNN[96]2020RS-Conv; learning from the geometric topology constraint among pointsEncoding meaningful shape information in 3D point cloudModelNet4093.6
    ShapeNet86.2
    Attention-based methodsA-SCN[113]2018Adopting shape context as the basic building block acting like convolution in CNN3D point cloud classification and segmentationShapeNet84.6
    PAN[109]2018Combining Feature Pyramid Attention (FPA) module and Global Attention Upsample (GAU)3D point cloud semantic segmentation (urban scenes)PASCAL VOC 201295.784.0
    Cityscapes78.6
    PryramNet[110]2019Combining Graph Embedding Module(GEM) and Pyramid Attention Network(PAN)3D object classification and semantic segmentationModelNet4091.588.3
    S3DIS85.655.6
    ShapeNet83.9
    GAPNet[107]2019GAPLayer; embedding graph attention mechanism within stacked Multi-Layer-Perceptron (MLP) layers to learn local geometric representations3D shape classification and part segmentationModelNet4092.489.7
    ShapeNet84.7
    MethodYearKey ideaApplication scenarioDatasetAccuracy /%
    OAMAmIoU
    Attention-based methodsPAT[108]2019Using a parameter-efficient Group Shuffle Attention (GSA) to replace the costly Multi-Head Attention; Gumbel Subset Sampling (GSS)Hierarchical multiple instance learningModelNet4091.7
    S3DIS64.28
    LSANet[111]2019Generating Spatial Distribution Weights (SDWs) hierarchically based on the spatial relationship in local region for spatial independent operations3D object classification, part segmentation, and semantic segmentationModelNet4092.389.2
    S3DIS86.862.2
    ShapeNet83.2
    ScanNet85.1
    In Ref. [114]2019Attention-based score refinement (ASR) moduleImproving the segmentation accuracyShapeNet85.6
    GACNet[106]2020Assigning proper attentional weights to different neighboring points3D point cloud semantic segmentationSemantic3D91.970.8
    S3DIS87.7962.85
    In Ref. [115]2020Local Attention-Edge Convolution (LAE-Conv); constructing a local graph based on the neighborhood points searched in multi-directionsPredicting dense labels for 3D point cloud segmentationS3DIS88.9566.3
    ShapeNet85.9
    ScanNet88.646.9
    Table 2. Comparison for methods based on original point clouds
    Pei Wen, Yinglei Cheng, Wangsheng Yu. Point Cloud Classification Methods Based on Deep Learning: A Review[J]. Laser & Optoelectronics Progress, 2021, 58(16): 1600003
    Download Citation