计算机工程与应用 ›› 2023, Vol. 59 ›› Issue (8): 182-191.DOI: 10.3778/j.issn.1002-8331.2112-0412

• 模式识别与人工智能 • 上一篇    下一篇

集图卷积和三维方向卷积的点云分类分割模型

兰红,陈浩,张蒲芬   

  1. 江西理工大学 信息工程学院,江西 赣州 341000
  • 出版日期:2023-04-15 发布日期:2023-04-15

Point Cloud Classification and Segmentation Model Based on Graph Convolution and 3D Direction Convolution

LAN Hong, CHEN Hao, ZHANG Pufen   

  1. College of Information Engineering, Jiangxi University of Technology, Ganzhou, Jiangxi 341000, China
  • Online:2023-04-15 Published:2023-04-15

摘要: 现有的深度学习方法在提取点云的局部特征时往往忽略了节点间的位置关系和方向信息,导致不能有效地学习点云的局部特征。为解决这一问题,提出一种集图卷积和三维方向卷积的点云分类分割模型GCN3D。GCN3D模型将图卷积神经网络应用在点云分类分割领域。将点云视作图上的节点,对每个节点求其[K]近邻,建立局部[K]近邻邻域内两两节点之间的边,并通过图卷积神经网络参数化边特征以捕捉节点间局部位置关系并更新中心节点特征;使用方向编码模块将节点的邻域划分为八个方位的细粒度的邻域小块,并按照三维空间坐标轴的方向依次将局部邻域结构内的节点特征映射到不同细粒度邻域空间内以提取节点间的方向信息,并且叠加两个方向编码模块增大网络的感受野,提高模型对于稀疏点云数据的鲁棒性并获取局部邻域多尺度特征。在ModelNet40数据集和ShapeNet数据集上分别进行点云分类和点云部分分割的实验。结果表明,相比没有考虑局部特征信息的PointNet,GCN3D模型在ModelNet40数据集上的总体分类精度提高了3.8个百分点,平均分类精度提高了4.3个百分点;在ShapeNet数据集上的平均交并比提高了1.5个百分点。相比其他深度学习模型性能有不同程度的提高。

关键词: 点云, 分类分割, 图卷积神经网络, 三维方向卷积, 细粒度邻域, 多尺度

Abstract: The existing deep learning methods often ignore the position relationship and direction information between nodes when extracting the local features of point cloud, resulting in the inability to effectively learn the local features of point cloud. To solve this problem, a GCN3D model for point cloud classification and segmentation based on graph convolution and 3D direction convolution is proposed. GCN3D model applies graph convolution neural network to the field of point cloud classification and segmentation. Firstly, the point cloud is regarded as a node on the graph, its [K]-nearest neighbor is obtained for each node, the edge between two nodes in the local [K]-nearest neighbor neighborhood is established, and the edge features are parameterized by graph convolution neural network to capture the local position relationship between nodes and update the central node features. Then, the neighborhood of the node is divided into fine-grained neighborhood blocks in eight directions by using the direction coding module, and the node features in the local neighborhood structure are mapped into different fine-grained neighborhood spaces according to the direction of the 3D spatial coordinate axis to extract the direction information between the nodes, and two direction coding modules are superimposed to increase the receptive field of the network, improve the robustness of the model to sparse point cloud data and obtain local neighborhood multi-scale features. The experiments of point cloud classification and point cloud partial segmentation are carried out on ModelNet40 dataset and ShapeNet dataset respectively. The results show that compared with PointNet without considering local feature information, the overall classification accuracy of GCN3D model on ModelNet40 data set is improved by 3.8 percentage points, and the average classification accuracy is improved by 4.3 percentage points. The average intersection ratio on ShapeNet dataset is improved by 1.5 percentage points. Compared with other deep learning models, the performance of the model is improved in varying degrees.

Key words: point cloud, classification and segmentation, graph convolution neural network, three dimensional direction convolution, fine grained neighborhood, multiscale