用于三维点云分割和分类的高分辨率特征网络

    High-resolution Feature Network for 3D Point Cloud Segmentation and Classification

    • 摘要: 多尺度特征在点云领域的密集预测任务中至关重要。当前三维点云处理技术主要依赖编码器−解码器框架,通过主干网络提取并融合多尺度特征。然而,这些方法通常采用延迟融合策略,导致特征集成不足。为解决这一问题,本文提出了HRFN3D(High-resolution Feature Network for 3D Point Cloud)模型,一种专为点云分类和分割任务设计的高分辨率特征网络。HRFN3D通过创新性的关系学习模块,在早期阶段进行特征融合,促进低分辨率高语义点与高分辨率低语义点的交互,使高分辨率点在早期阶段就保留高语义信息,优化后续特征学习。在后期,结合不同池化策略生成全局特征向量,并与原始点特征拼接,既保留细节,又增强全局特征的代表性。实验结果显示,HRFN3D在ShapeNetPart数据集上将类平均交并比和实例平均交并比分别提升了2.2个百分点和0.9个百分点,并获得了最佳实例平均交并比86.3%;在ModelNet40数据集上,以4.3 M的参数量实现了91.5%的最高类平均精度。这些结果验证了HRFN3D在多尺度特征处理中的有效性。

       

      Abstract: Multi-scale features are critical in dense prediction tasks within the point cloud domain. Existing 3D point cloud processing techniques predominantly rely on encoder-decoder frameworks, which extract and integrate multiscale features via a backbone network. However, these methods often employ delayed fusion strategies, resulting in insufficient feature integration. To address this issue, this paper introduces a novel high-resolution feature network for 3D point cloud, named HRFN3D, specifically for point cloud classification and segmentation tasks. HRFN3D innovatively employs a relational learning module to perform feature fusion at an early stage, facilitating interactions between low-resolution high-semantic points and high-resolution low-semantic points. This early fusion ensures that high-resolution points retain semantic information from the outset, facilitating subsequent feature learning. In the later stage, the global feature vectors are generated by combining different pooling strategies and spliced with the original point features, preserving the details and enhancing the representation of the global features. The experimental results show that HRFN3D improves the Class mean and Instance mean Intersection over Union by 2.2 percentage point and 0.9 percentage point, respectively, and achieves the average class ratio of 86.3%. On the ModelNet40 data set, our proposed method achieves the highest class average accuracy of 91.5% with 4.3M parameters. These results validate the effectiveness of HRFN3D in multi-scale feature processing.

       

    /

    返回文章
    返回