基于缺失区域图像引导的点云补全方法

    A Point Cloud Completion Method Guided by Image Information of Missing Regions

    • 摘要: 视图引导的点云补全(View-guided Point Cloud Completion, ViPC) 通过引入图像模态提升点云补全质量,但现有方法在跨模态注意力机制中引入图像背景干扰,且未能充分利用与点云缺失区域对应的图像信息,同时其补全解码器在结构建模能力上仍显不足。针对上述问题,本文提出跨模态引导模块反转取关键(Reverse to Get the Key,RGK) ,由反转交叉注意力机制(Reversed Cross Attention, RevCA) 与缺失区域图像辅助补全解码器(Missing Region Image guided Completion Decoder, MRICD) 构成。RevCA在标准交叉注意力基础上引入逐点动态相似度门控以抑制非关键图像区域注意力,并通过反向注意力权重重分布强化与缺失区域相关的图像特征,从而提升跨模态交互的有效性。MRICD基于RevCA关键思想从初始图像序列(token) 筛选出缺失区域图像辅助序列,并利用交叉注意力机制实现辅助信息交互,实现高精度补全。在ShapeNet-ViPC数据集上的实验结果表明,将RGK集成至基线网络后整体性能均优于基线模型及其他对比方法,证明了RGK能有效提升跨模态补全过程的关键特征提取能力与点云结构恢复质量。

       

      Abstract: View-guided point cloud completion (ViPC) enhances completion quality by incorporating image modality, yet existing methods often introduce background interference in cross-modal attention, underutilize image cues aligned with missing regions, and employ decoders with limited structural modeling capacity. To address these limitations, in this work, RGK (Reverse to Get the Key) , a cross-modal guidance module composed of Reversed Cross Attention (RevCA) and a Missing Region Image-guided Completion Decoder (MRICD) is proposed. RevCA augments standard cross-attention through point-wise dynamic similarity gating and reversed attention redistribution to emphasize features associated with missing regions. MRICD further selects an auxiliary image token sequence based on RevCA and performs cross-attention-based interaction for accurate completion. Experiments on ShapeNet-ViPC show that integrating RGK into baseline networks consistently improves performance over baselines and competing methods, demonstrating its effectiveness in extracting key cross-modal features and restoring point cloud structures.

       

    /

    返回文章
    返回