稀疏传感器动作捕捉物理约束的深度模型求解方法

    A Deep Model for Solving Physical Constraints in Human Motions Captured by Sparse Sensors

    • 摘要: 采用稀疏传感器捕捉人体动作,并驱动计算机中的虚拟人复现该动作,是虚拟现实等领域的关键技术之一;其中,复现动作的相关计算需同时满足运动约束与物理约束。目前,主要采用非线性优化方法解决此类计算的物理约束问题,该方法存在运算时间长、计算复杂度高、需要设计专用优化器等不足。为此,本文提出了一种基于深度神经网络模型求解稀疏传感器动作捕捉物理约束优化问题的方法。首先,将多模态特征融合网络和多路精练网络有效结合形成一种后段融合多层级结构作为基础模型。其次,通过渐进式融合的方法建立基础模型层间的反向投影连接,实现基础模型的迭代,使得深层融合的信息能够被浅层使用。第三,提出了一种包含物理约束的组合加权损失函数,以适用于存在隐式和显式物理约束情况下对深度模型的微调。实验结果表明,本文提出的方法不仅可行性好,而且计算效率提高了20 个百分点;与其他常用优化方法相比较,提出的方法在4种主流评价指标上均有良好的表现;在其他数据集上运行提出的方法,同样得到良好的结果,证明提出方法具有较好的泛化性。该方法为形成端到端的稀疏传感器动作捕捉深度模型提供了一种新的思路。

       

      Abstract: Using sparse sensors to capture human movements and driving virtual humans in computers to reproduce these movements is one of the key technologies in fields such as virtual reality, among which, the relevant calculations for reproducing movements must simultaneously satisfy motion constraints and physical constraints. Currently, nonlinear optimization methods are primarily employed to address the physical constraints in such computations. However, these methods suffer from several drawbacks, including long computation time, high computational complexity, and the necessity of designing dedicated optimizers. To address these issues, a method based on a deep neural network model for solving the physical constraints optimization in sparse sensor motion capture is proposed. Firstly, the model effectively combines the multi-modal feature fusion network and the multi-path refinement network to form a post-fusion multi-level structure, which serves as the basic model of this research. Secondly, through a progressive fusion method, back-projection connections between the layers of the basic model are established, enabling the iteration of the basic model. This allows the information fused at deeper layers to be utilized by the shallower layers. Thirdly, a loss function in the form of combined weighting that incorporates physical constraints is proposed, which is suitable for fine-tuning the deep model in the presence of both implicit and explicit physical constraints. The experimental results demonstrate that the proposed method not only exhibits good feasibility but also improves computational efficiency by approximately 20 percentage points. Compared with other commonly used optimization methods, the proposed method performs better in terms of four mainstream evaluation indicators. Additionally, the method yields favorable results when applied to other datasets, demonstrating its strong generalization capability. This method provides a new perspective for the formation of an end-to-end deep model for sparse sensor motion capture.

       

    /

    返回文章
    返回