篮球比赛视频中相机观察视角优化及运动员轨迹重建

    Camera Viewpoint Optimization and Player Motion Track Reconstruction in Basketball Game Videos

    • 摘要: 重建篮球运动员在比赛中的移动轨迹对于分析比赛势态、提高球队技战术水平有重要作用。在比赛过程中,通常会有多个相机从不同视角同时拍摄球场上的运动员。选择最优的相机观察视角是准确重建运动员移动轨迹的关键环节之一。当前单视角主流方法存在视野范围有限、易出现身份混淆和视野盲区等问题。为此,本文提出了一种自动选择最优观察视角的方法。首先,通过跨相机视角重识别同一名运动员以验证该方法可行性;其次,为了筛选所需的备选视角,融合损失注意力机制(Loss Attention Mechanism, LAM) 改进了多示例学习模型(Multi-Instance Learning, MIL) 及其损失函数;第三,构建分数评估机制,并以此为依据筛选出最优观察视角。实验结果表明,本文方法在公开数据集APIDIS上高阶跟踪精度(Higher Order Tracking Accuracy, HOTA) 指标相比单视角主流方法(如ByteTrack: Multi-Object Tracking by Associating Every Detection Box等) 提高了10%左右,IDF1(Identity F1 Score) 指标提高了30%左右,说明了本文的方法不仅可行且性能优良;同时在数据集二上可以得到相似的结果,说明了本文模型具有良好的泛化性能。本文提出的最优观察视角选择模型不仅在一定程度上解决了单视角下的身份(Identity Document, ID) 混淆和视野盲区问题,而且为重建运动员的全局移动轨迹奠定了基础。

       

      Abstract: Reconstructing the movement trajectories of basketball players during matches plays a significant role in analyzing game dynamics and optimizing team tactics. While multi-camera systems capture player movements from various perspectives, selecting the optimal observation viewpoint is crucial for accurate player trajectory reconstruction. Existing single-view methods face inherent limitations, such as restricted field of view, identity confusion, and occlusion-induced blind spots. To address these issues, This paper proposes an automatic optimal observation viewpoint selection method. Firstly, cross-camera re-identification is employed to verify the feasibility of consistently tracking the same player across views. Secondly, we improve the Multi-Instance Learning (MIL) framework by integrating a Loss Attention Mechanism (LAM) to enhance candidate viewpoint filtering through loss function optimization. Finally, a scoring evaluation mechanism is established to select the most suitable tracking viewpoint. Experimental results on the public APIDIS dataset demonstrate that our method outperforms mainstream single-view tracking techniques such as ByteTrack by achieving approximately 10% higher HOTA and 30% improvement in IDF1. Comparable results are obtained on an additional dataset, confirming the robustness of the approach. The proposed model not only mitigates ID confusion and occlusion challenges common in single-view systems but also provides a foundation for reconstructing global player trajectories. This advancement offers practical value for technical-tactical analysis in sports science.

       

    /

    返回文章
    返回