基于高效时序建模的RGB-T跟踪

    Efficient Temporal Modeling for RGB-T Tracking

    • 摘要: 可见光−热红外(RGB-Thermal,RGB-T)跟踪方法利用可见光和热红外图像的互补性,提升了在低光照以及恶劣天气等场景下目标跟踪的准确度。然而,现有的研究大多局限于图像级外观匹配,难以应对目标形变、复杂环境干扰等挑战。对此,本文提出了一种基于高效时序建模的跟踪方法(Efficient Temporal Modeling for RGB-T Tracking, ETMTrack) 。首先,利用时序信息建模,并改进特征融合模块以处理时序信息。然后,利用轻量级适配器微调改进热红外图像特征提取模块,提高模型对不同模态信息的特征提取能力,并降低计算显存,提高训练效率。最后,提出一种动态模板更新和选择方法,充分挖掘并利用时序信息,提高模型性能。本文模型在3个公开数据集上达到了最先进的性能,在遮挡、相似外观等挑战上表现出色,验证了时序建模跟踪算法的有效性和鲁棒性。

       

      Abstract: RGB-Thermal (RGB-T) tracking methods utilize the complementarity of visible light and thermal infrared images to improve the accuracy of target tracking in the scenarios of low light conditions and adverse weather. However, most existing studies focus only on image-level appearance matching, making them difficult to cope with challenges of target deformation and interference under complex environments. To address this problem, a tracking method based on efficient temporal modeling is proposed. Firstly, the temporal information is modeled, and the feature fusion module is improved to process temporal information. Then, a lightweight adapter is used for fine-tuning to improve the feature extraction module for thermal infrared images, enhancing the model's ability to extract features from different modal information, reducing the computational memory usage, and improving the training efficiency. Finally, a dynamic template update and selection method is proposed to fully explore and utilize temporal information, thereby improving the model's performance. ETMTrack achieves state-of-the-art performance on three public datasets, and performs excellently in dealing with challenges such as occlusion and similar appearances, demonstrating the effectiveness and robustness of the tracking algorithm based on temporal modeling.

       

    /

    返回文章
    返回