基于模糊弱监督标签相关性精炼的偏多标签学习

    Partial Multi-label Learning with Fuzzy Weakly Supervised Label Correlation Refinement

    • 摘要: 偏多标签学习的目标是训练一个抗噪声的分类器,即使在候选标签集仅部分标签有效的情况下,也能准确地为未知实例分配标签。目前大多数现有的方法都依赖于标签相关性假设:标签类别之间的相关性在不同数据集中具有一致性。然而,噪声的存在导致得到的先验标签相关性变得不可靠。为了解决这个问题,本文提出了一种新颖的偏多标签学习方法,称为基于模糊弱监督标签相关性精炼的偏多标签学习。首先,建立一个模糊框架通过聚类分析来学习模糊标签隶属度,由于它衡量实例和类原型之间的距离,因此可以将其视为模糊弱监督标签信息。本文首次提出在保留一定原标签空间全局结构的基础上,用由模糊标签隶属度度量的模糊标签相关性替代先验标签相关性。通过促使样本流形在标签置信度上一致性对应,这种新的标签相关性与样本相似性共同用于学习更精确的标签置信度。最后,利用学习到的标签置信度来训练一个针对未见实例的核化线性分类器。本文利用一种有效的迭代优化算法对所提出的方法进行求解。实验证明,与其他先进的偏多标签方法相比,所提出的模型表现出更优越的性能。

       

      Abstract: The goal of partial multi-label learning is to train a noise-robust classifier that can accurately assign labels to unknown instances, even when the candidate label set is only partially valid. Currently, most existing approaches rely on the label correlation assumption: the correlations between label categories maintain consistency across different datasets. However, the presence of noise makes the obtained prior label correlation unreliable. To tackle this problem, a novel partial multi-label learning approach with fuzzy weakly supervised label correlation refinement is proposed. First, a fuzzy framework is established to learn fuzzy label membership through clustering analysis, which can be viewed as fuzzy weakly supervised label information since it measures the distances between instances and class prototypes. This research first proposes to replace the prior label correlation with fuzzy label correlation measured by fuzzy label membership while preserving certain global structures of the original label space. By encouraging consistency correspondence of sample manifolds in label confidence, this new label correlation, together with sample similarity, is jointly employed to learn more precise label confidence. Finally, the learned label confidence is exploited to train a kernelized linear classifier for unseen instances. This proposed method is solved by using an effective iterative optimization algorithm. Extensive experiments demonstrate that the proposed model exhibits superior performance compared with other advanced partial multi-label learning methods.

       

    /

    返回文章
    返回