广东工业大学学报 ›› 2015, Vol. 32 ›› Issue (04): 150-154.doi: 10.3969/j.issn.1007-7162.2015.04.027

• 综合研究 • 上一篇    

基于视觉字典容量自动获取的LDA场景分类研究

张艺,钟映春,陈俊彬   

  1. 广东工业大学自动化学院,广东 广州 510006
  • 收稿日期:2014-09-09 出版日期:2015-12-04 发布日期:2015-12-04
  • 作者简介:张艺(1987-),男,硕士研究生,主要研究方向为图像理解.
  • 基金资助:

    广东省科技计划项目(2010A030500006 )

Research on Scene Classification of LDA Automatically Obtained by Visual Dictionary Capacity

Zhang Yi, Zhong Ying-chun, Chen Jun-bin   

  1. School of Automation, Guangdong University of Technology, Guangzhou 510006, China
  • Received:2014-09-09 Online:2015-12-04 Published:2015-12-04

摘要: 提出了一种高效获取词包模型中视觉字典容量的方法,并研究了该方法与隐狄利克雷分配模型(Latent Dirichlet Allocation,LDA )相结合情况下的场景分类性能.在用SIFT特征构建场景图像数据集特征矩阵的基础上,首先采用吸引子传播方法获取场景图像集特征矩阵的合理聚类数目族,并将其中的最小聚类数目作为视觉字典容量,进而生成视觉字典;然后利用所构建视觉字典中的单词描述场景图像训练集和测试集;最后采用LDA模型对场景图像测试集进行场景分类实验.实验结果表明,提出的方法不仅保持了较高场景分类准确率,同时显著提高了场景分类的效率.

关键词: 词包模型; 视觉单词; 视觉字典; 隐狄利克雷分配模型

Abstract: An approach is proposed to obtain the dictionary capacity of bag of words(BoW) model efficiently, which is combined with The Latent Dirichlet Allocation (LDA) model to analyze the performance of scene category. Based on the feature matrix of scene image data sets constructed by SIFT feature, the affinity propagation method is firstly employed to obtain the clustering numbers, and to take the minimal clustering number as the visual dictionary capacity before generating a visual dictionary. Secondly, the scene training and testing sets are described by these visual words. Finally, the LDA model is employed to classify the testing data set. The experiments show that the proposed approach in this paper maintains higher accuracy of scene classification and can improve efficiency greatly.

Key words: bag of words; visual words; visual dictionary; LDA model

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!