Visual Categorization with Bags of Keypoints

1. Introduction and background

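This is one of this week's papers, and one of the foundational bag-of-features papers. The goal is to understand its basic ideas and the techniques it uses, and to make the details as clear as possible.


The basic principle of a bag of keypoints is: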

A bag of keypoints corresponds to a histogram of the number
of occurrences of particular image patterns in a given image.
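
To make the definition concrete, here is a minimal sketch of such a histogram. It assumes each image patch has already been mapped to the integer index of an image pattern (a "visual word"); the function name `bag_of_keypoints` and the toy indices are illustrative only and do not come from the paper.

```python
from collections import Counter

def bag_of_keypoints(word_ids, vocab_size):
    """Return a histogram: entry t counts how often pattern t occurs in the image."""
    counts = Counter(word_ids)
    return [counts.get(t, 0) for t in range(vocab_size)]

# Hypothetical image whose 7 patches were mapped to a 4-word vocabulary.
example_bag = bag_of_keypoints([0, 2, 2, 3, 0, 2, 1], vocab_size=4)
print(example_bag)  # [2, 1, 3, 1]
```

Note that the histogram discards where in the image each pattern occurred; only the counts remain.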

2. The main steps

The main steps of our method are:
• Detection and description of image patches
  (Note: "patch" literally means a small block, but here its meaning is close to "pattern".)
• Assigning patch descriptors to a set of predetermined clusters (a vocabulary) with a vector quantization algorithm
  (Note: this second step maps each descriptor to one of the clusters; the set of clusters is the vocabulary.)
• Constructing a bag of keypoints, which counts the number of patches assigned to each cluster
• Applying a multi-class classifier, treating the bag of keypoints as the feature vector, and thus determine which category or categories to assign to the image
  (Note: a multi-class classifier is run on the histogram, which finally gives the category of the image.)
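
The second and third steps can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the descriptors are toy 2-D points (real patch descriptors have many more dimensions), the cluster centres are given rather than learned, and vector quantization is taken to mean nearest-centre assignment under squared Euclidean distance. The helper names are hypothetical.

```python
def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest_centre(descriptor, centres):
    """Vector quantization: index of the cluster centre closest to the descriptor."""
    return min(range(len(centres)),
               key=lambda t: squared_distance(descriptor, centres[t]))

def image_to_bag(descriptors, centres):
    """Count how many of the image's descriptors fall into each cluster."""
    bag = [0] * len(centres)
    for d in descriptors:
        bag[nearest_centre(d, centres)] += 1
    return bag

# Toy vocabulary of three centres and four toy descriptors from one image.
toy_centres = [(0.0, 0.0), (10.0, 10.0), (0.0, 10.0)]
toy_descriptors = [(1.0, 0.5), (9.0, 11.0), (0.5, 9.0), (-1.0, 1.0)]
toy_bag = image_to_bag(toy_descriptors, toy_centres)
print(toy_bag)  # [2, 1, 1]
```

The resulting list is the feature vector that the multi-class classifier in the last step receives.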

3. Therefore, the steps involved in training the system allow consideration of multiple possible vocabularies:
• Detection and description of image patches for a set of labeled training images
• Constructing a set of vocabularies: each is a set of cluster centres, with respect to which descriptors are vector quantized.
• Extracting bags of keypoints for these vocabularies
  (Note: this extracts the keypoints for each vocabulary. How are these keypoints defined? The paper says: we refer to the quantized feature vectors (cluster centres) as "keypoints" by analogy with "keywords" in text.)
• Training multi-class classifiers using the bags of keypoints as feature vectors
  (Note: this trains the classifiers. The paper uses two: a Naïve Bayes classifier and an SVM.)
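
One way to "consider multiple vocabularies" is to extract bags for each candidate and keep the vocabulary whose bags classify best. The sketch below assumes exactly that selection rule, which these notes do not spell out. It also swaps in a leave-one-out 1-nearest-neighbour check as a stand-in for the Naïve Bayes and SVM classifiers, purely to keep the example short. All names and the 1-D toy data are hypothetical.

```python
def quantize(descriptor, centres):
    """Index of the nearest cluster centre (squared Euclidean distance)."""
    return min(range(len(centres)),
               key=lambda t: sum((a - b) ** 2 for a, b in zip(descriptor, centres[t])))

def extract_bag(descriptors, centres):
    bag = [0] * len(centres)
    for d in descriptors:
        bag[quantize(d, centres)] += 1
    return bag

def leave_one_out_accuracy(bags, labels):
    """Stand-in evaluation: classify each bag by its nearest other bag."""
    correct = 0
    for i, bag in enumerate(bags):
        others = [j for j in range(len(bags)) if j != i]
        nearest = min(others,
                      key=lambda j: sum((a - b) ** 2 for a, b in zip(bag, bags[j])))
        correct += labels[nearest] == labels[i]
    return correct / len(bags)

def pick_vocabulary(images, labels, vocabularies):
    """Return (accuracy, name) of the best-scoring candidate vocabulary."""
    scored = []
    for name, centres in vocabularies.items():
        bags = [extract_bag(descs, centres) for descs in images]
        scored.append((leave_one_out_accuracy(bags, labels), name))
    return max(scored)

# Four toy images with three 1-D descriptors each; two classes.
toy_images = [[(0.1,), (0.3,), (-0.2,)], [(0.0,), (0.5,), (0.2,)],
              [(9.8,), (10.1,), (10.4,)], [(10.0,), (9.5,), (10.2,)]]
toy_labels = [0, 0, 1, 1]
candidates = {"coarse": [(5.0,)], "fine": [(0.0,), (10.0,)]}
best = pick_vocabulary(toy_images, toy_labels, candidates)
print(best)  # (1.0, 'fine')
```

With a single cluster every image gets the same bag, so the "coarse" vocabulary cannot separate the classes, while the two-cluster vocabulary can.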

4. Feature extraction


5. Visual vocabulary construction

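Overall goal: the vocabulary is a way of constructing a feature vector for classification that relates "new" descriptors in query images to descriptors previously seen in training. In practice, this means building the corresponding descriptor representation.

After discussing several alternatives, the authors choose the widely used k-means algorithm to build the vocabulary.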
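
For reference, a minimal sketch of plain k-means (Lloyd's algorithm) used to build a vocabulary. Several choices here are assumptions made for illustration, not details from the paper: the initial centres are taken at evenly spaced positions in the input list so the example is deterministic, the iteration count is fixed, and the 2-D points stand in for real descriptors.

```python
def kmeans(points, k, iterations=20):
    """Plain Lloyd's k-means; returns k cluster centres (the vocabulary)."""
    # Deterministic initialisation (a simplification for this sketch).
    centres = [list(points[i * len(points) // k]) for i in range(k)]
    for _ in range(iterations):
        # Assignment step: group points by their nearest centre.
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k),
                          key=lambda t: sum((a - b) ** 2 for a, b in zip(p, centres[t])))
            groups[nearest].append(p)
        # Update step: move each centre to the mean of its group.
        for t, group in enumerate(groups):
            if group:  # keep the old centre if a cluster ends up empty
                centres[t] = [sum(col) / len(group) for col in zip(*group)]
    return centres

# Two well-separated toy blobs of "descriptors".
blob_points = [(0, 0), (0, 1), (1, 0), (1, 1),
               (10, 10), (10, 11), (11, 10), (11, 11)]
vocabulary = kmeans(blob_points, k=2)
print(vocabulary)  # [[0.5, 0.5], [10.5, 10.5]]
```

k-means only finds a local optimum that depends on the starting centres, so in practice it is usually run several times with different initialisations and values of k.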



(1) Naïve Bayes classification

Considering visual categorization, assume we have a set of labeled images I = {I_i} and a vocabulary V = {v_t} of representative keypoints (i.e. cluster centers). Each descriptor extracted from an image is labeled with the keypoint to which it lies closest in feature space. We count the number N(t, i) of times keypoint v_t occurs in image I_i.

(Note: this builds the features used for classification. Each descriptor is represented by its nearest keypoint in feature space, and the occurrences of each keypoint in image I_i are then counted.)

$P(C_j \mid I_i) \propto P(C_j)\, P(I_i \mid C_j)$
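
The rule above can be sketched in code. The notes give only the proportionality, so two things are assumed here: the likelihood factorises over keypoints as P(I_i | C_j) = prod_t P(v_t | C_j)^N(t, i), which is the usual naive independence assumption, and P(v_t | C_j) is estimated with Laplace (add-one) smoothing. Function names and toy bags are hypothetical.

```python
import math

def train_naive_bayes(bags, labels, num_classes):
    """Estimate log P(C_j) and Laplace-smoothed log P(v_t | C_j) from bags.

    Assumes every class has at least one training image.
    """
    vocab_size = len(bags[0])
    log_prior, log_word = [], []
    for c in range(num_classes):
        class_bags = [bag for bag, y in zip(bags, labels) if y == c]
        log_prior.append(math.log(len(class_bags) / len(bags)))
        totals = [sum(bag[t] for bag in class_bags) for t in range(vocab_size)]
        denom = vocab_size + sum(totals)  # add-one smoothing (assumption)
        log_word.append([math.log((1 + n) / denom) for n in totals])
    return log_prior, log_word

def classify(bag, log_prior, log_word):
    """Return argmax_j of log P(C_j) + sum_t N(t, i) * log P(v_t | C_j)."""
    scores = [lp + sum(n * lw for n, lw in zip(bag, lws))
              for lp, lws in zip(log_prior, log_word)]
    return max(range(len(scores)), key=scores.__getitem__)

# Toy bags over a 3-keypoint vocabulary, two images per class.
nb_bags = [[5, 1, 0], [4, 0, 1], [0, 1, 6], [1, 0, 5]]
nb_labels = [0, 0, 1, 1]
nb_prior, nb_word = train_naive_bayes(nb_bags, nb_labels, num_classes=2)
print(classify([3, 0, 0], nb_prior, nb_word))  # 0
print(classify([0, 0, 4], nb_prior, nb_word))  # 1
```

Working in log space avoids numerical underflow when the product runs over many keypoint occurrences.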

(2) SVM classification


In order to apply the SVM to multi-class problems we take the one-against-all approach. Given an m-class problem, we train m SVMs, each of which distinguishes images from some category i from images from all the other m-1 categories j not equal to i. Given a query image, we assign it to the class with the largest SVM output.
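
A minimal sketch of the one-against-all scheme follows. The binary learner is an assumption: a linear SVM trained by plain subgradient descent on the L2-regularised hinge loss, with a fixed learning rate and epoch count chosen for this toy example. The paper's actual SVM solver and kernels are not reproduced, and all names and data are hypothetical.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def train_linear_svm(xs, ys, epochs=300, lr=0.01, reg=0.01):
    """Binary linear SVM via subgradient descent on the regularised hinge loss.

    ys must contain +1 / -1. Returns (weights, bias).
    """
    w = [0.0] * len(xs[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            violates_margin = y * (dot(w, x) + b) < 1
            w = [wi * (1 - lr * reg) for wi in w]  # step on the L2 penalty
            if violates_margin:                    # step on the hinge loss
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b

def train_one_vs_all(bags, labels, num_classes):
    """Train one binary SVM per class: class c (+1) against all others (-1)."""
    return [train_linear_svm(bags, [1 if y == c else -1 for y in labels])
            for c in range(num_classes)]

def predict(models, bag):
    """Assign the class whose SVM gives the largest output."""
    outputs = [dot(w, bag) + b for w, b in models]
    return max(range(len(outputs)), key=outputs.__getitem__)

# Toy bags over a 3-keypoint vocabulary, two images per class.
svm_bags = [[4, 0, 0], [5, 1, 0], [0, 4, 0], [1, 5, 0], [0, 0, 4], [0, 1, 5]]
svm_labels = [0, 0, 1, 1, 2, 2]
svm_models = train_one_vs_all(svm_bags, svm_labels, num_classes=3)
print([predict(svm_models, bag) for bag in svm_bags])
```

Because each class in this toy set is linearly separable from the rest, the predictions on the training bags should reproduce the labels. The raw outputs of different SVMs are compared directly, which is what "largest SVM output" implies.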



In the first experiment we explore the impact of the number of clusters on classifier accuracy and evaluate the performance of the Naïve Bayes classifier. We then explore the performance of the SVM on the same problem.

(1) Naïve Bayes: results are relatively good at k = 1000.

(2) SVM: the linear method gave the best performance (except in the case of cars, where a quadratic SVM gave better results).


Date: 2024-11-05 13:42:29
