Real-Time Human Detection using Relational Depth Similarity Features

Real-Time Human Detection using Relational Depth Similarity Features

本文是论文笔记。

转载请注明:http://blog.csdn.net/stdcoutzyx/article/details/40299191

本文使用TOF摄像头采集到的图像里有深度信息的特点,提出了一种新的深度相关的相似度特征(RDSF)来检测行人。同时,利用深度信息,可以判断人员遮挡的情况,将这种情况考虑进算法,可以同时提升Hog与RDSF的效果。

1 Ideas

论文使用可以采集深度信息的摄像头可以轻松的发现边缘与重叠部分,能否应用到人脸检测上?

使用深度信息只有当物体厚度较小时(如行人)才能轻松发现边缘,如果物体厚度较大,深度信息是否还有利于发现边缘,有什么算子可以帮助它?

2 Problems of Conventional methods

Conventional methods include features based on gradients like Hog, EOH, edgelets.

Problems:
  • Human occlusions
  • Complex backgrounds
  • Real-time processing because of the use of the raster scanning while varying the window scale

3 Extraction

3.1 Local features based on depth information

  • Divide depth image into local regions with cell of size 8*8 pixels
  • Compute depth histograms from the depth information
  • Normalize the histograms so that the total value of each depth histogram is 1

  • Compute the similarity of two depth histograms using equation below

3.2 Varied rectangular region sizes

  1. Control the window size to 64128 so it can be divided into 816 cells.
  2. There are 492 rectangular regions obtained by varying the cell units of the rectangular region from 11 to 88.
  3. Calculate the RDSF from combinations of the 492 rectangular regions, 492*491/2=120786 features.

Note:492=8×16+7×15+6×14+5×13+4×12+3×11+2×10+1×9

3.3 Faster depth histogram calculations by integral histograms

  1. 使用积分图来加快深度直方图的计算
  2. 深度以0.3米为单位,而TOF摄像头则是深度范围为0.3-7.5m,所以,共分为25个bin,所以共需25个积分图。

4 Human Detection

  1. Perform a raster scan of the detectin window in a 3D space
  2. Computes the RDSFs from the detection windows
  3. Judge whether there are occlusions in the calculated features
  4. Use Real AdaBoost to classify each detection window is of human or not human.

    流程如下图:

4.1 Raster scaning in 3D space

Convolutional human detection methods involve repeated raster scans while the scale of the detection window varied, so there are many windows do not match the dimisions of humans.

With the depth information, we can use fixed window size with different depth to detect humans with different scales. Process can be seen below:

Window with different depth can projected to the 2d image using a projection matrix which is the equation 7 in the paper.

4.2 Classification

Using Real Adaboost algorithm to classify the extracted features.

Adaboost algorithm can ensemble a number of weak classifiers to build a strong classifer.

H(x) is the final strong classifier, and h(x) is a weak classifier.

4.3 Classification adjusted for occlusions

Depth information is useful in a confusing scene with a number of people overlapping.Combine the overlapping information into the classifier in a simple way. Process is below:

  • Define occlusion: any object region that is closer to the camera than the detection window
  • Extraction of occlusion regions:

  • Calculate the proportion of occlusion regions:

  • Combine the rate OR into the Adaboost algorithm:

5 Mean-shift clustering

Mean-shift clustering is a method that clustering the detect windows which detect a same object into one window. In image space, detection window could be erroneously integrated if humans overlap in them. But in 3D space, this problem can be solved easily.

6 Experiments

There are two expriments:
  • Comparison of three feature extraction methods:

    • HOG
    • RDFS
    • HOG+RDFS
  • Comparison of occlusion and non-occlusion adjustment feature extraction methods:
    • HOG without occlusion adjustment
    • HOG with occlusion adjustment
    • RDFS without occlusion adjustment
    • RDFS with occlusion adjustment
时间: 2024-08-04 11:10:28

Real-Time Human Detection using Relational Depth Similarity Features的相关文章

A Novel Human Detection Approach Based on Depth Map via Kinect

In this paper, a new method of human detection based on depth map from 3D sensor Kinect is proposed. First, the pixel filtering and context filtering are employed to roughly repair defects on the depth map due to information inaccuracy captured by Ki

[2005CVPR]Histograms of Oriented Gradients for Human Detection

HOG这种方法跟边缘方向直方图,尺度不变特征变换(SIFT)以及形状上下文方法(shape contexts)有很多相似之处,但与它们的不同点是:HOG描述器是在一个网格密集的大小统一的细胞单元上计算,而且为了提高性能,还采用了重叠的局部对比度归一化.HOG方法是在图像的局部细胞单元上操作,所以它对图像几何和光学的形变都能保持很好的不变性. 算法步骤step1:Gamma/Colour Normalization 作者分别在灰度空间.RGB色彩空间和LAB色彩空间上对图像进行色彩和 伽马归一化,

Histograms of Oriented Gradients for Human Detection

Histograms of Oriented Gradients for Human Detection 本文是论文笔记. 转载注明:http://blog.csdn.net/stdcoutzyx/article/details/40299383 Hog算子进行行人检测,是05年在CVPR上发表的经典文章.使用Hog算子提取特征,然后使用SVM来进行分类. 1. Definition Hog Descriptor:locally normalized histogram of gradient

Histogram of Oriented Gradients for Human Detection 翻译

用于人体检测的方向梯度直方图 Navneet Dalal,Bill Triggs 摘要 我们研究了视觉目标检测的特征集问题,并用线性SVM方法进行人体检测来测试,通过与当前的基于边缘和梯度的描述子进行实验对比,得出方向梯度直方图(Histograms of Oriented Gradient,HOG)描述子在行人检测方面表现更加突出.我们研究了计算过程中每一阶段的影响,得出小尺度梯度(fine-scale gradients).精细方向采样(fine orientation binning).粗

行人检测(Pedestrian Detection)资源整理

一.论文 综述类的文章: [1] D. Geronimo, and A. M.Lopez. Vision-based Pedestrian Protection Systems for Intelligent Vehicles, BOOK, 2014. [2] P.Dollar, C. Wojek,B. Schiele, et al. Pedestrian detection: an evaluation of the state of the art [J].IEEE Transactions

Spatio-temporal feature extraction and representation for RGB-D human action recognition

propose a novel and effective framework to largely improve the performance of human action recognition using both the RGB videos and depth maps. The key contribution is the proposition of the sparse coding-based temporal pyramid matching approach (Sc

ECCV 2014 Results (16 Jun, 2014) 结果已出

Accepted Papers     Title Primary Subject Area ID 3D computer vision 93 UPnP: An optimal O(n) solution to the absolute pose problem with universal applicability 128 Video Registration to SfM Models 168 Image-based 4-d Modeling Using 3-d Change Detect

总结一些机器视觉库

通用库/General Library OpenCV   无需多言. RAVL  Recognition And Vision Library. 线程安全.强大的IO机制.包含AAM. CImg  很酷的一个图像处理包.整个库只有一个头文件.包含一个基于PDE的光流算法. 图像,视频IO/Image, Video IO FreeImage DevIL ImageMagick FFMPEG VideoInput portVideo AR相关/Augmented Reality ARToolKit 

转载:2013计算机视觉代码合集

转载,原文地址http://blog.csdn.net/daoqinglin/article/details/23607079 -------------------------------------------------------------------------- 来源: http://www.yuanyong.org/cv/cv-code-one.html http://www.yuanyong.org/cv/cv-code-two.html http://www.yuanyong