[CVPR2015] Is object localization for free? – Weakly-supervised learning with convolutional neural networks: paper notes


Highlights

A good title gives readers a reason to start reading.
The localization approach, global max pooling over sliding windows, is worth borrowing.

Method

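The goal of the paper is to design a weakly supervised classification network; note that its main aim is to improve classification. Since the paper dates from 2015, the method is fairly simple and basic.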

The authors make the following three modifications to a classification network.

Treat the fully connected layers as convolutions, which allows us to deal with nearly arbitrary-sized images as input.

The aim is to apply the network to bigger images in a sliding window manner thus extending its output to n×m× K, where n and m denote the number of sliding window positions in the x- and y- direction in the image, respectively.
3×h×w → convs → K×m×n (K: number of classes)
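The first modification can be illustrated with a minimal NumPy sketch. It assumes a fully connected layer that was trained on fixed-size h×w feature windows and slides it over a larger feature map with stride 1; the function name `fc_as_conv` and all shapes are illustrative assumptions, not taken from the paper. Reshaping the FC weight matrix into a convolution kernel yields one K-dimensional score vector per window position.

```python
import numpy as np

def fc_as_conv(features, fc_weight, fc_bias, window):
    """Apply a fully connected layer as a convolution over a feature map.

    features : (C, H, W) feature map produced by the conv layers
    fc_weight: (K, C*h*w) weights of an FC layer trained on h x w windows
    fc_bias  : (K,) bias of that layer
    window   : (h, w) spatial size the FC layer expects
    returns  : (K, H-h+1, W-w+1) score map, one K-vector per window position
    """
    C, H, W = features.shape
    h, w = window
    K = fc_weight.shape[0]
    # The same weights, viewed as K convolution kernels of shape (C, h, w).
    kernel = fc_weight.reshape(K, C, h, w)
    out = np.empty((K, H - h + 1, W - w + 1))
    for i in range(H - h + 1):
        for j in range(W - w + 1):
            patch = features[:, i:i + h, j:j + w]
            # Contract over (C, h, w): identical to flattening the patch
            # and multiplying by the original FC weight matrix.
            out[:, i, j] = np.tensordot(kernel, patch, axes=3) + fc_bias
    return out
```

In a real network the effective stride of the sliding window is set by the pooling and stride of the earlier layers; stride 1 here only keeps the sketch short.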

Explicitly search for the highest scoring object position in the image by adding a single global max-pooling layer at the output.

K×m×n → K×1×1
The max-pooling operation hypothesizes the location of the object in the image at the position with the maximum score.
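The pooling step can be sketched in a few lines of NumPy. This assumes the score map is stored as (K, rows, cols); the function name `global_max_pool` is illustrative. The max gives the per-class image score, and the argmax gives the hypothesized object position on the score-map grid.

```python
import numpy as np

def global_max_pool(score_map):
    """Reduce a (K, rows, cols) score map to per-class scores and positions.

    returns: scores of shape (K,) and locations of shape (K, 2),
             where each location is (row, col) on the score-map grid.
    """
    K = score_map.shape[0]
    flat = score_map.reshape(K, -1)
    best = flat.argmax(axis=1)                 # flat index of the max per class
    scores = flat[np.arange(K), best]          # K x 1 x 1 output, squeezed
    rows, cols = np.unravel_index(best, score_map.shape[1:])
    return scores, np.stack([rows, cols], axis=1)
```

Mapping a grid position back to image pixels depends on the network's stride and receptive field, which this sketch leaves out.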

Use a cost function that can explicitly model multiple objects present in the image.

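Because an image may contain many objects, a multi-class classification loss is not suitable. The authors treat the task as multiple binary classification problems; the loss function and the classification scores are as follows.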
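The formula itself is not reproduced in these notes. The sketch below assumes the usual formulation of K independent binary problems: labels y_k in {-1, +1}, a per-class logistic loss log(1 + exp(-y_k f_k(x))) summed over classes, and a sigmoid that turns each score into a per-class probability. Function names are illustrative.

```python
import numpy as np

def multilabel_logistic_loss(scores, labels):
    """Sum of K binary logistic losses.

    scores: (K,) image-level scores f_k(x) after global max pooling
    labels: (K,) entries in {-1, +1}; +1 means class k is present
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=float)
    # logaddexp(0, z) = log(1 + exp(z)), computed without overflow.
    return np.logaddexp(0.0, -labels * scores).sum()

def class_probabilities(scores):
    """Independent per-class probabilities via a sigmoid (not a softmax)."""
    return 1.0 / (1.0 + np.exp(-np.asarray(scores, dtype=float)))
```

Because each class has its own sigmoid, several classes can have high probability in the same image, which a softmax would not allow.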


training

multi-scale test

Experiments

Classification

mAP on VOC 2012 test: +3.1% compared with [56]
mAP on VOC 2012 test: +7.6% compared with K×1×1 output and single-scale training
mAP on VOC: +2.6% compared with RCNN
mAP on COCO: 62.8%

Localisation

Metric: if the maximal response across scales falls within the ground truth bounding box of an object of the same class within 18 pixels tolerance, we label the predicted location as correct. If not, then we count the response as a false positive (it hit the background), and we also increment the false negative count (no object was found).
metric on VOC 2012 val: -0.3% compared with RCNN
mAP on COCO 41.2%
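The localization metric described above can be sketched as follows. The assumptions are mine: boxes are given as (x_min, y_min, x_max, y_max) in pixels, and the 18-pixel tolerance is read as expanding each ground-truth box by 18 pixels on every side. Function names are illustrative.

```python
def hits_box(point, box, tolerance=18):
    """True if the point lies inside the box expanded by `tolerance` pixels."""
    x, y = point
    x_min, y_min, x_max, y_max = box
    return (x_min - tolerance <= x <= x_max + tolerance and
            y_min - tolerance <= y <= y_max + tolerance)

def score_prediction(point, gt_boxes, tolerance=18):
    """Score one predicted location against same-class ground-truth boxes.

    returns: (true_positives, false_positives, false_negatives)
    """
    if any(hits_box(point, box, tolerance) for box in gt_boxes):
        return 1, 0, 0
    # A miss counts twice: the response hit the background (false positive)
    # and no object was found (false negative).
    return 0, 1, 1
```

For example, `score_prediction((10, 10), [(20, 20, 50, 50)])` counts as correct, since the point is within 18 pixels of the box.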

Weaknesses

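The metric used to evaluate localization is not an established standard.
Would replacing max pooling with average pooling work better when an image contains multiple instances?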

Original post: https://www.cnblogs.com/Xiaoyan-Li/p/8710909.html

Date: 2024-10-09 17:15:38
