• No results found

Part Based Object and People Detection Cognitive Science Summerschool, Aug 27, 2oo9 Part 1: Introduction & Overview

N/A
N/A
Protected

Academic year: 2021

Share "Part Based Object and People Detection Cognitive Science Summerschool, Aug 27, 2oo9 Part 1: Introduction & Overview"

Copied!
25
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

Perceptual and Sensory

Augmented Computing

Bernt Schiele

TU Darmstadt, Germany

http://www.mis.informatik.tu-darmstadt.de/

schiele@informatik.tu-darmstadt.de

Part Based Object and People Detection

Cognitive Science Summerschool, Aug 27, 2oo9

Part 1: Introduction & Overview

(2)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 2

(Grayscale) Image

‘Goals’ of Computer Vision

how can we recognize fruits

from an array of (gray-scale)

numbers?

how can we perceive depth

from an array of (gray-scale)

numbers?

computer vision =

the problem of

‘inverse graphics’ …?

‘Goals’ of Graphics

how can we generate an array of

(gray-scale) numbers that looks like

fruits?

how can we generate an array of

(gray-scale) numbers so that the

human observer perceives depth?

(3)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 3

Computer Vision & Object Recognition

is it more than inverse

graphics?

how do you recognize

the banana?

the glas?

the towel?

how can we make computers

(4)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 4

(5)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 5

(6)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 6

(7)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 7

(8)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 8

(9)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 9

Recognition: the Role of Context

(10)

Bernt Schiele - TU Darmstadt Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1

Recognition: the Role of Context

Antonio Torralba (MIT) & Rob Fergus (NYU)

(11)

Bernt Schiele - TU Darmstadt Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1

Recognition: the Role of Context

Antonio Torralba (MIT) & Rob Fergus (NYU)

(12)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 12

(13)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 13

(14)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 14

(15)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 15

Class of Models: Pictorial Structure

Fischler & Elschlager 1973

Model has two components

parts

(2D image fragments)

structure

(configuration of parts)

(16)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 16

Deformations

(17)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 17

(18)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 18

(19)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 19

Object Recognition:

Focus of today’s lecture

Different Types of Recognition Problems:

Object

Identification

recognize your apple,

your cup, your dog

Object

Classification

recognize any apple,

any cup, any dog

also called:

generic object recognition,

object categorization

, …

typical definition:

‘basic level category’

(20)

Bernt Schiele - TU Darmstadt Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1

Which Level is right for Object Classes?

Basic-Level Categories

the highest level at which category members have similar perceived shape

the highest level at which a single mental image can reflect the entire category

the highest level at which a person uses similar motor actions to interact with

category members

the level at which human subjects are usually fastest at identifying category

members

the first level named and understood by children

(while the definition of basic-level categories depends on culture there exist a

remarkable consistency across cultures...)

Most recent work in object recognition has focused on this problem

we will discuss several of the most successful methods in the lecture ;-)

(21)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 21

Object Recognition:

Focus of this Computer Vision class

Recognition and

‣ Segmentation

: separate pixels belonging to the foreground (object)

and the background

(22)

Bernt Schiele - TU Darmstadt

Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1 22

Object Recognition:

Focus of this Computer Vision class

Recognition and

‣ Localization

: position of the object

in the scene, pose estimate

(orientation, size/scale, 3D position)

(23)

Bernt Schiele - TU Darmstadt Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1

Localization: Example Video 1

(24)

Bernt Schiele - TU Darmstadt Part-Based Object and People Detection - Aug 27, 2oo9 - Part 1

Overview

Introduction (part 1)

why study computer vision in general

and object recognition in particular :)

Object Recognition Methods

Bag of Words Models

(

BoW

) (part 2)

Model: Histogram of local features

e.g. Interest Points (scale invariant)

Global Feature Models

+ Classifier (part 3)

e.g. HOG = Histogram of Oriented Gradients

– global object feature / description

e.g. SVM = Support Vector Machines

– discriminant classifier - widely used

Part-Based Object Models

(part 4)

e.g. Implicit Shape Model (ISM)

local parts & global constellation of parts

24

BoW: no spatial

relationships

e.g. HOG: fixed

spatial relationships

e.g. ISM: flexible

spatial relationships

(25)

Bernt Schiele | Part-Based Object and People Detection | Aug 27, 2oo9 |

Overview of lecture parts 3 & 4...

Global Feature Based Methods

for People Detection

(part 3)

A Performance Evaluation of Single and Multi-Feature People Detection

[Wojek,Schiele@DAGM-08]

Pedestrian Detection: A New Benchmark

[Dollar,Wojek,Perona,Schiele@CVPR-09]

Multi-Cue Onboard Pedestrian Detection

[Wojek,Walk,Schiele@CVPR-09]

Part-Based Model

for People & Object Detection

(part 4)

Detection by Tracking and Tracking by Detection

[Andriluka,Roth,Schiele@CVPR-08]

Pictorial Structures Revisited: People Detection and Articulated Pose

Estimation [Andriluka,Roth,Schiele@CVPR-09]

A Shape-Based Object Class Model for Knowledge Transfer

[Stark,Goesele,Schiele@ICCV-09]

Referenties

GERELATEERDE DOCUMENTEN

The best results at one false positive per image are obtained when using the boosted-cascade detector combined with our decision-tree model with step size 1 using the fine

arranged according to a generalised trend for the metathesis reactivity for the self- metathesis of linear alkenes (Fig. Generalised trend for the metathesis reactivity for

The reaction was modelled by using density functional theory (DFT) and second-order Moller-Plesset theory. In order for the olefin to be able to coordinate to the metal carbene one

skaappoot, beespoot, ens. Brabbeltaal is 'n ongebonde spreekwyse wat horn nie aan die taalreels steur nie-veral nie aan die manier waarop die ·w oorde uitgespreek

The main findings are: (1) ablation with the PVAC-Gold catheter is associated with higher incidence of cerebral lesions (23% vs. 6%) and in addition, in the PVAC-Gold group

‣ Multidimensional Receptive Field Histogram Approach [Schiele&Crowley’96-’00].. ‣ Bag of

Full body pose estimation: Comparison of body part detection rates and evaluation of different components of the model on the “Iterative Image Parsing” (IIP) dataset [15]

First, we consider the problem of determining the minimum number N d of non-overlapping unit disks that is required to block all rays emanating from a point p, where each disk has