Background subtraction techniques review

Agenda zThe problem zThe basic methods zRunning Gaussian average zMixture of Gaussians zKernel Density Estimators zMeanshift based estimation zCombined estimation and propagation The problem requirements Thebackground image is not fixed but mustadapt to: zIllumination changes • gradual • sudden (such as clouds) zMotion changes • camera oscillations • highfrequencies background objects (such as tree branches, sea waves, and similar) zChanges in the background geometry

Trang 1

Background subtraction techniques:

a review

Massimo Piccardi

Computer Vision Research Group (CVRG)

University of Technology, Sydney (UTS)

e-mail: massimo@it.uts.edu

The ARC Centre of Excellence for Autonomous Systems (CAS)

Faculty of Engineering, UTS, April 15, 2004

Trang 2

Agenda

Trang 3

The problem

camera, detecting all the foreground objects

foreground objects as the difference between the

current frame and an image of the scene’s static

background:

| framei – backgroundi | > Th

obtain the image of the scene’s static background?

Trang 4

The problem - requirements

The background image is not fixed but must adapt to:

Trang 5

The basic methods

Frame difference:

| framei – framei-1 | > Th

objects’ speed and frame rate

Trang 6

The basic methods (2)

Frame difference: an example

absolute difference the frame

threshold:

Trang 7

The basic methods (3)

2000; Cucchiara, 2003) of the previous n frames:

• rather fast, but very memory consuming: the memory

requirement is n * size(frame)

Bi + 1 = α * Fi + (1 - α ) * Bi

• α, the learning rate, is typically 0.05

• no more memory requirements

Trang 8

The basic methods – rationale

based on the pixel’s recent history

• just the previous n frames

• a weighted average where recent frames have higher weight

chronological average from the pixel’s history

(neighbouring) pixel locations

Trang 9

The basic methods - histograms

Trang 10

The basic methods - selectivity

foreground or background

background model?

in the background model

polluted by pixel logically not belonging to the

background scene

Trang 11

The basic methods – selectivity (2)

( ) x , y α F ( ) ( x , y 1 α ) ( ) B x , y

Bi+1 = t + − t if Ft (x,y) background

( ) x , y B ( ) x , y

Bi+1 = t if Ft (x,y) foreground

Trang 12

The basic methods - limitations

threshold

with multiple modal background distributions;

example:

Trang 13

Running Gaussian average

z Pfinder (Wren, Azarbayejani, Darrell, Pentland, 1997):

• fitting one Gaussian distribution ( µ , σ ) over the histogram: this gives the background PDF

• background PDF update: running average:

• In test | F - µ | > Th, Th can be chosen as kσ

• It does not cope with multimodal backgrounds

( ) t t

2 1

Trang 14

Mixture of Gaussians

z Mixture of K Gaussians (µi,σi ,ωi ) (Stauffer and Grimson, 1999)

background distributions; however:

(usually from 3 to 5)

Trang 15

Mixture of Gaussians (2)

z All weights ωi are updated (updated and/or normalised) at

every new frame

z At every new frame, some of the Gaussians “match” the

current value (those at a distance < 2.5 σi ): for them, µi, σi are updated by the running average

z The mixture of Gaussians actually models both the

foreground and the background: how to pick only the

distributions modeling the background?:

• all distributions are ranked according to their ωi / σi and the first ones chosen as “background”

Trang 17

Kernel Density Estimators

2000):

n most recent pixel values, each smoothed with a

Gaussian kernel (sample-point density estimator)

to compute the kernel values (mitigated by a LUT

approach)

Trang 18

Mean-shift based estimation

Piccardi, Jan, submitted 2004)

• a gradient-ascent method able to detect the modes of a multimodal distribution together with their covariance

matrix

• iterative, the step decreases towards convergence

• the mean shift vector:

x h

x x

g

h x

x g

x x

2

) ) ((

) )

((

) (

Trang 19

Mean-shift based estimation (2)

9.66 10.05 11.21 11.70: convergence

initial position: 9

Trang 20

Mean-shift based estimation (3)

• a standard implementation (iterative) is way too slow

• memory requirements: n * size(frame)

• computational optimisations

• using it only for detecting the background PDF modes at initialisation time; later, use something computationally lighter (mode propagation)

Trang 21

Combined estimation and propagation

• heuristic procedures are used for merging the existing

modes (the number of modes is not fixed a priori)

• faster than KDE, low memory requirements

∑

− +

= ( new _ mod e ) ( 1 )( existing _ mod es ) )

x (

Trang 22

Combined estimation and propagation - 2

(from: B Han, D Comaniciu, and L Davis, "Sequential kernel density approximation through mode propagation: applications to background modeling,“ Proc ACCV 2004)

Trang 23

Pentland, 2000)

eigenvector decomposition is a way to reduce the dimensionality of a space

compute the eigenbackgrounds

than a Mixture of Gaussians approach

Trang 24

Eigenbackgrounds – main steps

1. The n frames are re-arranged as the columns of a matrix, A

2. The covariance matrix, C = AA T, is computed

3. From C, the diagonal matrix of its eigenvalues, L, and the

eigenvector matrix, Φ , are computed

4. Only the first M eigenvectors (eigenbackgrounds) are

retained

5. Once a new image, I, is available, it is first projected in the

M eigenvectors sub-space and then reconstructed as I’

6. The difference I – I’ is computed: since the sub-space well

represents only the static parts of the scene, the outcome of this difference are the foreground objects

Trang 25

Spatial correlation?

correlation between neighboring pixels How can that

be exploited?

resulting foreground image

Harwood, Davis, 2000)

matrix

background detection based on the cooccurrence of

Trang 26

Methods reviewed:

Trang 27

Summary (2)

From the data available from the literature

eigenbackgrounds, SKDA, optimised mean-shift

eigenbackgrounds, SKDA

Trang 28

Summary (3)

significant benchmark is needed!

SKDA, mean-shift

certainly can offer good accuracy as well

average, median can provide acceptable accuracy

in specific applications

Trang 29

Main references

platforms,” Proc of 2001 Int Symp on Intell Multimedia, Video and Speech Processing, pp 158-161, 2000.

shadows in video streams”, IEEE Trans on Patt Anal and Machine Intell., vol 25, no 10, Oct 2003, pp 1337-1342.

Robust Automatic Traffic Scene Analysis in Real-Time,” in Proceedings of Int’l Conference on Pattern Recognition, 1994, pp 126–131.

Human Body,” IEEE Trans on Patt Anal and Machine Intell., vol 19, no 7, pp 780-785, 1997.

Proc of CVPR 1999, pp 246-252.

Trans on Patt Anal and Machine Intell., vol 22, no 8, pp 747-757, 2000.

Subtraction”, Proc of ICCV '99 FRAME-RATE Workshop, 1999.

Trang 30

Main references (2)

propagation: applications to background modeling,“ Proc ACCV - Asian Conf on Computer Vision, 2004.

Modeling Human Interactions,” IEEE Trans on Patt Anal and Machine Intell., vol 22, no 8,

pp 831-843, 2000.

image variations”, Proc of CVPR 2003, vol 2, pp 65-72.

Tiêu đề	Background Subtraction Techniques Review
Tác giả	Massimo Piccardi
Trường học	University of Technology, Sydney
Chuyên ngành	Computer Vision
Thể loại	review paper
Năm xuất bản	2004
Thành phố	Sydney

Định dạng
Số trang	30
Dung lượng	365,38 KB