A method for generating a density image of an observation zone over a given time interval, in which method a plurality of images of the observation zone is acquired, for each image acquired the following steps are carried out: a) detection of zones of pixels standing out from the fixed background of the image, b) detection of individuals, c) for each individual detected, determination of the elementary surface areas occupied by this individual, and d) incrementation of a level of intensity of the elementary surface areas thus determined in the density image.
|
1. Method for generating a density image of an observation zone through which individuals pass over a given time interval, thereby acquiring a plurality of images of the observation zone by means of a camera, the following steps being carried out for each acquired image:
extracting a fixed background from the image,
a step a) of detecting zones of pixels, “blobs”, standing out from the fixed background of the image,
a step b) of detecting the individuals, in which:
b1) for each blob, several hypotheses for possible positions are generated within a three-dimensional model of the observation zone by using the features of the camera and a standard size of the individual;
and for each possible position, a head of an individual is detected by applying a three-dimensional model of an individual onto the corresponding blob,
b2) counting the individuals for each blob,
c) for each detected individual, determining elementary areas occupied by this individual, and
d) a density image is generated in which the thus-determined elementary areas are incremented by a level of intensity.
2. Method according to
3. Method according to
application of a Canny filter to the image with respect to the zones of pixels so as to generate an image of contours of the zones of pixels,
applying a distance transform so as to prepare a chart of distances,
performing a convolution between the distance chart and a model of a head (template) so as to obtain a convolution chart, and
applying a watershed algorithm so as to detect heads of individuals.
4. Method according to
5. Method according to
a correspondence is established between pixels of the image and points of the corresponding space,
it is assumed that the pixels of a blob correspond to a visible point of an individual and potentially to his feet or head, and
these pixels are projected onto the ground planes and planes placed at standard head height, the common projection zone of the feet and of the head being identified as a possible position.
6. Method according to
7. Method according to
8. Method according to
minimizing the number of individuals,
maximizing the number of pixels different from the background and belonging to the silhouettes,
minimizing the number of pixels different from the background and not belonging to the ellipses,
minimizing the number of pixels belonging to the background and to the ellipses, and
minimizing the distance of the contours of a 2D/3D model.
9. Method according to
10. Method according to
11. Method according to
12. Method according to
13. Method according to
14. Method according to
15. Method according to
|
The present invention relates to a method for generating a density image of an observation zone over a given time interval. It can be used with particular benefit in, but is not limited to, the field of behavioural analysis by computer vision. More precisely, the present invention can be applied to systems in which at least one camera is available, arranged at height close to an object of interest which can be an advertising screen or a merchandising product in a store for example. This camera is arranged so as to film any individual passing close to the object of interest. The video stream from the camera is processed, in particular, in order to carry out behavioural analyses.
In the field of behavioural analysis, there are many algorithms relating to facial detection or monitoring the direction of view, making it possible to estimate the interest shown by the individuals passing close to the objects of interest.
Document U.S. Pat. No. 6,967,674 is known, describing a method for detecting the behaviour of persons passing through a room.
Document US 2003/0039379 is also known, describing a method for counting, in an image, the number of persons showing interest in an object. This interest can be characterized by the time spent by each person on viewing the object or also the expression on the face of these persons.
A purpose of the present invention is a novel method making it possible to express in a novel manner the attraction of an object of interest in an observation zone.
The above-mentioned purpose is achieved with a method for generating a density image of an observation zone through which individuals pass over a given time interval, in which method a plurality of images of the observation zone is acquired by means of a camera, the following steps being carried out for each image acquired:
extracting a fixed background from the image,
a step a) of detecting “blob” zones of pixels standing out from the fixed background of the image, this zone of pixels corresponding in fact to the moving elements, in particular the individuals,
a step b) of detecting the individuals, in which:
b1) for each blob, several hypotheses for possible positions are generated within a three-dimensional model of the observation zone, using the characteristics of the camera and a standard size of an individual;
and for each possible position, a head of an individual is detected by applying a three-dimensional model of an individual onto the corresponding blob,
b2) counting the individuals for each blob; in fact from among the possible position hypotheses, the most probable position is determined vis-à-vis the dimensions of the blob in question,
c) for each detected individual, determining the elementary areas occupied by this individual, and
d) a density image is generated in which consideration is given to a predetermined density image on which the thus-determined elementary areas are incremented by a level of intensity. This increment can be an increase in the level of grey at the places where the individuals were detected. This can also be the colouring of said places. The density image is preferably an image in which the ground plane of the observation zone is displayed.
With the system of the invention, a heat chart is prepared which can be a colour image: red zones corresponding to high occupation rates, and blue zones to low occupation rates, the variation between the colours being made continuously.
In fact, each elementary area of the observation zone has a level of intensity proportional to the occupation rate of this elementary area by individuals. The occupation rate being the total duration during which this elementary area has been occupied by any individual.
The invention can also be defined as follows: a method for generating a density image of an observation zone over a given time interval, in which a plurality of images of the observation zone is acquired, comprising:
According to an advantageous feature of the invention, the step b1) of detecting a head of an individual is carried out by contour analysis.
In particular, the step of detection of heads comprises the following steps:
applying a Canny filter to the image with respect to the zones of pixels so as to generate an image of contours of the zones of pixels,
applying a distance transform so as to prepare a chart of distances,
performing a convolution between the distance chart and a model of a head (template) so as to obtain a convolution chart, and
applying a watershed algorithm so as to detect heads of individuals.
The use of a watershed algorithm for detecting heads of individuals makes it possible to obtain the most probable positions of heads in the image without relying on arbitrary values (in particular threshold values) which limits the risks of non-detection and guarantees the uniqueness of a local solution.
According to a variant of the invention, the step b1) of detecting the head of an individual is carried out by determining coherences between the head and foot positions. To this end:
a correspondence is established between pixels of the image and points of the corresponding space,
it is assumed that the pixels of a blob correspond to a visible point of an individual and potentially to his feet or head, and
these pixels are projected onto the ground planes and planes placed at standard head height, the common projection zone of the feet and of the head being identified as a possible position.
Advantageously, the counting consists of simultaneously identifying the number of individuals present and their respective position. In other words, a segmentation of groups is carried out. Indeed, in most circumstances, the images originate from a camera sometimes having an oblique viewing angle with respect to the ground, thus causing many situations of occlusion between individuals, it is then difficult to use an individual-by-individual search. For this reason, the present invention thus proposes a solution seeking to detect and count a group (simultaneous identification of the number of persons present and their respective position), and not to detect each individual separately.
According to a feature of the invention, the identification of the individuals is carried out from a model of appearance having three ellipses imitating an individual.
This appearance model is optimized according to the following criteria:
Then, the optimization is carried out by means of an iterative gradient descent algorithm.
According to an advantageous feature of the invention, the convergence of the iterative algorithm is distributed over successive images. The convergence of the iterative algorithm towards an optimal count of the individuals can take a considerable time. With the present invention, an approximate partial solution for an image at
the time of passing to the following image is considered adequate; this approximate solution serving as a starting point for the following image.
According to an advantageous variant of the invention, step b) of detecting individuals can comprise the following steps:
based on a 3D model of the observation zone, a 3D model of an individual, and characteristics linked to the acquisition device of said plurality of observation images, a perspective projection is carried out in the current image of each elementary area of the observation zone so as to obtain a 2D projection of the models of an individual;
the correspondence of the 2D models on each zone of pixels is calculated so as to obtain a probability of presence density chart; and
maxima are sought by using the watershed algorithm so as to determine the detected individuals.
Preferably, the characteristics of the acquisition device comprise the field of view, the positioning and the distortion.
Generally, according to the invention, the extraction of the fixed background of the image is obtained by means of the values of the pixels of the image over time.
According to the invention, in order to take account of the low variations in the values of the pixels, the value of each background pixel can be modelled by a distribution of probabilities. This distribution can be represented by a Gaussian distribution, or more moderately by an average value and two values min and max.
Moreover, in order to minimize as far as possible the influence of objects that do not belong to the background, before averaging the values of the pixels of the image, it is possible to carry out an instantaneous detection of movement by subtracting successive images and applying a threshold in order to obtain a mask corresponding to the pixels which will not be averaged.
Other advantages and characteristics of the invention will become apparent on examining the detailed description of one embodiment, which is in no way limitative, and the attached drawings, in which:
The density chart can also be displayed in the form of a thermal image, capable of being shown in perspective in a computer-generated image or as a plan view superimposed on the observed surface plane.
In step a1, the background of the acquired image is extracted so as to obtain a fixed-background model. The background extraction algorithm can be of the iterative type, in particular over several images or a video image stream. The simplest approach for extracting the background consists of averaging the values of the pixels over time. The less the contribution to the average of a moving object is, the more rapidly it moves compared with the cadence of the images.
More precisely, the extraction of the background according to the invention leads to a modelling of grey levels, gradients and average gradient orientations and their standard deviations. By way of example,
In step a2, a zone detection is carried out by comparing the image and the background, and by extracting zones of pixels, called “blobs”, that do not belong to the background.
In step b1, a detection of the set of the possible positions of persons is carried out. This step can be performed in two ways:
The result of step b1 is a set of hypotheses for head positions or for head/foot pairing.
Then, in the step b2, counting of the individuals is carried out. More precisely, an identification of the number of individuals in each blob and an estimation of their position is carried out. In fact the false hypotheses are eliminated in order to identify the actual positions.
Turning now to the head detection method.
Then, a distance transform is carried out. This is a calculation of a chart of distance between contours. This is a quick and stable calculation of the correlation between the head template and the contours. The output of this step is a chart of distances.
Next, a convolution is carried out between the chart of distances and the template in order to obtain a convolution chart. The convolution comprises a calculation of the correlation between the head template and the contours.
Finally, a watershed algorithm is applied in order to localize and quantify correlation maxima and determine probability maxima for the presence of heads in the acquired image. At the output, a hypothesis on the position of the heads is obtained.
Turning now to the head-foot coherence method.
It is assumed that the pixels of a blob correspond to a visible point of a person to be detected, and potentially to his feet or head. The projection of these pixels onto the ground planes and planes placed at a standard head height thus delimit the zones where persons would be likely to be found. The conjunction of these projections, shown in
The convergence towards the optimal solution is carried out by iterative methods of the gradient descent type. The evaluation of this solution on the basis of the superimposition of the appearance models allows natural management of the cases of occlusion.
A feature of the detection according to the invention is performance within the three-dimensional reference point of the observed area, the cameras being calibrated with precision in this environment; the calibration consists of estimating with precision the position and orientation of the camera, as well as its intrinsic geometrical properties such a focal length, field, distortions, etc.
Thus the heat chart produced can be directly superimposed onto the plane of the analyzed surface, and only the places corresponding to the actual position of the individuals detected are counted (zone situated at the level of their feet), and not the set of positions on the ground corresponding to the pixels of the blobs.
In
This variant therefore makes it possible to manage the perspective of the scene, as well as the camera defects such as distortion, this parameter being taken into account in the calculation of the projection of the parallelepipeds.
This variant also makes it possible to take account of the masking by objects: if it is known that a zone corresponds to an object behind which persons would be likely to be partially masked, the estimation of the probability of presence is corrected as a function of the number of pixels of the masking object situated in the search window.
On the image 11 of
In order to avoid multiple detections of a single individual (one of the detections corresponding to his trunk, and interpreted as a person further away), the detections obtained in the previous step are classified and considered in increasing order of the distance separating them from the monitoring camera. The detections are validated from the closest to the furthest away.
For a detection to be validated, it is provided that the parallelepiped is filled by pixels of a blob above a predetermined rate. Each validated detection deletes from the blob the pixels contained in its parallelepiped, in order to avoid multiple detections.
The validated detections inscribe their position on the density image by adding, at their relative position, a Gaussian distribution to a cumulative table.
The present invention also relates to a software application or computer program comprising instructions for executing the defined steps in a method according to the invention.
The invention also relates to a means of data storage such as a CD-ROM, a USB stick, a flash memory, etc. storing an application program code which when executed by a digital processor provides functionalities such as those defined in any method according to the present invention.
Of course, the invention is not limited to the examples which have just been described and numerous adjustments can be made to these examples without exceeding the scope of the invention.
Zeller, Alexandre, Revue, Alexandre
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
6771818, | Apr 04 2000 | Microsoft Technology Licensing, LLC | System and process for identifying and locating people or objects in a scene by selectively clustering three-dimensional regions |
6967674, | Sep 06 1999 | Displaycom GmbH | Method and device for detecting and analyzing the reception behavior of people |
7930204, | Jul 25 2006 | NYTELL SOFTWARE LLC | Method and system for narrowcasting based on automatic analysis of customer behavior in a retail store |
20030039379, | |||
20080166045, | |||
20090237499, | |||
20090296989, | |||
20100013935, | |||
WO2006097680, | |||
WO2008008505, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Feb 12 2009 | CLIRIS | (assignment on the face of the patent) | / | |||
Feb 12 2009 | Alexandre, Zeller | (assignment on the face of the patent) | / | |||
Mar 20 2011 | ZELLER, ALEXANDRE | CLIRIS | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 026182 | /0582 | |
Mar 20 2011 | REVUE, ALEXANDRE | CLIRIS | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 026182 | /0582 | |
Mar 20 2011 | ZELLER, ALEXANDRE | ALEXANDRE ZELLER | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 026182 | /0582 | |
Mar 20 2011 | REVUE, ALEXANDRE | ALEXANDRE ZELLER | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 026182 | /0582 |
Date | Maintenance Fee Events |
May 09 2017 | M2551: Payment of Maintenance Fee, 4th Yr, Small Entity. |
May 18 2021 | M2552: Payment of Maintenance Fee, 8th Yr, Small Entity. |
Date | Maintenance Schedule |
Nov 19 2016 | 4 years fee payment window open |
May 19 2017 | 6 months grace period start (w surcharge) |
Nov 19 2017 | patent expiry (for year 4) |
Nov 19 2019 | 2 years to revive unintentionally abandoned end. (for year 4) |
Nov 19 2020 | 8 years fee payment window open |
May 19 2021 | 6 months grace period start (w surcharge) |
Nov 19 2021 | patent expiry (for year 8) |
Nov 19 2023 | 2 years to revive unintentionally abandoned end. (for year 8) |
Nov 19 2024 | 12 years fee payment window open |
May 19 2025 | 6 months grace period start (w surcharge) |
Nov 19 2025 | patent expiry (for year 12) |
Nov 19 2027 | 2 years to revive unintentionally abandoned end. (for year 12) |