systems and methods are disclosed for locating a retroreflective object in a digital image and/or identifying a feature of the retroreflective object in the digital image. In certain environmental conditions, e.g. on a sunny day, or when the retroreflective material is damaged or soiled, it may be more challenging to locate the retroreflective object in the digital image and/or to identify a feature of the object in the digital image. The systems and methods disclosed herein may be particularly suited for object location and/or feature identification in situations in which there is a strong source of ambient light (e.g. on a sunny day) and/or when the retroreflective material on the object is damaged or soiled.
|
15. A method comprising:
activating an illuminator to emit light, and acquiring a first image from a first image acquisition device with light emitted from the illuminator;
acquiring a second image from a second image acquisition device without or with less of the light emitted from the illuminator;
generating a compound image from the first image from the first image acquisition device and the second image from the second image acquisition device;
aligning content of the first image and the second image to at least account for different physical locations of the first image acquisition device and the second image acquisition device prior to generating the compound image; and
locating an object in the compound image.
1. A system comprising:
a first image acquisition device to acquire at least a first image with light emitted from an illuminator;
a second image acquisition device to acquire at least a second image without or with less of the light emitted from the illuminator;
a memory to store at least the first image and the second image;
a processor to:
generate a compound image from the first image from the first image acquisition device and the second image from the second image acquisition device;
align content of the first image and the second image to at least account for different physical locations of the first image acquisition device and the second image acquisition device prior to generating the compound image; and
locate an object in the compound image.
25. A non-transitory processor-readable medium having stored thereon instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising:
receiving, from a first image acquisition device, a first image that was acquired by the first image acquisition device with light emitted from an illuminator;
receiving, from a second image acquisition device, a second image that was acquired by the second image acquisition device without or with less of the light emitted from the illuminator;
generating a compound image from the first image from the first image acquisition device and the second image from the second image acquisition device;
aligning content of the first image and the second image to at least account for different physical locations of the first image acquisition device and the second image acquisition device prior to generating the compound image; and
locating an object in the compound image.
3. The system of
4. The system of
5. The system of
6. The system of
7. The system of
8. The system of
9. The system of
10. The system of
11. The system of
12. The system of
14. The system of
17. The method of
18. The method of
19. The method of
20. The method of
21. The method of
22. The method of
23. The method of
24. The method of
26. The non-transitory processor-readable medium of
27. The non-transitory processor-readable medium of
28. The non-transitory processor-readable medium of
29. The non-transitory processor-readable medium of
|
This application is a continuation of U.S. patent application Ser. No. 15/891,747, titled “Systems and Methods for Locating a Retroreflective Object in a Digital Image” and filed on Feb. 8, 2018. The contents of the aforementioned application are incorporated by reference herein.
The following relates to locating a retroreflective object in a digital image and/or identifying a feature of the retroreflective object in the digital image.
A retroreflective material is a material that reflects light back to its source with a minimum of scattering. Retroreflective material may be used to increase the conspicuity of items such as traffic signs, license plates, parts of vehicles, obstructions, clothing etc., particularly at nighttime.
A camera may take a digital photograph of a scene to acquire a digital image. The scene may include a retroreflective object, i.e. an object having a retroreflective material. A processor may be configured to locate the object in the digital image. The processor may further be configured to identify features of the object in the digital image. The retroreflective properties of the object may cause the object to be more conspicuous in the digital image compared to the surrounding scene in the digital image. This may assist the processor in locating the object and/or identifying features of the object in the digital image.
For example, vehicle license plates are typically retroreflective. An automatic license plate recognition (ALPR) system may take a digital photograph of a vehicle to acquire a digital image, and then search for the license plate in the digital image. The ALPR system may further process the located license plate in the digital image in order to determine the symbols of the vehicle's license registration identifier. The retroreflective properties of the license plate may allow for the ALPR system to more easily locate the license plate in the digital image and/or more easily determine the vehicle's license registration identifier on the license plate.
As another example, street signs are typically retroreflective. An automatic recognition system on an autonomous vehicle may take a digital photograph of the scene in front of the vehicle to acquire a digital image, and then search for a traffic sign in the digital image. The system may further process a located traffic sign in the digital image in order to determine the information being conveyed by that traffic sign. The retroreflective properties of the traffic sign may allow for the system to more easily locate the traffic sign in the digital image and/or more easily determine the information being conveyed by the traffic sign.
In certain environmental conditions, e.g. on a sunny day, or when the retroreflective material is damaged or soiled, it may be more challenging to locate a retroreflective object in a digital image and/or to identify a feature of the retroreflective object in the digital image.
Systems and methods are disclosed herein for locating a retroreflective object in a digital image and/or identifying a feature of the retroreflective object in the digital image. The systems and methods may be particularly suited for situations in which there is strong source of ambient light (e.g. on a sunny day) and/or when the retroreflective material on the object is damaged or soiled.
According to one embodiment, there is provided a system that includes at least one image acquisition device. The at least one image acquisition device acquires a plurality of digital images. The plurality of digital images includes a first image acquired with light emitted from an illuminator, and a second image acquired without or with less of the light emitted from the illuminator. The system further includes a memory to store the plurality of digital images. The system further includes a processor to generate a compound image from the first image and the second image. The processor may also be configured to align content of the first image and the second image prior to generating the compound image. The processor may also be configured to locate an object in the compound image.
In another embodiment, there is provided a system including a memory to store a plurality of digital images. The plurality of digital images includes a first image that was acquired with light emitted from an illuminator, and a second image that was acquired without or with less of the light emitted from the illuminator. The system also includes a processor to generate a compound image from the first image and the second image. The content of the first image and the second image may first be aligned before generating the compound image. The processor may also be configured to locate an object in the compound image.
In another embodiment, there is provided a method including activating an illuminator to emit light, and acquiring a first image with light emitted from the illuminator. The method may further include acquiring a second image without or with less of the light emitted from the illuminator. The method may further include generating a compound image from the first image and the second image. The method may further include aligning content of the first image and the second image prior to generating the compound image. The method may further include locating an object in the compound image.
In another embodiment, there is provided a method including retrieving from memory a plurality of digital images. The plurality of digital images includes a first image that was acquired with light emitted from an illuminator, and a second image that was acquired without or with less of the light emitted from the illuminator. The method may further include generating a compound image from the first image and the second image. The method may further include aligning content of the first image and the second image prior to generating the compound image. The method may further include locating an object in the compound image.
In another embodiment, there is provided a system for recognizing a vehicle license plate. The system includes a memory to store a plurality of digital images. The plurality of digital images includes a first image that was acquired with light emitted from an illuminator, and a second image that was acquired without or with less of the light emitted from the illuminator. The system further includes a processor to process the first image and the second image to locate the vehicle license plate.
In another embodiment, there is provided a method for recognizing a vehicle license plate. The method may include retrieving from memory a plurality of digital images. The plurality of digital images may include a first image that was acquired with light emitted from an illuminator, and a second image that was acquired without or with less of the light emitted from the illuminator. The method may further include processing the first image and the second image to locate the vehicle license plate.
In another embodiment, there is provided a method that includes activating an illuminator to emit light, and acquiring a first image with light emitted from the illuminator. The method may further include acquiring a second image without or with less of the light emitted from the illuminator. The method may further include generating a compound image from the first image and the second image. The method may further include searching for a vehicle license plate in the compound image.
In another embodiment, there is provided a system including at least one image acquisition device to acquire a plurality of digital images. The plurality of digital images includes a first image acquired with light emitted from an illuminator, and a second image acquired without or with less of the light emitted from the illuminator. The system further includes a memory to store the plurality of digital images. The system further includes a processor. The processor is configured to generate a compound image from the first image and the second image. The processor may be further configured to search for a vehicle license plate in the compound image. The system may further include the illuminator.
Embodiments will be described, by way of example only, with reference to the accompanying figures wherein:
For illustrative purposes, specific embodiments and examples will be explained in greater detail below in conjunction with the figures.
The automatic recognition system 162 further includes an image acquisition device 204 for acquiring digital images. The image acquisition device 204 may be any device for acquiring digital images, e.g. using a sensor such as a charge coupled device (CCD) sensor or a complementary metal-oxide-semiconductor (CMOS) sensor. In the embodiments below, the image acquisition device 204 is a digital camera. The digital camera 204 is located relatively close to the illuminator 202 so that the digital camera 204 may capture light reflected off of the retroreflective material 166 back at the illuminator 202. The illuminator 202 may be the flash for the digital camera 204. When the digital camera 204 is actuated to take a digital photograph, an aperture (not shown) on the digital camera 204 opens and the incoming light rays are detected using a sensor (not shown). The digital camera 204 converts the incoming light rays into a digital image.
An acquired digital image comprises a grid of pixels. Each pixel has a location coordinate (x,y) indicating the location of the pixel in the digital image, as well as a corresponding pixel value Y(x,y). The pixel value represents the intensity of the pixel at the pixel coordinate. A pixel value may have an intensity consisting of a single component, e.g. when the camera 204 provides a monochromatic output, as is typical with infrared cameras. In other embodiments, a pixel value may have an intensity consisting of different components, e.g. a red, green, and blue component, or a luminance (Y), blue-difference chroma (Cb), and red-difference chroma (Cr) component if the image taken by the camera 204 is a colour image. In the case of multi-component pixels, the pixel value may be the value of a single component (typically luminance) or a value derived as a function of multiple components. In one embodiment, the camera 204 captures monochromatic images, and the pixel value of each pixel in a captured monochromatic image is a single luminance component that represents the intensity of that pixel.
Each captured digital image is stored in a memory 206. A processor 208 processes one or more digital images stored in the memory 206 in order to detect the object 164 in a digital image and/or in order to identify one or more features of the object in the digital image. A controller 210 controls actuation of the camera 204 to acquire digital images, and also controls activation/deactivation of the illuminator 202 to synchronize turning on/off the light emitted by the illuminator 202. For example, as explained later in relation to
The controller 210 may be fully or partially included as part of the processor 208. Therefore, when “controller” is used herein, it may be replaced with “processor” instead. The processor 208 and controller 210 may each be implemented by one or more processors that execute instructions stored in memory 206. The instructions, when executed, cause the processor 208 and controller 210 to perform the operations described herein, e.g. subtracting images, aligning content between images, locating the object 164 in an image, identifying features of the object 164, controlling/synchronizing activation/deactivation of the illuminator 202, controlling actuation of the camera 204, etc. Alternatively, some or all of the processor 208 and/or controller 210 may be implemented using dedicated circuitry, such as an application specific integrated circuit (ASIC), a graphics processing unit (GPU), or a programmed field programmable gate array (FPGA) for performing the operations of the processor 208 and/or controller 210.
The retroreflective material 166 on the object 164 may facilitate locating the object 164 in the digital image and/or identifying one or more features of the digital object 164, particularly when there is not much ambient light. As an example, assume the automatic recognition system 162 is used to locate and read license plate 164 in digital image 252 of
Another problem may arise when the retroreflective material 166 on the object 164 is damaged or soiled, which may result in a poor quality image when the illuminator 202 is used. In such a situation, feature identification by the processor 208 may actually be more difficult compared to only using ambient light to capture the image. For example,
The technical problems discussed above are presented in the context of automatic license plate recognition. However, it will be appreciated that similar technical problems may exist in other applications that use automatic object recognition, e.g. when automatically locating and/or identifying features in a traffic sign, or when automatically recognizing the presence of another vehicle or obstruction on the road having a retroreflective material, which may be applicable for autonomous vehicles, or for driver assistance systems (e.g. that perform object detection around a vehicle to help a human driver).
Embodiments described below aim to address at least some of the technical problems described above by providing an improvement in a technological system for automatic object recognition. Embodiments described below aim to provide specific enhanced non-generic computer functionality.
In some embodiments below, at least one image acquisition device (e.g. a digital camera) is used to acquire a plurality of digital images of an object comprising a retroreflective material. The plurality of digital images includes at least a first image and a second image. The first image is acquired with light emitted from the illuminator. Therefore, the first image includes both retroreflective light and ambient light, and the pixel intensities of the first image will be proportional to the light intensity of the retroreflective light and ambient light. A second image is acquired without the light emitted from the illuminator. Therefore, the second image only includes the ambient light, and the pixel intensities of the second image will be proportional to the light intensity of the ambient light. The processor may then subtract the second image from the first image to result in a compound image having at least some of the ambient light removed. The subtraction may comprise, for each pixel coordinate, subtracting the pixel value at that coordinate in the second image from the pixel value at that coordinate in the first image to obtain a difference value, and then using the difference value as the pixel value at that coordinate in the compound image. In some embodiments, the difference value may be an absolute value of the difference, however in one example, the difference value is a subtraction whereby the pixel value is set to zero if the subtraction results in a negative number.
In some embodiments, a pixel value used in the subtraction operation may have only one component representing intensity (e.g. in the case of an infrared image). In other embodiments, a pixel value used in the subtraction operation may include multiple components (e.g. Y, Cb, Cr). In other embodiments, a pixel value used in the subtraction operation may be a function of multiple components. For example, a pixel value may be a single value that represents only the Y component of that pixel, or may be a single value that represents some combination of Y, Cb, and Cr for that pixel, or may be a single value that represents the original pixel value, or one component of the original pixel value, further modified by a function. In any case, if the pixel values used in the subtraction have more than one component, then in one embodiment the subtraction occurs at each respective component.
After the subtraction occurs to generate the compound image, the processor may then locate the object in the compound image. It may be easier to locate the object in the compound image compared to in the first image or the second image, because at least some of the ambient light has been removed from the compound image. This may particularly be the case when there is a strong source of ambient light present in the environment, such as on a sunny day. In the compound image, the retroreflective object will generally be more conspicuous than the surrounding scene in the compound image. This is because at least some of the ambient light has been removed from the compound image, and the retroreflective light still present in the image will largely comprise light reflected directly back from the retroreflective object, rather than from other non-retroreflective items in the scene that scatter the light from illuminator. The pixel intensities of the compound image will be proportional to the light intensity of the light emitted from the illuminator and reflected back at the camera, which will mostly be the retroreflected light from the object. The retroreflective object will therefore have pixel values that, in general, have a higher intensity than the pixel values of the scene surrounding the retroreflective object.
In order to generate first and second images over time, the controller 210 may control the camera 204 to actuate and take a digital photograph on a regular basis, e.g. every 1/45th of a second, and the controller 210 may control the illuminator 202 to emit light during every other digital photograph. For example,
Content in the first and second images may be aligned before obtaining the compound image, e.g. to account for motion, such as the retroreflective object and/or the automatic recognition system moving between acquiring adjacent digital images. Motion between adjacent images may also happen as a result of relatively long camera exposition and/or camera vibration. If there is movement between acquiring adjacent digital images, then object position expressed in image pixel coordinates will be different in adjacent images. For example,
After the object 164 is located in the compound image, the processor 208 may process the digital image of the object 164 in order to identify a feature of the object 164. For example, if the object 164 is a license plate, the processor 208 may attempt to identify the vehicle registration identifier (e.g. “ABC 123”) written on the license plate. The processor 208 may use any one of the acquired images (first image, second image, or compound image) to try to identify the feature of the object 164. Depending on the environmental conditions, e.g. whether it is a sunny day or at night, and whether the retroreflective material 166 on the object 164 is damaged or soiled, then one of the acquired images may more clearly display the feature than the other. Therefore, in one embodiment, the processor 208 may perform feature identification on each one of the acquired images and output the result, along with a measure of confidence of the outputted result. The processor 208 may then select the identified feature associated with the highest confidence measure. For example,
As one example, consider operation of the automatic recognition system 162 on a sunny day and in a situation in which some of the retroreflective material 166 of the object 164 is damaged. Locating the object 164 in the compound image may be easier compared to locating the object in the first or second image, because the strong ambient light from the sun is reduced or eliminated in the compound image, which causes the object 164 to appear in the compound image as relatively more conspicuous (relatively more intense pixel values) than the surrounding scene. However, due to the damaged retroreflective material 166, the feature to be identified in the object may be more accurately identified in the second image because the second image includes only the ambient light. The second image does not include the retroreflected light, which may distort the portion of the object having the damaged retroreflective material.
In embodiments described above, the same camera 204 takes successive digital images. Alternatively, there may multiple cameras, e.g. two cameras may be used: one to take images with light emitted from the illuminator 202, and the other one to take images without the light emitted from the illuminator 202. For example,
When only one camera is used to take both the first and second images, e.g. as in
In step 452, the controller 210 activates the illuminator 202 to emit light. In step 454, the controller 210 activates an image acquisition device to acquire a first image of an object 164 having a retroreflective material 166. The first image is acquired with light emitted from the illuminator 202. Therefore, the first image includes both retroreflected light comprising light from illuminator 202 that has reflected off of the retroreflective material 166, as well as any ambient light that might be present.
In step 456, the controller 210 activates an image acquisition device to acquire a second image of the object 164. The second image is acquired without or with less of the light emitted from the illuminator 202. For example, the controller 210 may deactivate the illuminator 202 prior to acquiring the second image so that the illuminator 202 does not emit any light. As another example, a second image acquisition device may be used to capture the second image, and the second image acquisition device may be physically separated from the illuminator 202 in such a way that retroreflected light from the illuminator 202 is not captured in the second image. In yet another alternative embodiment, a shutter may be used so as to controllably block and admit light from the illuminator.
The terms “first image” and “second image” as used herein are labels to distinguish between two different images. The order in which the images are acquired is not implied. For example, in some embodiments, step 456 may be performed before steps 452 and 454.
In step 458, the processor 208 aligns content in the first and second images. It should be appreciated that step 458 may result in no change to either image in cases where, e.g. there is no movement of the object or of the automatic recognition system 162 between acquiring the first image and the second image, such that the first image and the second image have substantially the same content in substantially the same location in both images. In some embodiments, step 458 may be optional.
In step 460, the processor 208 generates a compound image by subtracting the second image from the first image. As described above, the first and second images may first need to be aligned.
In step 462, the processor 208 then searches for the object in the compound image in order to locate the object in the compound image. The exact procedure implemented by the processor 208 to locate the object in the compound image is implementation specific and may depend upon the object actually being located (e.g. a license plate versus a traffic sign versus an obstruction having a retroreflective surface, etc.). However, in general the object in the compound image will have pixels that have a higher intensity compared to the pixels in the surrounding scene of the compound image. This is because the ambient light has been reduced or eliminated in the compound image, and the retroreflective light still present in the compound image will largely comprise light reflected directly back from the object 164, rather than from other non-retroreflective items in the scene that scatter the light from illuminator 202. Therefore, the procedure for locating the object may rely, at least in part, upon identifying pixels having a higher intensity compared to other pixel. For the sake of completeness, an example method for searching for an object in a digital image to locate the object (where the object is a license plate) is disclosed in Zheng, D., Zhao, Y. and Wang, J., 2005. “An efficient method of license plate location”, Pattern recognition letters, 26(15), pp. 2431-2438. Another example method is disclosed in Anagnostopoulos, C. N. E., Anagnostopoulos, I. E., Loumos, V. and Kayafas, E., 2006, “A license plate-recognition algorithm for intelligent transportation system applications”, IEEE Transactions on Intelligent transportation systems, 7(3), pp. 377-392.
Optionally, in step 464, the processor 208 uses at least one of the first image, the second image, and the compound image to identify one or more features of the object. For example, as explained above in relation to
Step 464 is optional because in some applications it may not be necessary to identify a feature in an object having a retroreflective material. For example, if the object is an obstruction having a retroreflective surface, just identifying the location of the obstruction may be all that is needed.
In some embodiments, the components of the automatic recognition system 162 may be distributed. For example, the processor 208 may be remote from the one or more digital cameras, e.g. the processor 208 may be in the cloud and receive the digital images from the one or more digital cameras over a network.
As discussed above, it may be necessary to align content in the first image (illuminator on) with content in the second image (illuminator off) before generating the compound image. The following describes one specific example method for aligning content of the first and second images. It will be appreciated that other methods of aligning content in the first and second images may be used instead.
The example method described below operates as follows. Three adjacent images are acquired: image A is first acquired with light emitted from the illuminator 202, such that image A includes both ambient light and retroreflective light (A+R); image B is then acquired without the light emitted from illuminator 202, such that image B includes only ambient light (A). Image C is then acquired with light emitted from the illuminator 202, such that image C includes both ambient light and retroreflective light (A+R). The retroreflective object 164 moves between acquiring images A, B, and C, and therefore the object 164 is displaced by a displacement vector between each of the images.
Image A or image C is used as the ‘first image’ (illuminator on) in the method of
To obtain displacement vector d2 from displacement vector d1, it is first necessary to obtain displacement vector d1. One example way to determine displacement vector d1 is explained with reference to
Two images J and K, and a set of predicted motion vectors are input into the motion estimator ME. Each predicted motion vector of the set of predicted motion vectors corresponds to a respective rectangular region in image J and K, and predicts the displacement of the content from image J to image K in that respective rectangular region. For example, the image J may be partitioned into 16×16=256 rectangular regions. For each rectangular region of J, steps 1 to 3 are performed below, which are illustrated in
Step 1: Form a set of rectangular areas in image K by translating the position of the rectangular region in image J by the predicted motion vector corresponding to that region, and by further translating the rectangular region with all the possible translations in a search range, i.e. all translations of an integer number of pixels in the horizontal and vertical directions, e.g. with range of ±16 pixels horizontally and ±12 pixels vertically. With reference to
Step 2: Compare the pixels of the rectangular region of image J to the pixels in each rectangular area in image K. Select the rectangular area in image K that has the pixels the most similar to pixels in the rectangular region in image J. In this example, the similarity measure used is the sum of absolute difference of the pixels. With reference to
Step 3: Output a motion vector for the rectangular region as the displacement between the rectangular region of image J and the selected rectangular area in image K. This is motion vector 620 in
When steps 1 to 3 are repeated for each rectangular region of J, the result is a set of output motion vectors, each one corresponding to a respective rectangular region in image J and K.
Further detail describing one possible implementation of the motion estimation algorithm may be found in the publication “Intro to Motion Estimation Extension for OpenCL*” by Maxim Shevtsov, published by Intel Corporation in 2013.
Returning back to
Images A8 and C8 are input into motion estimator 524, along with a set of predicted motion vectors each having a value (0,0), i.e. no predicted displacement. The set of output motion vectors from motion estimator 524 are upsampled and multiplied by two via upsampler 526, and then input as the predicted motion vectors into motion estimator 528, along with images A4 and C4. The set of output motion vectors from motion estimator 528 are upsampled and multiplied by two via upsampler 530, and then input as the predicted motion vectors into motion estimator 532, along with images A2 and C2. The set of output motion vectors from motion estimator 532 are upsampled and multiplied by two via upsampler 534. The output of upsampler 534 is displacement vector d1 in the form of a set of displacement vectors, each displacement vector in the set corresponding to a respective rectangular region in images A and C.
The displacement vector d1 may then have its magnitude divided in half to obtain displacement vector d2.
Image A or image C may be selected as the ‘first image’ in the method of
In some embodiments, optional step 464 of
In the example described above in relation to
Many variations of the embodiments described above are possible. Some example variations are described below.
As mentioned earlier, the image captured with the illuminator on and the image captured with the illuminator off may be acquired by two different cameras. However, in some embodiments, the exposures of these cameras may be translated slightly in time, just enough to have the illuminator turned on for one exposure and turned off for the other exposure. This may allow the alignment of the scene between the two images to be constant because the displacement of the scene or the displacement of the automatic recognition system would be negligible. This may remove the need to dynamically determine the displacement of content from one image to the other during operation. However, it may require a calibration step in which the constant displacement is determined between the scene in the images captured by the two cameras.
In embodiments above, the second image is acquired without or with less of the light from the illuminator. This may be achieved in different ways, e.g. by alternating the illuminator on and off (as in
Step 1: Set C=1.
Step 2: Align content of first and second images, as necessary.
Step 3: Calculate the compounded image as per Camera1(Ambient+Retroreflected)−C×Camera2(Ambient).
Step 4: Evaluate the average value of the 10% least illuminated pixels.
Step 5: If the average is negative, reduce C by a determined percentage that is proportional to the average, and recalculate from Step 3.
Step 6: If the average is positive and higher than a determined value, increase C by a determined percentage that is proportional to the average, and recalculate from Step 3.
Step 7: If the average is positive and lower than a determined value, exit the algorithm. Take the value of C as output to be used in the subtraction operation when subtracting the second image from the first image.
A periodic or always running algorithm, such as that above, can calculate and maintain the value C so that the result of the substraction where the image is exposed by the ambient light is as low as possible without being negative. Note that the algorithm may not need to be executed once per incoming pair of images, because the value C is not expected to change very fast. In general, C may only change if the lighting conditions change.
During operation, the processor calculates the compound image in the same way described earlier, but incorporating value C, e.g.: obtain non-illuminated image B, align it with image A using dynamic displacement and/or constant (static) displacement as needed, and then subtract both images using the modified subtraction Pixel(Image A)−{C×Displaced Pixel(Image B)}.
Other variations are described below.
In some embodiments, if there is no (or very small) motion of scene or of the automatic recognition system, then finding the alignment between the scenes in images may result in no change to any of the images. In some embodiments, the alignment step may be omitted, even if there is some motion between adjacent images, because the reduction in complexity from omitting the alignment step may outweigh the reduced accuracy.
In some embodiments, the automatic recognition system may further comprise an auto-exposure algorithm to determine an appropriate exposure level (gain, exposure time, iris) that is best suited to produce a compound image with good contrast in the retroreflective object (e.g. with good contrast in the license plate).
In some embodiments, the automatic recognition system may be configured to disable the method described herein if the ambient light is small enough that it does not impede object localization. For example, at night when there is little to no ambient light, the system may be configured to just take a digital photograph with the illuminator on, and locate the retroreflective object using just that image. Subtracting an image with the illuminator off from an image with the illuminator on (i.e.
In some embodiments, the automatic recognition system may further comprise an extra illuminator located far from the camera, and that would provide ambient light to result in an image that is not completely dark when the main illuminator is off, even if there is no sunlight.
In some embodiments, the automatic recognition system may further comprise an apparatus/method to compensate for different exposure levels (exposure time, gain, iris) between images. This compensation may occur before or during scene alignment or image subtraction.
Many variants can be implemented to align the content in various images, e.g.: one or more of the following variants: different algorithms could be used, in particular the automatic recognition system could detect keypoints, extract descriptors from them, and match these descriptors; or the automatic recognition system could use optical flow for content alignment; the image captured with illuminator on and the image captured with illuminator off may be directly aligned together without relying on the alignment between only images captured with the illuminator on; the displacement vectors may be processed by local filtering to remove outliers, to favor displacements that are smooth; the motion vectors computed in previous images may be used to help the computation of motion vectors; the motion vectors directions may be found from a mean of motion vector directions in previous images; the motion vectors directions may be forced to be in accordance with a perspective motion field; the motion vectors directions may be determined according to the displacement of the automatic recognition system, e.g. if it is known by some means (e.g., by accelerometers, GPS, etc.); the images may be subsampled by other factors than described; the translation range may be different than described; the similarity measure between pixels in two rectangular regions may be different than described.
In embodiments described above, the compound image is generated by subtracting the second image from the first image, i.e. a subtraction operation. However, more generally the compound image may be generated from or using the first image and the second image, e.g. by combining the first image and the second image, and not necessarily subtraction. For example, an artificial intelligence method, such as a machine learning algorithm, may be used. Also, more generally, in some embodiments a compound image may not even be generated. Rather, the first and second images may be processed to locate the object, e.g. via an artificial intelligence method, such as by a machine learning algorithm. For example, in some embodiments, instead of using the compound image to locate the object, the algorithm implemented by the processor for object location may receive in parallel the image captured with the illuminator on and an aligned image obtained from the image captured with the illuminator off. The pixel intensities of these two images could then be combined to locate the object. In some embodiments, a compound image may be generated that, instead of being the subtraction of the two images, is the “channel concatenation” of the two images. A channel of an image refers to a component of an image, e.g. one colour or luminance component of the image. A colour image may have three channels, e.g. a red channel, blue channel, and green channel, where the red channel is a grayscale image of the same size as the original colour image but only representing the red component of each pixel, where the blue channel is a grayscale image of the same size as the original colour image but only representing the blue component, and where the green channel is a grayscale image of the same size as the original colour image but only representing the green component. A monochromatic image only has one channel. The channel concatenation of two images refers to concatenating the channels of each image. Channel concatenation involves creating a data structure that is a 2D set of pixels (an image), but where each pixel has information about all of the channels of the original images. For example, if the first image had pixel value R1(x,y) at pixel coordinate (x,y) of its red channel, if the second image had pixel value R2(x,y) at corresponding pixel coordinate (x,y) of its red channel, if the first image had pixel value B1(x,y) at pixel coordinate (x,y) of its blue channel, if the second image had pixel value B2(x,y) at corresponding pixel coordinate (x,y) of its blue channel, if the first image had pixel value G1(x,y) at pixel coordinate (x,y) of its green channel, and if the second image had pixel value G2(x,y) at corresponding pixel coordinate (x,y) of its green channel, then the channel concatenation of the two images is [R1(x,y), R2(x,y), B1(x,y), B2(x,y), G1(x,y), G2(x,y)] at pixel coordinate (x,y). In one embodiment, two monochrome images (the first image and the second image) may be taken, and combined to form a compound image that has two channels (one corresponding to the first image and one corresponding to the second image). In another embodiment, two red-green-blue images (the first image and the second image) may be taken and combined to form a compound image that has six channels (one channel for each red, green, blue component for each image). Some embodiments may use deep learning for object detection. A deep learning algorithm may accept images with any number of channels as input images. The deep learning may use training to automatically find intermediate “features” that can be based on the difference between pixels of different channels. For example, in some embodiments, a compound image may be generated that is the channel concatenation of the two images (illuminator on and illuminator off), followed by a deep learning feature identification algorithm that can automatically take advantage of all of the information from all of the channels, possibly including subtraction of the pixel values of the different channels.
In some embodiments, the automatic recognition system may better align the image regions of the retroreflective object after the retroreflective object has been localized, but before identifying a feature in the retroreflective object. For example, the automatic recognition system may better align the image regions of a license plate after the license plate has been localized, but before the license plate is read. This alignment may be a global alignment of the license plate regions.
In some embodiments, instead of identifying a feature of the retroreflective object separately in the image captured with illuminator on, in the image captured with illuminator off, and in the compound image (e.g. instead of steps 722 to 732 of
In some embodiments, a plurality of retroreflective objects may be located in an image (e.g. in the compound image described above), and instead of identifying a feature in one of the objects, a feature may be identified based on the pattern, location, and/or arrangement of the objects. For example, each retroreflective object may be a character. The characters may be individually located in the image, and the combination of characters is read.
In step 954, the first image and the second image are processed to search for an object in order to locate the object. The processing may be of any type, e.g. an artificial intelligence algorithm, subtraction of images, etc. In some embodiments, the processing includes generating at least one compound image from the first image and the second image. The compound image may then be used to search for the object to locate the object. In some embodiments, generating the compound image includes combining the first and second images, e.g. by subtracting the second image from the first image. In some embodiments, the method further includes aligning content of the first image and the second image prior to generating the compound image.
In any of the embodiments described above in relation to the method of
In any of the embodiments described above in relation to the method of
In any of the embodiments described above in relation to the method of
Further to the above, some specific examples are provided below.
A system comprising: at least one image acquisition device to acquire a plurality of digital images, wherein the plurality of digital images includes a first image acquired with light emitted from an illuminator, and a second image acquired without or with less of the light emitted from the illuminator; a memory to store the plurality of digital images; a processor to: generate a compound image from the first image and the second image; align content of the first image and the second image prior to generating the compound image; and locate an object in the compound image.
The system of example 1, wherein the plurality of digital images is of the object, and wherein the object has a retroreflective material.
The system of example 1 or 2, wherein the processor is to align the content of the first image and the second image by computing a displacement vector representing the displacement of the content between the first image and the second image, and then applying the displacement vector to pixels of the either the first image or the second image.
The system of example 3, wherein the plurality of digital images further includes a third image acquired with light emitted from the illuminator, wherein the first image, the second image, and the third image are successive in time, wherein the displacement vector is a first displacement vector, and wherein computing the first displacement vector comprises: computing a second displacement vector representing the displacement of the content between the first image and the third image; and obtaining the first displacement vector from the second displacement vector.
The system of any one of examples 1 to 4, further comprising the illuminator.
The system of example 5, wherein the processor is to control the image acquisition device to acquire the first image and control the illuminator to emit the light in order to synchronize light emission by the illuminator with acquisition of the first image.
The system of any one of examples 1 to 6, wherein the processor is to generate the compound image from the first image and the second image by combining the first image and the second image.
The system of example 7, wherein the processor is to generate the compound image by subtracting the second image from the first image.
The system of any one of examples 1 to 8, wherein the processor is further to identify a feature of the object in the second image.
The system of example 9, wherein the feature identified in the second image is a second instance of the identified feature, wherein the processor is to also identify the feature in the first image to obtain a first instance of the identified feature, and wherein the processor is further to select one of the first instance and the second instance as a selected identified feature.
The system of example 9 or 10, wherein the object is a license plate, and the feature is a vehicle registration identifier on the license plate.
The system of any one of examples 1 to 10, wherein the object is a license plate.
A system comprising: a memory to store a plurality of digital images, wherein the plurality of digital images includes a first image that was acquired with light emitted from an illuminator, and a second image that was acquired without or with less of the light emitted from the illuminator; a processor to: generate a compound image from the first image and the second image, content of the first image and the second image having been aligned; and locate an object in the compound image.
The system of example 13, wherein the plurality of digital images is of the object, and optionally wherein the object is a license plate having a retroreflective material.
The system of example 13 or 14, wherein the processor is to align the content of the first image and the second image by computing a displacement vector representing the displacement of the content between the first image and the second image, and then applying the displacement vector to pixels of the either the first image or the second image.
A method comprising: activating an illuminator to emit light, and acquiring a first image with light emitted from the illuminator; acquiring a second image without or with less of the light emitted from the illuminator; generating a compound image from the first image and the second image; aligning content of the first image and the second image prior to generating the compound image; and locating an object in the compound image.
The method of example 16, wherein both the first image and the second image are of the object, and wherein the object has a retroreflective material.
The method of example 16 or 17, wherein the aligning comprises computing a displacement vector representing the displacement of the content between the first image and the second image, and then applying the displacement vector to pixels of the either the first image or the second image.
The method of example 18, further comprising acquiring a third image with light emitted from the illuminator, wherein the first image, the second image, and the third image are successive in time, wherein the displacement vector is a first displacement vector, and wherein computing the first displacement vector comprises: computing a second displacement vector representing the displacement of the content between the first image and the third image; and obtaining the first displacement vector from the second displacement vector.
The method of any one of examples 16 to 19, comprising controlling an image acquisition device to acquire the first image and controlling the illuminator to emit the light in order to synchronize light emission by the illuminator with acquisition of the first image.
The method of any one of examples 16 to 20, wherein generating the compound image comprises combining the first image and the second image.
The method of example 21, wherein generating the compound image comprises subtracting the second image from the first image.
The method of any one of examples 16 to 22, further comprising identifying a feature of the object in the second image.
The method of example 23, wherein the feature identified in the second image is a second instance of the identified feature, and wherein the method further comprises identifying the feature in the first image to obtain a first instance of the identified feature, and selecting one of the first instance and the second instance as a selected identified feature.
The method of example 23 or 24, wherein the object is a license plate, and the feature is a vehicle registration identifier on the license plate.
The method of any one of examples 16 to 24, wherein the object is a license plate.
A method comprising: retrieving from memory a plurality of digital images, wherein the plurality of digital images includes a first image that was acquired with light emitted from an illuminator, and a second image that was acquired without or with less of the light emitted from the illuminator; generating a compound image from the first image and the second image; aligning content of the first image and the second image prior to generating the compound image; and locating an object in the compound image.
The method of example 27, wherein the plurality of digital images is of the object, and optionally wherein the object is a license plate having a retroreflective material.
The method of example 27 or 28, wherein the aligning comprises computing a displacement vector representing the displacement of the content between the first image and the second image, and then applying the displacement vector to pixels of the either the first image or the second image.
Although the foregoing has been described with reference to certain specific embodiments, various modifications thereof will be apparent to those skilled in the art without departing from the scope of the claims appended hereto.
Blais-Morin, Louis-Antoine, Cassani, Pablo Agustin
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
10691968, | Feb 08 2018 | GENETEC INC | Systems and methods for locating a retroreflective object in a digital image |
10755571, | Mar 01 2019 | Amazon Technologies, Inc.; Amazon Technologies, Inc | Identifying parking location using single camera reverse projection |
5381155, | Dec 08 1993 | Vehicle speeding detection and identification | |
5757286, | May 28 1993 | Kapsch TrafficCom AB | Method and a device for the registration of the movement of a vehicle |
6404902, | Jul 28 1997 | Sharp Kabushiki Kaisha | Image processing apparatus |
6650765, | Jan 11 2000 | Pulnix America, Inc. | System for simultaneously imaging vehicles and their license plates |
7982634, | Mar 22 2006 | KRIA S R L | System for detecting vehicles |
8526677, | Jul 16 2012 | GOOGLE LLC | Stereoscopic camera with haptic feedback for object and location detection |
9087380, | May 26 2004 | PLAYDATA SYSTEMS, INC | Method and system for creating event data and making same available to be served |
9142010, | Jan 04 2012 | Knowles Electronics, LLC | Image enhancement based on combining images from multiple cameras |
9270875, | Jul 20 2011 | AVAGO TECHNOLOGIES INTERNATIONAL SALES PTE LIMITED | Dual image capture processing |
9927253, | May 11 2016 | Ubicquia IQ LLC | System and stereoscopic range determination method for a roadway lighting system |
20100131197, | |||
20100151213, | |||
20120242809, | |||
20130148846, | |||
20130162799, | |||
20130299517, | |||
20150163390, | |||
20160232410, | |||
20160241839, | |||
20160300354, | |||
20170075888, | |||
20170148102, | |||
20170286794, | |||
20180137368, | |||
20180268687, | |||
20190166355, | |||
20190244045, | |||
20190287297, | |||
20190318622, | |||
20200311435, | |||
20200327343, | |||
20210124966, | |||
CN105182320, | |||
CN105844266, | |||
CN106682611, | |||
CN107195004, | |||
CN108022429, | |||
CN109544707, | |||
CN110264729, | |||
CN110285793, | |||
CN110322702, | |||
CN110458897, | |||
CN110930716, | |||
CN203165216, | |||
CN207380911, | |||
CN209625442, | |||
JP2007047876, | |||
KR100541865, | |||
KR101151523, | |||
KR101392357, | |||
KR101794308, | |||
KR101828558, | |||
KR20050020350, | |||
KR20120000373, | |||
KR20140000402, | |||
WO2011044149, | |||
WO2017059527, | |||
WO2018049514, | |||
WO2019066770, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
May 25 2018 | BLAIS-MORIN, LOUIS-ANTOINE | GENETEC INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052559 | /0335 | |
Jul 09 2018 | CASSANI, PABLO AGUSTIN | GENETEC INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052559 | /0335 | |
May 04 2020 | GENETEC INC. | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
May 04 2020 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Date | Maintenance Schedule |
Jun 21 2025 | 4 years fee payment window open |
Dec 21 2025 | 6 months grace period start (w surcharge) |
Jun 21 2026 | patent expiry (for year 4) |
Jun 21 2028 | 2 years to revive unintentionally abandoned end. (for year 4) |
Jun 21 2029 | 8 years fee payment window open |
Dec 21 2029 | 6 months grace period start (w surcharge) |
Jun 21 2030 | patent expiry (for year 8) |
Jun 21 2032 | 2 years to revive unintentionally abandoned end. (for year 8) |
Jun 21 2033 | 12 years fee payment window open |
Dec 21 2033 | 6 months grace period start (w surcharge) |
Jun 21 2034 | patent expiry (for year 12) |
Jun 21 2036 | 2 years to revive unintentionally abandoned end. (for year 12) |