Systems, methods, models, and training data for models are discussed for determining vehicle positioning, and in particular for identifying tailgating. Simulated training images showing vehicles following other vehicles, under various conditions, are generated using a virtual environment. Models are trained to determine the following distance between two vehicles. Trained models are used to detect tailgating based on the determined distance between two vehicles. Results of tailgating detection are output to warn a driver or to provide a report on driver behavior. Following distance over time is determined, and simplified following distance data is generated for use at a management device.
1. A system for determining a following distance between a first vehicle and a second vehicle, the system comprising:
at least one processor; and
at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to:
access an image including a representation of the first vehicle from a perspective of the second vehicle behind the first vehicle, the image further representing a common lane of travel of the first vehicle and the second vehicle;
determine, by the at least one processor, a first vertical position in the image representing a bottom of the first vehicle;
access, by the at least one processor, a second vertical position in the image representing a static physical distance from the second vehicle;
determine a transformed first vertical position by applying, by the at least one processor, an image transformation matrix to the first vertical position;
determine a transformed second vertical position by applying, by the at least one processor, the image transformation matrix to the second vertical position;
determine an image distance between the transformed first vertical position and the transformed second vertical position;
determine the following distance as a physical distance between the first vehicle and the second vehicle based on the determined image distance and the static physical distance; and
output the determined following distance.
2. The system of
3. The system of
4. The system of
5. The system of
determine a first boundary of the common lane of travel;
determine a second boundary of the common lane of travel;
determine a vanishing point for the image as a point where the first boundary and the second boundary intersect;
access a region of interest in the image, where a bottom edge and a top edge of the region of interest are parallel to a horizontal axis of the image, a left edge of the region of interest extends from a first point left of the common lane of travel towards the vanishing point, and a right edge of the region of interest extends from a second point right of the common lane of travel towards the vanishing point;
determine a transformed region of interest where a left edge of the transformed region of interest is parallel to a right edge of the transformed region of interest; and
determine the image transformation matrix as a matrix which when applied to the region of interest, transforms the region of interest to the determined transformed region of interest.
6. The system of
7. The system of
8. The system of
9. The system of
10. The system of
11. The system of
determine a ratio of image distance to physical distance for a vertical axis of the image as transformed according to the image transformation matrix, based on a received measurement for a physical distance coaxial to the common lane and an image distance coaxial to the common lane as represented in the transformed region of interest.
12. The system of
determine a ratio of image distance to physical distance for a horizontal axis of the image as transformed according to the image transformation matrix, based on a received measurement for a physical width of the common lane and an image distance width of the common lane as represented in the transformed region of interest.
13. The system of
determine a ratio of image distance to physical distance for a vertical axis of the image as transformed according to the image transformation matrix, based on the ratio of image distance to physical distance for the horizontal axis of the image and based on the image transformation matrix.
14. The system of
15. The system of
determine, by the at least one processor, whether the following distance is within a tailgating distance criteria;
identify, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and
identify, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria.
16. The system of
17. The system of
18. The system of
19. The system of
20. The system of
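For illustration only, the following Python sketch shows one way the perspective-transform computation recited in claims 1, 5, and 11 above could be realized using OpenCV. The region-of-interest corners, the metres-per-pixel calibration, and the static reference distance are placeholder assumptions chosen for the example, not values taken from the disclosure.

```python
# Minimal sketch of the perspective-transform following-distance computation.
# All concrete values below are hypothetical calibration placeholders.
import numpy as np
import cv2

# Region of interest in the source image: bottom and top edges horizontal, left and
# right edges converging toward the lane vanishing point (cf. claim 5).
roi_src = np.float32([[300, 700], [980, 700],    # bottom-left, bottom-right
                      [560, 420], [720, 420]])   # top-left, top-right
# Transformed region of interest: left and right edges parallel (a rectangle).
roi_dst = np.float32([[0, 800], [400, 800],
                      [0, 0],   [400, 0]])

# Image transformation matrix mapping the region of interest onto the rectified one.
H = cv2.getPerspectiveTransform(roi_src, roi_dst)

def rectify_y(x: float, y: float) -> float:
    """Apply the image transformation matrix to a point and return its vertical position."""
    pt = np.float32([[[x, y]]])
    return float(cv2.perspectiveTransform(pt, H)[0, 0, 1])

# First vertical position: bottom of the lead vehicle in the image (e.g., from a detector).
lead_bottom_y = rectify_y(640, 520)
# Second vertical position: an image row known to lie a fixed physical distance ahead
# of the following vehicle (the "static physical distance" of claim 1).
reference_y = rectify_y(640, 700)
static_physical_distance_m = 5.0              # assumed calibration value

# Calibration ratio along the rectified vertical axis (cf. claim 11), here assumed
# from a measured longitudinal span represented in the transformed region of interest.
metres_per_pixel_y = 20.0 / 800.0             # assumed: 20 m spans the 800-pixel ROI height

image_distance = reference_y - lead_bottom_y
following_distance_m = image_distance * metres_per_pixel_y + static_physical_distance_m
print(f"estimated following distance: {following_distance_m:.1f} m")
```

In the rectified view, image distance along the vertical axis is approximately proportional to physical distance along the lane, which is what allows a single calibration ratio of the kind recited in claim 11 to convert the image distance into a physical following distance.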
This application claims priority to: U.S. Provisional Patent Application No. 63/456,179, titled “Systems and Methods for Detecting Vehicle Following Distance”, filed on Mar. 31, 2023; to U.S. Provisional Patent Application No. 63/526,233, titled “Systems and Methods for Detecting Vehicle Following Distance”, filed on Jul. 12, 2023; to U.S. Provisional Patent Application No. 63/537,875, titled “Systems and Methods for Detecting Vehicle Following Distance”, filed on Sep. 12, 2023; and to U.S. Provisional Patent Application No. 63/606,307, titled “Systems and Methods for Detecting Vehicle Following Distance”, filed on Dec. 5, 2023, each of which is incorporated in its entirety by reference herein.
The present disclosure generally relates to systems and methods for determining vehicle positioning, and in particular relates to determining vehicle following distance.
Monitoring vehicle movement and positioning is advantageous for fleet managers for a variety of reasons, including improving the safety of their fleets. Via real-time monitoring, inappropriate behavior or dangerous situations can be identified, and a driver can be immediately alerted to the dangerous situation. Reports can be prepared indicating or summarizing dangerous situations. Such alerts or reports may reduce the occurrence of traffic accidents. Monitoring vehicle movement and positioning is also useful in self-driving (autonomous) vehicles.
According to a broad aspect, the present disclosure describes a method for creating training data for training an artificial intelligence to predict a distance between two vehicles, the method comprising: for each instance in a first plurality of instances: accessing respective parameter data, the respective parameter data indicating at least a first position of a first vehicle and a second position of a virtual camera, the first position and the second position specific to the instance, the virtual camera representing a perspective from a second vehicle positioned behind the first vehicle, facing towards the first vehicle; simulating, by at least one processor in a virtual environment, the first vehicle at the first position and the virtual camera at the second position; rendering, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera; and outputting the at least one image for the instance, associated with a label indicative of a distance between the first vehicle and the second vehicle; and storing, by at least one non-transitory processor-readable storage medium, a first plurality of images including each at least one image output for each instance of the plurality of instances associated with the respective label indicating distance between the first vehicle and the second vehicle for the respective instance.
The method may further comprise: for each instance in a second plurality of instances: accessing respective parameter data, the respective parameter data indicating at least a third position of a virtual camera representing a perspective from a third vehicle; simulating, by the at least one processor in the virtual environment, the virtual camera at the third position; rendering, by the at least one processor in the virtual environment, at least one image for the instance from a perspective represented by the virtual camera at the third position; and outputting the at least one image for the instance, associated with a label indicative of a distance between two vehicles which is a null value; and storing, by the at least one non-transitory processor-readable storage medium, a second plurality of images including each at least one image output for each instance of the second plurality of instances associated with the respective label indicating a distance between two vehicles which is a null value.
For each instance in the first plurality of instances: the respective parameter data may further indicate the distance between the first vehicle and the second vehicle; and the label indicative of the distance between the first vehicle and the second vehicle may indicate the distance between the first vehicle and the second vehicle as included in the respective parameter data.
The method may further comprise, for each instance in the first plurality of instances: determining, by the at least one processor, the distance between the first vehicle and the second vehicle by determining a difference between the first position and the second position.
For each instance in the first plurality of instances, accessing the respective parameter data may comprise receiving the respective parameter data as user input via a user input device.
For each instance in the first plurality of instances, accessing the respective parameter data may comprise autonomously generating, by the at least one processor, the respective parameter data. For each instance in the first plurality of instances, autonomously generating the respective parameter data may comprise: autonomously determining random values for the first position and the second position, within a defined distance threshold.
For each instance in the first plurality of instances, outputting the at least one image for the instance may comprise outputting the at least one image for the instance associated with a distance label indicative of a distance between the first vehicle and the second vehicle and associated with a vehicle presence label indicative of whether the first vehicle is within a vehicle presence threshold of the second vehicle. The method may further comprise, for each instance in the first plurality of instances: generating, by the at least one processor, the vehicle presence label indicative of whether the first vehicle is within a vehicle presence threshold of the second vehicle, based on relative positions of the first vehicle and the second vehicle.
For each instance in the first plurality of instances: the respective parameter data may further indicate a resolution for the virtual camera; and rendering, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera may comprise rendering the at least one image for the instance at the resolution for the virtual camera.
For each instance in the first plurality of instances, the respective parameter data may further indicate at least one parameter selected from a group of parameters consisting of: type of the first vehicle; type of the second vehicle; dimensions of the first vehicle; dimensions of the second vehicle; properties of the first vehicle; properties of the second vehicle; position and orientation of the virtual camera relative to the second vehicle; lens attributes of the virtual camera; weather conditions; lighting conditions; time of day; and date.
The method may further comprise: selecting a subset of instances from the first plurality of instances; for each instance in the subset of instances: autonomously applying a distortion effect to the at least one image output for the instance. The distortion effect may include at least one distortion effect selected from a group of distortion effects comprising: image compression loss; pixel value distribution; adversarial effect; image noise; image saturation; and image blur.
The method may further comprise: selecting a subset of instances from the first plurality of instances; for each instance in the subset of instances: autonomously applying an environmental effect to the at least one image output for the instance. The environmental effect may include at least one environmental effect selected from a group of environmental effects comprising: rain; snow; and fog.
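As a non-limiting illustration, a distortion effect of the kind listed above might be applied to a randomly selected subset of rendered images as sketched below; the subset fraction, noise level, blur kernel, and JPEG quality are arbitrary assumed values.

```python
# Sketch of autonomously applying a distortion effect to a subset of rendered images.
import random
import cv2
import numpy as np

def distort_subset(image_paths, fraction=0.3, seed=0):
    rng = random.Random(seed)
    subset = rng.sample(image_paths, int(len(image_paths) * fraction))
    for path in subset:
        img = cv2.imread(path)
        effect = rng.choice(["noise", "blur", "jpeg"])
        if effect == "noise":
            # Additive image noise.
            noise = np.random.normal(0, 10, img.shape).astype(np.int16)
            img = np.clip(img.astype(np.int16) + noise, 0, 255).astype(np.uint8)
        elif effect == "blur":
            # Image blur.
            img = cv2.GaussianBlur(img, (5, 5), 0)
        else:
            # Approximate image compression loss by re-encoding at low JPEG quality.
            _, buf = cv2.imencode(".jpg", img, [cv2.IMWRITE_JPEG_QUALITY, 20])
            img = cv2.imdecode(buf, cv2.IMREAD_COLOR)
        cv2.imwrite(path, img)
```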
For each instance in the first plurality of instances, rendering, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera may comprise: rendering, by the at least one processor in the virtual environment, a single image for the instance from the perspective represented by the virtual camera.
For each instance in the first plurality of instances, rendering, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera may comprise: rendering, by the at least one processor in the virtual environment, a plurality of images for the instance from the perspective represented by the virtual camera, each image of the plurality of images for the instance representing a respective moment in time. For each instance in the first plurality of instances, simulating, by at least one processor in a virtual environment, the first vehicle at the first position and the virtual camera at the second position may comprise: simulating, by the at least one processor in the virtual environment, movement of the first vehicle and movement of the virtual camera over each respective moment in time represented by the plurality of images for the instance.
For each instance in the first plurality of instances, the first position of the first vehicle may indicate a longitudinal position and lateral position of the first vehicle.
For each instance in the first plurality of instances, the second position of the virtual camera may indicate a longitudinal position and a lateral position of the virtual camera within a road lane and a height of the virtual camera.
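A minimal Python sketch of such a generation loop is given below. The env object and its place_vehicle, place_camera, and render calls are hypothetical stand-ins for whatever simulator or rendering engine is actually used, and the distance ranges and vehicle-presence threshold are assumed values for illustration.

```python
# Hedged sketch of generating labeled training instances in a virtual environment.
import json
import os
import random

def generate_instances(env, n_instances: int, max_gap_m: float = 60.0, out_dir: str = "dataset"):
    os.makedirs(out_dir, exist_ok=True)
    labels = []
    for i in range(n_instances):
        # Parameter data: camera (second-vehicle) position and lead-vehicle position,
        # with the gap drawn randomly within a defined distance threshold.
        camera_pos = (0.0, random.uniform(-0.5, 0.5), random.uniform(1.2, 2.2))  # longitudinal, lateral, height
        gap = random.uniform(2.0, max_gap_m)
        lead_pos = (camera_pos[0] + gap, random.uniform(-0.5, 0.5))

        env.place_vehicle(lead_pos)          # hypothetical simulator call
        env.place_camera(camera_pos)         # hypothetical simulator call
        image = env.render()                 # hypothetical simulator call

        image_path = f"{out_dir}/instance_{i:06d}.png"
        image.save(image_path)               # hypothetical image object with a save() method
        labels.append({
            "image": image_path,
            "following_distance_m": gap,     # distance label for the instance
            "vehicle_present": gap <= 40.0,  # assumed vehicle-presence threshold
        })

    with open(f"{out_dir}/labels.json", "w") as f:
        json.dump(labels, f, indent=2)
```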
According to another broad aspect, the present disclosure describes a system for creating training data for training an artificial intelligence to predict a distance between two vehicles, the system comprising: at least one processor; at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to: for each instance in a first plurality of instances: access respective parameter data, the respective parameter data indicating at least a first position of a first vehicle and a second position of a virtual camera, the first position and the second position specific to the instance, the virtual camera representing a perspective from a second vehicle positioned behind the first vehicle, facing towards the first vehicle; simulate, by the at least one processor in a virtual environment, the first vehicle at the first position and the virtual camera at the second position; render, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera; and output the at least one image for the instance, associated with a label indicative of a distance between the first vehicle and the second vehicle; and store, by the at least one non-transitory processor-readable storage medium, a first plurality of images including each at least one image output for each instance of the plurality of instances associated with the respective label indicating distance between the first vehicle and the second vehicle for the respective instance.
The processor-executable instructions may further cause the system to: for each instance in a second plurality of instances: access respective parameter data, the respective parameter data indicating at least a third position of a virtual camera representing a perspective from a third vehicle; simulate, by the at least one processor in the virtual environment, the virtual camera at the third position; render, by the at least one processor in the virtual environment, at least one image for the instance from a perspective represented by the virtual camera at the third position; and output the at least one image for the instance, associated with a label indicative of a distance between two vehicles which is a null value; and store, by the at least one non-transitory processor-readable storage medium, a second plurality of images including each at least one image output for each instance of the second plurality of instances associated with the respective label indicating a distance between two vehicles which is a null value.
For each instance in the first plurality of instances: the respective parameter data may further indicate the distance between the first vehicle and the second vehicle; and the label indicative of the distance between the first vehicle and the second vehicle may indicate the distance between the first vehicle and the second vehicle as included in the respective parameter data.
The processor-executable instructions may further cause the system to, for each instance in the first plurality of instances: determine, by the at least one processor, the distance between the first vehicle and the second vehicle by determining a difference between the first position and the second position.
For each instance in the first plurality of instances, the processor-executable instructions which cause the system to access the respective parameter data may cause the system to receive the respective parameter data as user input via a user input device.
For each instance in the first plurality of instances, the processor-executable instructions which cause the system to access the respective parameter data may cause the at least one processor to autonomously generate the respective parameter data. For each instance in the first plurality of instances, the processor-executable instructions which cause the at least one processor to autonomously generate the respective parameter data cause the at least one processor to: autonomously determine random values for the first position and the second position, within a defined distance threshold.
For each instance in the first plurality of instances, the processor-executable instructions which cause the system to output the at least one image for the instance may cause the system to: output the at least one image for the instance associated with a distance label indicative of a distance between the first vehicle and the second vehicle and associated with a vehicle presence label indicative of whether the first vehicle is within a vehicle presence threshold of the second vehicle. The processor-executable instructions may further cause the system to, for each instance in the first plurality of instances: generate, by the at least one processor, the vehicle presence label indicative of whether the first vehicle is within the vehicle presence threshold of the second vehicle, based on relative positions of the first vehicle and the second vehicle.
For each instance in the first plurality of instances: the respective parameter data may further indicate a resolution for the virtual camera; and the processor-executable instructions which cause the system to render, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera may cause the at least one processor to: render the at least one image for the instance at the resolution for the virtual camera.
For each instance in the first plurality of instances, the respective parameter data may further indicate at least one parameter selected from a group of parameters consisting of: type of the first vehicle; type of the second vehicle; dimensions of the first vehicle; dimensions of the second vehicle; properties of the first vehicle; properties of the second vehicle; position and orientation of the virtual camera relative to the second vehicle; lens attributes of the virtual camera; weather conditions; lighting conditions; time of day; and date.
The processor-executable instructions may further cause the at least one processor to: select a subset of instances from the first plurality of instances; for each instance in the subset of instances: autonomously apply a distortion effect to the at least one image output for the instance. The distortion effect may include at least one distortion effect selected from a group of distortion effects comprising: image compression loss; pixel value distribution; adversarial effect; image noise; image saturation; and image blur.
The processor-executable instructions may further cause the at least one processor to: select a subset of instances from the first plurality of instances; for each instance in the subset of instances: autonomously apply an environmental effect to the at least one image output for the instance. The environmental effect may include at least one environmental effect selected from a group of environmental effects comprising: rain; snow; and fog.
For each instance in the first plurality of instances, the processor-executable instructions which cause the system to render, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera may cause the system to: render, by the at least one processor in the virtual environment, a single image for the instance from the perspective represented by the virtual camera.
For each instance in the first plurality of instances, the processor-executable instructions which cause the system to render, by the at least one processor in the virtual environment, at least one image for the instance from the perspective represented by the virtual camera may cause the system to: render, by the at least one processor in the virtual environment, a plurality of images for the instance from the perspective represented by the virtual camera, each image of the plurality of images for the instance representing a respective moment in time. For each instance in the first plurality of instances, the processor-executable instructions which cause the system to simulate, by the at least one processor in the virtual environment, the first vehicle at the first position and the virtual camera at the second position may cause the system to: simulate, by the at least one processor in the virtual environment, movement of the first vehicle and movement of the virtual camera over each respective moment in time represented by the plurality of images for the instance.
For each instance in the first plurality of instances, the first position of the first vehicle may indicate a longitudinal position and lateral position of the first vehicle.
For each instance in the first plurality of instances, the second position of the virtual camera may indicate a longitudinal position and a lateral position of the virtual camera within a road lane and a height of the virtual camera.
According to yet another broad aspect, the present disclosure describes a method for training a model for determining a distance between a first vehicle and second vehicle comprising: accessing image data, the image data including at least a first set of images, each image in the first set of images including a representation of a respective first vehicle from a perspective of a second respective vehicle behind the first respective vehicle, and each image in the first set of images associated with a distance label indicating a distance between the respective first vehicle and the respective second vehicle; evaluating a following distance loss function for each image in the first set of images, the following distance loss function including a first term representing a difference between a distance indicated in a respective distance label and a determined distance between the first vehicle and the second vehicle by the model for each respective image; and training the model by minimizing the following distance loss function over the first set of images.
Each image in the first set of images may be further associated with a vehicle presence label indicating whether the first vehicle is within a vehicle presence threshold of the second vehicle. The following distance loss function may further include a second term representing a difference between the vehicle presence label and a determined vehicle presence for each respective image.
The method may further comprise determining, for each image in the first set of images, whether the first vehicle is within a vehicle presence threshold of the second vehicle, and generating a vehicle presence label associated with each image indicating whether the first vehicle is within the vehicle presence threshold of the second vehicle. The following distance loss function may further include a second term representing a difference between the vehicle presence label and a determined vehicle presence for each respective image.
The method may further comprise determining whether auxiliary criteria are satisfied over the first set of images; and further evaluating the following distance loss function for at least one image in the first set of images, if the auxiliary criteria are not satisfied. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images.
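By way of illustration only, a following distance loss function of the general form described above might be written in PyTorch as sketched below. The choice of a smooth L1 distance term and a binary cross-entropy vehicle-presence term is an assumption for the example; the disclosure only requires terms representing the respective differences.

```python
# Sketch of a two-term following distance loss: a distance term and a presence term.
import torch
import torch.nn.functional as F

def following_distance_loss(pred_distance: torch.Tensor,
                            pred_presence_logit: torch.Tensor,
                            label_distance: torch.Tensor,
                            label_presence: torch.Tensor,
                            presence_weight: float = 1.0) -> torch.Tensor:
    # First term: difference between the labeled distance and the model's determined distance.
    distance_term = F.smooth_l1_loss(pred_distance, label_distance)
    # Second term: difference between the vehicle presence label (0.0 or 1.0) and the
    # model's determined vehicle presence.
    presence_term = F.binary_cross_entropy_with_logits(pred_presence_logit, label_presence)
    return distance_term + presence_weight * presence_term
```

Training then proceeds by minimizing this loss over the first set of images with a standard optimizer, subject to whatever auxiliary criteria are imposed.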
According to yet another broad aspect, the present disclosure describes a system for training a model for determining a distance between a first vehicle and second vehicle, the system comprising: at least one processor; at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to: access image data, the image data including at least a first set of images, each image in the first set of images including a representation of a respective first vehicle from a perspective of a second respective vehicle behind the first respective vehicle, and each image in the first set of images associated with a distance label indicating a distance between the respective first vehicle and the respective second vehicle; evaluate a following distance loss function for each image in the first set of images, the following distance loss function including a first term representing a difference between a distance indicated in a respective distance label and a determined distance between the first vehicle and the second vehicle by the model for each respective image; and train the model by minimizing the following distance loss function over the first set of images.
Each image in the first set of images may be further associated with a vehicle presence label indicating whether the first vehicle is within the vehicle presence threshold of the second vehicle. The following distance loss function may further include a second term representing a difference between the vehicle presence label and a determined vehicle presence for each respective image.
The processor-executable instructions may further cause the system to: determine, for each image in the first set of images, whether the first vehicle is within a vehicle presence threshold of the second vehicle, and generate a vehicle presence label associated with each image indicating whether the first vehicle is within the vehicle presence threshold of the second vehicle. The following distance loss function may further include a second term representing a difference between the vehicle presence label and a determined vehicle presence for each respective image.
The processor-executable instructions may further cause the system to: determine whether auxiliary criteria are satisfied over the first set of images; and further evaluate the following distance loss function for at least one image in the first set of images, if the auxiliary criteria are not satisfied. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images.
According to yet another broad aspect, the present disclosure describes a method for identifying tailgating between a first vehicle and second vehicle comprising: accessing image data, the image data including at least one image, each image in the image data including a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle; applying, by at least one processor, a following distance determination model to determine a following distance between the first vehicle and the second vehicle; determining, by the at least one processor, whether the following distance is within a tailgating distance criteria; identifying, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and identifying, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria.
The method may further comprise: identifying, by the at least one processor, a left-distance indicating a horizontal distance of the first vehicle from a left boundary of the at least one image; identifying, by the at least one processor, a right-distance indicating a horizontal distance of the first vehicle from a right boundary of the at least one image; determining, by the at least one processor, a difference between the left-distance and the right-distance; determining, by the at least one processor, whether the difference between the left-distance and the right-distance is within a horizontal distance criteria, wherein the tailgating criteria includes the difference between the left-distance and the right-distance being within the horizontal distance criteria; and identifying, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the determined difference is outside of the horizontal distance criteria.
Identifying the left-distance may comprise identifying, by the at least one processor, a horizontal distance between a left edge of a bounding box delineating the first vehicle in the at least one image and a left edge of the at least one image; and identifying the right-distance may comprise identifying, by the at least one processor, a horizontal distance between a right edge of a bounding box delineating the first vehicle in the at least one image and a right edge of the at least one image.
Accessing the image data may comprise capturing, by at least one image capture device, the image data.
Accessing the image data may comprise receiving, by at least one communication interface communicatively coupled to the at least one processor, the image data.
Accessing the image data may comprise accessing the image data as stored in at least one non-transitory processor-readable storage medium.
Determining whether the following distance is within the tailgating distance criteria may comprise: determining a first stopping distance for the first vehicle; determining a second stopping distance for the second vehicle; determining that the following distance is within the tailgating distance criteria if the second stopping distance is greater than the first stopping distance plus the following distance; and determining that the following distance is not within the tailgating distance criteria if the second stopping distance is not greater than the first stopping distance plus the following distance. Determining the first stopping distance may comprise estimating the first stopping distance as a minimum distance for the first vehicle to stop; and determining the second stopping distance may comprise estimating the second stopping distance as a maximum distance for the second vehicle to stop.
Determining whether the determined following distance is within the tailgating distance criteria may comprise determining whether the following distance is within a tailgating distance threshold. The tailgating distance threshold may represent a safe following distance limit as a function of speed of the second vehicle.
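For illustration, the sketch below combines the stopping-distance comparison and the horizontal-distance check described above into a single tailgating test. The reaction times, deceleration rates, and horizontal tolerance are assumed example values, not values specified by the disclosure.

```python
# Hedged sketch of a tailgating test based on stopping distances and horizontal centring.
def stopping_distance(speed_mps: float, reaction_time_s: float, decel_mps2: float) -> float:
    """Reaction distance plus braking distance for a given speed."""
    return speed_mps * reaction_time_s + speed_mps ** 2 / (2.0 * decel_mps2)

def is_tailgating(following_distance_m: float,
                  lead_speed_mps: float,
                  follower_speed_mps: float,
                  left_dist_px: float,
                  right_dist_px: float,
                  horizontal_tolerance_px: float = 80.0) -> bool:
    # Horizontal distance criteria: the lead vehicle should be roughly centred ahead,
    # i.e. the difference between its left- and right-boundary distances stays small.
    if abs(left_dist_px - right_dist_px) > horizontal_tolerance_px:
        return False
    # Minimum stopping distance for the lead vehicle (no reaction delay, hard braking).
    lead_stop = stopping_distance(lead_speed_mps, reaction_time_s=0.0, decel_mps2=7.5)
    # Maximum stopping distance for the following vehicle (reaction delay, gentler braking).
    follower_stop = stopping_distance(follower_speed_mps, reaction_time_s=1.5, decel_mps2=4.5)
    # Tailgating distance criteria: tailgating if the follower cannot stop within the
    # lead vehicle's stopping distance plus the current following distance.
    return follower_stop > lead_stop + following_distance_m
```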
According to yet another broad aspect, the present disclosure describes a system for identifying tailgating between a first vehicle and second vehicle, the system comprising: at least one processor; at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to: access image data, the image data including at least one image, each image in the image data including a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle; apply, by the at least one processor, a following distance determination model to determine a following distance between the first vehicle and the second vehicle; determine, by the at least one processor, whether the following distance is within a tailgating distance criteria; identify, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and identify, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria.
The processor-executable instructions may further cause the system to: identify, by the at least one processor, a left-distance indicating a horizontal distance of the first vehicle from a left boundary of the at least one image; identify, by the at least one processor, a right-distance indicating a horizontal distance of the first vehicle from a right boundary of the at least one image; determine, by the at least one processor, a difference between the left-distance and the right-distance; determine, by the at least one processor, whether the difference between the left-distance and the right-distance is within a horizontal distance criteria, wherein the tailgating criteria includes the difference between the left-distance and the right-distance being within the horizontal distance criteria; and identify, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the determined difference is outside of the horizontal distance criteria.
The processor-executable instructions which cause the system to identify the left-distance may cause the at least one processor to: identify a horizontal distance between a left edge of a bounding box delineating the first vehicle in the at least one image and a left edge of the at least one image; and the processor-executable instructions which cause the system to identify the right-distance cause the at least one processor to: identify a horizontal distance between a right edge of a bounding box delineating the first vehicle in the at least one image and a right edge of the at least one image.
The processor-executable instructions which cause the system to access the image data may cause at least one image capture device of the system to capture the image data.
The processor-executable instructions which cause the system to access the image data may cause the system to receive, by at least one communication interface of the system communicatively coupled to the at least one processor, the image data.
The processor-executable instructions which cause the system to access the image data may cause the system to access the image data as stored in the at least one non-transitory processor-readable storage medium.
The processor-executable instructions which cause the system to determine whether the following distance is within the tailgating distance criteria may cause the at least one processor to: determine a first stopping distance for the first vehicle; determine a second stopping distance for the second vehicle; determine that the following distance is within the tailgating distance criteria if the second stopping distance is greater than the first stopping distance plus the following distance; and determine that the following distance is not within the tailgating distance criteria if the second stopping distance is not greater than the first stopping distance plus the following distance. The processor-executable instructions which cause the at least one processor to determine the first stopping distance may cause the at least one processor to estimate the first stopping distance as a minimum distance for the first vehicle to stop; and the processor-executable instructions which cause the at least one processor to determine the second stopping distance may cause the at least one processor to estimate the second stopping distance as a maximum distance for the second vehicle to stop.
The processor-executable instructions which cause the system to determine whether the following distance is within the tailgating distance criteria may cause the at least one processor to: determine whether the following distance is within a tailgating distance threshold. The tailgating distance threshold may represent a safe following distance limit as a function of speed of the second vehicle.
According to yet another broad aspect, the present disclosure describes a system for determining a following distance between a first vehicle and a second vehicle, the system comprising: at least one processor; and at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to: access an image including a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle; determine, by the at least one processor, at least one image attribute based on a positional measure between the first vehicle as represented in the image and at least one boundary of the image; and apply, by the at least one processor, a following distance determination model to determine a following distance based on the determined at least one image attribute.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including a first distance from a bottom boundary of the image to a bottom of the first vehicle as represented in the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may further cause the at least one processor to determine the at least one image attribute as further including a second distance from a top boundary of the image to a top of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including a cumulative measure of: a first positional measure between a first corner of the image and a first corner of the first vehicle as represented in the image; and a second positional measure between a second corner of the image and a second corner of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including: a first cumulative measure of: a first positional measure between a bottom-left corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a second positional measure between a bottom-right corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a second cumulative measure of: a third positional measure between a top-left corner of the image and a top-left corner of the first vehicle as represented in the image; and a fourth positional measure between a top-right corner of the image and a top-right corner of the first vehicle as represented in the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as further including: a third cumulative measure of: a fifth positional measure between a bottom-left corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a sixth positional measure between a bottom-right corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a fourth cumulative measure of: a seventh positional measure between a top-left corner of the image and a top-right corner of the first vehicle as represented in the image; and an eighth positional measure between a top-right corner of the image and a top-left corner of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute based on respective positional measures between each corner of the image and each corner of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including a first positional measure of a first corner of the first vehicle as represented in the image from a first corner of the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as further including a second positional measure of a second corner of the first vehicle as represented in the image from the first corner of the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as further including a scale measure of a size of the first vehicle as represented in the image.
The processor-executable instructions may further cause the system to: determine, by the at least one processor, whether the following distance is within a tailgating distance criteria; identify, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and identify, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria. The processor-executable instructions may further cause the system to, in response to an identification that the second vehicle is tailgating the first vehicle, output a tailgating indication to a driver of the second vehicle. The processor-executable instructions may further cause the system to, in response to an identification that the second vehicle is tailgating the first vehicle, output a tailgating indication to a management server.
The processor-executable instructions which cause the system to access the image may cause at least one image capture device of the system to capture the image.
The processor-executable instructions which cause the system to access the image may cause the system to receive, by at least one communication interface of the system communicatively coupled to the at least one processor, the image.
The processor-executable instructions which cause the system to access the image may cause the system to access the image as stored in the at least one non-transitory processor-readable storage medium.
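As a non-limiting example, image attributes of the kind described above might be computed from a detected bounding box as sketched below, before being passed to a following distance determination model; the particular selection and ordering of attributes is an assumption for illustration.

```python
# Sketch of corner-based image attributes derived from a lead-vehicle bounding box.
import math

def image_attributes(img_w: int, img_h: int, box: tuple) -> list:
    """box = (left, top, right, bottom) of the first vehicle in pixel coordinates."""
    left, top, right, bottom = box

    def dist(p, q):
        return math.hypot(p[0] - q[0], p[1] - q[1])

    # Cumulative measure of bottom-corner distances (image corner to same-side vehicle corner).
    bottom_measure = (dist((0, img_h), (left, bottom)) +
                      dist((img_w, img_h), (right, bottom)))
    # Cumulative measure of top-corner distances.
    top_measure = (dist((0, 0), (left, top)) +
                   dist((img_w, 0), (right, top)))
    # Vertical distances from the image boundaries to the vehicle, and a scale measure
    # of the vehicle's size relative to the image.
    bottom_gap = img_h - bottom
    top_gap = top
    scale = (right - left) * (bottom - top) / float(img_w * img_h)
    return [bottom_measure, top_measure, bottom_gap, top_gap, scale]
```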
According to yet another broad aspect, the present disclosure describes a method for determining a following distance between a first vehicle and a second vehicle, the method comprising: accessing an image including a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle; determining, by the at least one processor, at least one image attribute based on a positional measure between the first vehicle as represented in the image and at least one boundary of the image; and applying, by the at least one processor, a following distance determination model to determine a following distance based on the determined at least one image attribute.
Determining the at least one image attribute may comprise determining the at least one image attribute as including a first distance from a bottom boundary of the image to a bottom of the first vehicle as represented in the image. Determining the at least one image attribute may further comprise determining the at least one image attribute as further including a second distance from a top boundary of the image to a top of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute as including a cumulative measure of: a first positional measure between a first corner of the image and a first corner of the first vehicle as represented in the image; and a second positional measure between a second corner of the image and a second corner of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute as including: a first cumulative measure of: a first positional measure between a bottom-left corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a second positional measure between a bottom-right corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a second cumulative measure of: a third positional measure between a top-left corner of the image and a top-left corner of the first vehicle as represented in the image; and a fourth positional measure between a top-right corner of the image and a top-right corner of the first vehicle as represented in the image. Determining the at least one image attribute may comprise determining the at least one image attribute as further including: a third cumulative measure of: a fifth positional measure between a bottom-left corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a sixth positional measure between a bottom-right corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a fourth cumulative measure of: a seventh positional measure between a top-left corner of the image and a top-right corner of the first vehicle as represented in the image; and an eighth positional measure between a top-right corner of the image and a top-left corner of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute based on respective positional measures between each corner of the image and each corner of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute as including a first positional measure of a first corner of the first vehicle as represented in the image from a first corner of the image. Determining the at least one image attribute may comprise determining the at least one image attribute as further including a second positional measure of a second corner of the first vehicle as represented in the image from the first corner of the image. Determining the at least one image attribute may comprise determining the at least one image attribute as further including a scale measure of a size of the first vehicle as represented in the image.
The method may further comprise: determining, by the at least one processor, whether the following distance is within a tailgating distance criteria; identifying, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and identifying, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria. The method may further comprise in response to an identification that the second vehicle is tailgating the first vehicle, outputting a tailgating indication to a driver of the second vehicle. The method may further comprise in response to an identification that the second vehicle is tailgating the first vehicle, outputting a tailgating indication to a management server.
Accessing the image may comprise at least one image capture device of the system capturing the image. Accessing the image may comprise receiving, by at least one communication interface of the system communicatively coupled to the at least one processor, the image. Accessing the image may comprise accessing the image as stored in the at least one non-transitory processor-readable storage medium.
According to yet another broad aspect, the present disclosure describes a system for training a model for determining a distance between a first vehicle and second vehicle, the system comprising: at least one processor; at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to: access image data, the image data including at least a first set of images, each image in the first set of images including a representation of a respective first vehicle from a perspective of a second respective vehicle behind the first respective vehicle, and each image in the first set of images associated with a distance label indicating a distance between the respective first vehicle and the respective second vehicle; determine, by the at least one processor, for at least one image in the first set of images, at least one image attribute based on a positional measure between the first vehicle as represented in the image and at least one boundary of the image; evaluate a following distance loss function for the at least one image in the first set of images, the following distance loss function representing a difference between a distance indicated in a respective distance label and a determined distance between the first vehicle and the second vehicle by the model for each respective image; and train the model by minimizing the following distance loss function over the first set of images.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including a first distance from a bottom boundary of the image to a bottom of the first vehicle as represented in the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may further cause the at least one processor to determine the at least one image attribute as further including a second distance from a top boundary of the image to a top of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including a cumulative measure of: a first positional measure between a first corner of the image and a first corner of the first vehicle as represented in the image; and a second positional measure between a second corner of the image and a second corner of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including: a first cumulative measure of: a first positional measure between a bottom-left corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a second positional measure between a bottom-right corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a second cumulative measure of: a third positional measure between a top-left corner of the image and a top-left corner of the first vehicle as represented in the image; and a fourth positional measure between a top-right corner of the image and a top-right corner of the first vehicle as represented in the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as further including: a third cumulative measure of: a fifth positional measure between a bottom-left corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a sixth positional measure between a bottom-right corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a fourth cumulative measure of: a seventh positional measure between a top-left corner of the image and a top-right corner of the first vehicle as represented in the image; and an eighth positional measure between a top-right corner of the image and a top-left corner of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute based on respective positional measures between each corner of the image and each corner of the first vehicle as represented in the image.
The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as including a first positional measure of a first corner of the first vehicle as represented in the image from a first corner of the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as further including a second positional measure of a second corner of the first vehicle as represented in the image from the first corner of the image. The processor-executable instructions which cause the at least one processor to determine the at least one image attribute may cause the at least one processor to determine the at least one image attribute as further including a scale measure of a size of the first vehicle as represented in the image.
The processor-executable instructions may further cause the system to: determine whether auxiliary criteria are satisfied over the first set of images; and further evaluate the following distance loss function for at least one image in the first set of images, if the auxiliary criteria are not satisfied. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images.
According to yet another broad aspect, the present disclosure describes a method for training a model for determining a distance between a first vehicle and a second vehicle, the method comprising: accessing image data, the image data including at least a first set of images, each image in the first set of images including a representation of a respective first vehicle from a perspective of a respective second vehicle behind the respective first vehicle, and each image in the first set of images associated with a distance label indicating a distance between the respective first vehicle and the respective second vehicle; determining, by at least one processor, for at least one image in the first set of images, at least one image attribute based on a positional measure between the first vehicle as represented in the image and at least one boundary of the image; evaluating a following distance loss function for the at least one image in the first set of images, the following distance loss function representing a difference between a distance indicated in a respective distance label and a distance between the first vehicle and the second vehicle as determined by the model for each respective image; and training the model by minimizing the following distance loss function over the first set of images.
Determining the at least one image attribute may comprise determining the at least one image attribute as including a first distance from a bottom boundary of the image to a bottom of the first vehicle as represented in the image. Determining the at least one image attribute may further comprise determining the at least one image attribute as further including a second distance from a top boundary of the image to a top of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute as including a cumulative measure of: a first positional measure between a first corner of the image and a first corner of the first vehicle as represented in the image; and a second positional measure between a second corner of the image and a second corner of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute as including: a first cumulative measure of: a first positional measure between a bottom-left corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a second positional measure between a bottom-right corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a second cumulative measure of: a third positional measure between a top-left corner of the image and a top-left corner of the first vehicle as represented in the image; and a fourth positional measure between a top-right corner of the image and a top-right corner of the first vehicle as represented in the image. Determining the at least one image attribute may comprise determining the at least one image attribute as further including: a third cumulative measure of: a fifth positional measure between a bottom-left corner of the image and a bottom-right corner of the first vehicle as represented in the image; and a sixth positional measure between a bottom-right corner of the image and a bottom-left corner of the first vehicle as represented in the image; and a fourth cumulative measure of: a seventh positional measure between a top-left corner of the image and a top-right corner of the first vehicle as represented in the image; and an eighth positional measure between a top-right corner of the image and a top-left corner of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute based on respective positional measures between each corner of the image and each corner of the first vehicle as represented in the image.
Determining the at least one image attribute may comprise determining the at least one image attribute as including a first positional measure of a first corner of the first vehicle as represented in the image from a first corner of the image. Determining the at least one image attribute may comprise determining the at least one image attribute as further including a second positional measure of a second corner of the first vehicle as represented in the image from the first corner of the image. Determining the at least one image attribute may comprise determining the at least one image attribute as further including a scale measure of a size of the first vehicle as represented in the image.
The method may further comprise: determining whether auxiliary criteria are satisfied over the first set of images; and further evaluating the following distance loss function for at least one image in the first set of images, if the auxiliary criteria are not satisfied. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be within a maximum loss threshold for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for each image in the first set of images. The auxiliary criteria may require that the following distance loss function be evaluated for a defined quantity of images in the first set of images, where the defined quantity of images is smaller than a total quantity of images in the first set of images.
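Purely for illustration, training a model by minimizing a following distance loss function could be sketched as follows, assuming the image attributes are collected into a fixed-length feature vector per image, the model is a small regression network, and the loss is mean squared error against the distance labels; the framework (PyTorch), the network shape, and the auxiliary criterion shown are assumptions for this sketch rather than the specific model or loss of the present disclosure.

```python
import torch
from torch import nn

# Hypothetical regression model mapping per-image attributes (here assumed to be
# a 4-element feature vector) to a following distance in metres.
model = nn.Sequential(nn.Linear(4, 32), nn.ReLU(), nn.Linear(32, 1))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()  # one possible "following distance loss function"

def train_step(attributes, distance_labels, max_loss=1.0):
    """attributes: (N, 4) tensor of image attributes for the first set of images;
    distance_labels: (N, 1) tensor of labelled following distances (metres)."""
    optimizer.zero_grad()
    predicted = model(attributes)
    loss = loss_fn(predicted, distance_labels)
    loss.backward()
    optimizer.step()
    # One variant of the auxiliary criteria: the per-image loss must be within a
    # maximum loss threshold for every image in the set.
    per_image_loss = (predicted.detach() - distance_labels) ** 2
    criteria_satisfied = bool((per_image_loss <= max_loss).all())
    return loss.item(), criteria_satisfied
```

In such a sketch, training would repeat steps of this kind (further evaluating the loss function for images in the set) until the auxiliary criteria are satisfied.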
According to yet another broad aspect, the present disclosure describes a system for determining a following distance between a first vehicle and a second vehicle, the system comprising: at least one processor; and at least one non-transitory processor-readable storage medium communicatively coupled to the at least one processor and storing processor-executable instructions which when executed by the at least one processor cause the system to: access an image including a representation of the first vehicle from a perspective of the second vehicle behind the first vehicle, the image further representing a common lane of travel of the first vehicle and the second vehicle; determine, by the at least one processor, a first vertical position in the image representing a bottom of the first vehicle; access, by the at least one processor, a second vertical position in the image representing a static physical distance from the second vehicle; determine a transformed first vertical position by applying, by the at least one processor, an image transformation matrix to the first vertical position; determine a transformed second vertical position by applying, by the at least one processor, the image transformation matrix to the second vertical position; determine an image distance between the transformed first vertical position and the transformed second vertical position; determine the following distance as a physical distance between the first vehicle and the second vehicle based on the determined image distance and the static physical distance; and output the determined following distance.
The first vertical position may represent a bottom boundary of a bounding box which encompasses the first vehicle. The second vertical position may represent a distal end of a hood of the second vehicle as represented in the image.
The image transformation matrix may represent a transformation of the image to a transformed image having a fixed relationship between pixel size in the transformed image and physical distance represented by the transformed image.
The processor-executable instructions may further cause the system to: determine a first boundary of the common lane of travel; determine a second boundary of the common lane of travel; determine a vanishing point for the image as a point where the first boundary and the second boundary intersect; access a region of interest in the image, where a bottom edge and a top edge of the region of interest are parallel to a horizontal axis of the image, a left edge of the region of interest extends from a first point left of the common lane of travel towards the vanishing point, and a right edge of the region of interest extends from a second point right of the common lane of travel towards the vanishing point; determine a transformed region of interest where a left edge of the transformed region of interest is parallel to a right edge of the transformed region of interest; and determine the image transformation matrix as a matrix which when applied to the region of interest, transforms the region of interest to the determined transformed region of interest. The first point may be at a left boundary of the image and the second point may be at a right boundary of the image. The bottom edge of the region of interest may be positioned above a hood of the second vehicle as represented in the image, and the top edge of the region of interest may be positioned below the vanishing point. The processor-executable instructions which cause the system to access the region of interest in the image may cause the system to receive an indication of the region of interest as user input via a user input device. The processor-executable instructions which cause the system to access the region of interest in the image may cause the at least one processor to generate boundaries of the region of interest. The processor-executable instructions which cause the system to determine the transformed region of interest may cause the at least one processor to determine four corner points of a rectangle corresponding to the transformed region of interest. The processor-executable instructions may further cause the at least one processor to: determine a ratio of image distance to physical distance for a vertical axis of the image as transformed according to the image transformation matrix, based on a received measurement for a physical distance coaxial to the common lane and an image distance coaxial to the common lane as represented in the transformed region of interest. The processor-executable instructions may further cause the at least one processor to: determine a ratio of image distance to physical distance for a horizontal axis of the image as transformed according to the image transformation matrix, based on a received measurement for a physical width of the common lane and an image distance width of the common lane as represented in the transformed region of interest. The processor-executable instructions may further cause the at least one processor to: determine a ratio of image distance to physical distance for a vertical axis of the image as transformed according to the image transformation matrix, based on the ratio of image distance to physical distance for the horizontal axis of the image and based on the image transformation matrix.
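As a non-limiting illustration, one way to obtain an image transformation matrix of the kind described above is a perspective ("bird's-eye view") transform, for example using OpenCV, in which the trapezoidal region of interest is mapped to a rectangle so that the lane boundaries become parallel; all coordinates below are placeholder values rather than calibrated measurements.

```python
import cv2
import numpy as np

# Trapezoidal region of interest in the source image: the bottom edge sits above
# the hood, the top edge sits below the vanishing point, and the left/right
# edges extend towards the vanishing point. Coordinates are placeholders.
roi_corners = np.float32([
    [100, 700],   # bottom-left
    [1180, 700],  # bottom-right
    [760, 450],   # top-right
    [520, 450],   # top-left
])
# Transformed region of interest: a rectangle, so the left and right edges
# (and hence the lane boundaries) become parallel.
target_corners = np.float32([
    [0, 600],     # bottom-left
    [400, 600],   # bottom-right
    [400, 0],     # top-right
    [0, 0],       # top-left
])
transform_matrix = cv2.getPerspectiveTransform(roi_corners, target_corners)

def transform_vertical_position(y, x=640.0):
    """Apply the matrix to a point at vertical pixel position y and return the
    transformed vertical position."""
    point = np.float32([[[x, y]]])
    return float(cv2.perspectiveTransform(point, transform_matrix)[0, 0, 1])
```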
The static physical distance may be received as a physical measurement between a front of the second vehicle and content represented in the image at the second vertical position.
The processor-executable instructions may further cause the system to: determine, by the at least one processor, whether the following distance is within a tailgating distance criteria; identify, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and identify, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria. The processor-executable instructions may further cause the system to, in response to an identification that the second vehicle is tailgating the first vehicle, output a tailgating indication to a driver of the second vehicle. The processor-executable instructions may further cause the system to, in response to an identification that the second vehicle is tailgating the first vehicle, output a tailgating indication to a management server.
The processor-executable instructions which cause the system to access the image may cause at least one image capture device of the system to capture the image.
The processor-executable instructions which cause the system to access the image may cause the system to receive, by at least one communication interface of the system communicatively coupled to the at least one processor, the image.
The processor-executable instructions which cause the system to access the image may cause the system to access the image as stored in the at least one non-transitory processor-readable storage medium.
According to yet another broad aspect, the present disclosure describes a method for determining a following distance between a first vehicle and a second vehicle, the method comprising: accessing an image including a representation of the first vehicle from a perspective of the second vehicle behind the first vehicle, the image further representing a common lane of travel of the first vehicle and the second vehicle; determining, by at least one processor, a first vertical position in the image representing a bottom of the first vehicle; accessing, by the at least one processor, a second vertical position in the image representing a static physical distance from the second vehicle; determining a transformed first vertical position by applying, by the at least one processor, an image transformation matrix to the first vertical position; determining a transformed second vertical position by applying, by the at least one processor, the image transformation matrix to the second vertical position; determining an image distance between the transformed first vertical position and the transformed second vertical position; determining the following distance as a physical distance between the first vehicle and the second vehicle based on the determined image distance and the static physical distance; and outputting the determined following distance.
The first vertical position may represent a bottom boundary of a bounding box which encompasses the first vehicle. The second vertical position may represent a distal end of a hood of the second vehicle as represented in the image.
The image transformation matrix may represent a transformation of the image to a transformed image having a fixed relationship between pixel size in the transformed image and physical distance represented by the transformed image.
The method may further comprise: determining a first boundary of the common lane of travel; determining a second boundary of the common lane of travel; determining a vanishing point for the image as a point where the first boundary and the second boundary intersect; accessing a region of interest in the image, where a bottom edge and a top edge of the region of interest are parallel to a horizontal axis of the image, a left edge of the region of interest extends from a first point left of the common lane of travel towards the vanishing point, and a right edge of the region of interest extends from a second point right of the common lane of travel towards the vanishing point; determining a transformed region of interest where a left edge of the transformed region of interest is parallel to a right edge of the transformed region of interest; and determining the image transformation matrix as a matrix which when applied to the region of interest, transforms the region of interest to the determined transformed region of interest. The first point may be at a left boundary of the image and the second point may be at a right boundary of the image. The bottom edge of the region of interest may be positioned above a hood of the second vehicle as represented in the image, and the top edge of the region of interest may be positioned below the vanishing point. Accessing the region of interest in the image may comprise receiving an indication of the region of interest as user input via a user input device. Accessing the region of interest in the image may comprise generating, by the at least one processor, boundaries of the region of interest. Determining the transformed region of interest may comprise determining four corner points of a rectangle corresponding to the transformed region of interest. The method may further comprise: determining a ratio of image distance to physical distance for a vertical axis of the image as transformed according to the image transformation matrix, based on a received measurement for a physical distance coaxial to the common lane and an image distance coaxial to the common lane as represented in the transformed region of interest. The method may further comprise: determining a ratio of image distance to physical distance for a horizontal axis of the image as transformed according to the image transformation matrix, based on a received measurement for a physical width of the common lane and an image distance width of the common lane as represented in the transformed region of interest. The method may further comprise: determining a ratio of image distance to physical distance for a vertical axis of the image as transformed according to the image transformation matrix, based on the ratio of image distance to physical distance for the horizontal axis of the image and based on the image transformation matrix.
The static physical distance may be received as a physical measurement between a front of the second vehicle and content represented in the image at the second vertical position.
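By way of non-limiting illustration, once a vertical ratio of physical distance to image distance has been calibrated for the transformed image, the following distance could be computed as in the sketch below; it assumes the bottom of the first (lead) vehicle appears above the reference row associated with the static physical distance, and the function and parameter names are illustrative only.

```python
def following_distance(transformed_y_vehicle, transformed_y_reference,
                       metres_per_pixel, static_physical_distance_m):
    """Estimate the physical gap between the two vehicles.

    transformed_y_vehicle: transformed vertical position of the bottom of the
        first (lead) vehicle.
    transformed_y_reference: transformed vertical position associated with the
        static physical distance from the second (following) vehicle.
    metres_per_pixel: calibrated vertical ratio for the transformed image.
    static_physical_distance_m: measured distance represented at the reference
        position (e.g. from the front of the second vehicle)."""
    image_distance = abs(transformed_y_reference - transformed_y_vehicle)
    return image_distance * metres_per_pixel + static_physical_distance_m
```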
The method may further comprise: determining, by the at least one processor, whether the following distance is within a tailgating distance criteria; identifying, by the at least one processor, that the second vehicle is not tailgating the first vehicle if the following distance is outside of the tailgating distance criteria; and identifying, by the at least one processor, that the second vehicle is tailgating the first vehicle if tailgating criteria are met, wherein the tailgating criteria includes the following distance being within the tailgating distance criteria. The method may further comprise, in response to an identification that the second vehicle is tailgating the first vehicle, outputting a tailgating indication to a driver of the second vehicle. The method may further comprise, in response to an identification that the second vehicle is tailgating the first vehicle, outputting a tailgating indication to a management server.
Accessing the image may comprise capturing the image by at least one image capture device of the system.
Accessing the image may comprise receiving, by at least one communication interface of the system communicatively coupled to the at least one processor, the image.
Accessing the image may comprise accessing the image as stored in at least one non-transitory processor-readable storage medium of the system.
According to yet another broad aspect, the present disclosure describes a method for generating following distance data over time for a subject vehicle, the method comprising: accessing a set of images, each image of the set of images being from a perspective of the subject vehicle and including a respective representation of another vehicle positioned in front of the subject vehicle, and each image of the set of images being associated with a respective time; generating following distance data based on at least a subset of images in the set of images, by: for each image in the subset of images, determining a respective physical distance between the subject vehicle and the other vehicle; and compiling the following distance data over time as a plurality of following distance data points, each following distance data point representing the determined physical distance for a respective image in the subset of images and the time associated with the respective image in the subset of images; generating simplified following distance data, by: identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data; and compiling the select data points as the simplified following distance data, excluding data points which are not identified as select data points; and outputting the simplified following distance data.
Identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may comprise, for each data point of the following distance data: determining a minimum difference between the data point and a corresponding reference line of the iteratively-defined reference lines; and identifying the data point as a select data point if the minimum difference between the data point and the corresponding reference line exceeds a difference threshold.
Identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may comprise: defining a reference line through the following distance data; identifying a candidate data point of the following distance data, where a minimum difference between the candidate data point and the reference line is greater than a minimum difference between other data points and the reference line; and if the minimum difference between the candidate data point and the reference line exceeds a difference threshold: identifying the candidate data point as a select data point for inclusion in the simplified following distance data; and defining new reference lines which intersect the candidate data point through the following distance data. Identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may further comprise: defining an additional reference line through the following distance data which intersects a select data point previously identified for inclusion in the simplified following distance data; identifying an additional candidate data point of the following distance data, where a minimum difference between the additional candidate data point and the additional reference line is greater than a minimum difference between other data points and the additional reference line; and if the minimum difference between the additional candidate data point and the additional reference line exceeds the difference threshold: identifying the additional candidate data point as an additional select data point for inclusion in the simplified following distance data.
The following distance data may comprise at least one data series of following distance data points; identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may comprise, for each data series in the following distance data, until all data points in each data series are within a distance threshold of a corresponding reference line: identifying a first data point in the data series and a last data point in the data series as select data points for inclusion in the simplified following distance data; defining a reference line between the first data point in the data series and the last data point in the data series; identifying a candidate data point between the first data point in the data series and the last data point in the data series, where a minimum difference between the candidate data point and the reference line is greater than a minimum difference between other data points in the data series and the reference line; if the minimum difference between the candidate data point and the reference line exceeds a difference threshold: identifying the candidate data point as a select data point for inclusion in the simplified following distance data; defining a component data series between the first data point in the data series and the candidate data point; and defining another component data series between the candidate data point and the last data point in the data series; and if the minimum difference between the candidate data point and the reference line does not exceed the difference threshold: excluding each data point of the data series between the first data point and the last data point from inclusion in the simplified following distance data.
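The iterative reference-line simplification described above closely resembles the Ramer-Douglas-Peucker algorithm. The following is a minimal sketch under the assumptions that each data series is a list of (time, following distance) points and that the "difference" between a data point and a reference line is the perpendicular distance to that line; the names and threshold are illustrative only.

```python
import math

def _point_line_difference(p, a, b):
    """Perpendicular distance from point p to the reference line through a and b."""
    (x0, y0), (x1, y1), (x2, y2) = p, a, b
    if (x1, y1) == (x2, y2):
        return math.hypot(x0 - x1, y0 - y1)
    numerator = abs((y2 - y1) * x0 - (x2 - x1) * y0 + x2 * y1 - y2 * x1)
    return numerator / math.hypot(y2 - y1, x2 - x1)

def simplify(series, threshold):
    """Reduce a data series of (time, distance) points, keeping the first and
    last points plus any candidate point whose difference from the current
    reference line exceeds the threshold (Ramer-Douglas-Peucker style)."""
    if len(series) <= 2:
        return list(series)
    first, last = series[0], series[-1]
    diffs = [_point_line_difference(p, first, last) for p in series[1:-1]]
    max_diff = max(diffs)
    candidate_index = diffs.index(max_diff) + 1
    if max_diff > threshold:
        # Split into component data series around the candidate point and recurse.
        left = simplify(series[:candidate_index + 1], threshold)
        right = simplify(series[candidate_index:], threshold)
        return left[:-1] + right  # avoid duplicating the candidate point
    # No intermediate point differs enough: exclude everything between the ends.
    return [first, last]
```

In this sketch, data points whose difference from the reference line never exceeds the threshold are excluded, mirroring the exclusion described above.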
The method may further comprise: accessing a plurality of images captured by an image capture device positioned at the subject vehicle; and identifying the set of images as images of the plurality of images which include another vehicle positioned in front of the subject vehicle.
The method may further comprise: accessing a plurality of images captured by an image capture device positioned at the subject vehicle; and identifying the set of images as images of the plurality of images which include another vehicle positioned in front of the subject vehicle and in a common lane of travel with the subject vehicle.
For each image in the subset of images, determining the respective physical distance between the subject vehicle and the other vehicle may comprise: applying a following distance determination model to the respective image, the following distance determination model being a machine learning model trained based on minimization of a following distance loss function for a training set of images representing a respective lead vehicle from a perspective of a following vehicle associated with a label indicating following distance between the lead vehicle and the following vehicle.
For each image in the subset of images, determining the respective physical distance between the subject vehicle and the other vehicle may comprise: determining, by the at least one processor, at least one image attribute based on a positional measure between the other vehicle as represented in the respective image and at least one boundary of the image; and applying, by the at least one processor, a following distance determination model to determine a following distance based on the determined at least one image attribute.
For each image in the subset of images, determining the respective physical distance between the subject vehicle and the other vehicle may comprise: determining, by at least one processor, a first vertical position in the image representing a bottom of the other vehicle; accessing, by the at least one processor, a second vertical position in the image representing a static physical distance from the subject vehicle; determining a transformed first vertical position by applying, by the at least one processor, an image transformation matrix to the first vertical position; determining a transformed second vertical position by applying, by the at least one processor, the image transformation matrix to the second vertical position; determining an image distance between the transformed first vertical position and the transformed second vertical position; and determining the following distance as a physical distance between the other vehicle and the subject vehicle based on the determined image distance and the static physical distance.
Outputting the simplified following distance data may comprise transmitting, by a communication interface positioned at the subject vehicle, the simplified following distance data to a device remote from the subject vehicle. The method may further comprise presenting, by a user interface of the device remote from the subject vehicle, at least a portion of the following distance data.
The method may further comprise, for each data point in the simplified following distance data: determining, by the at least one processor, whether the respective physical distance between the subject vehicle and the other vehicle for the data point is within a tailgating distance criteria; identifying, by the at least one processor, that the subject vehicle is not tailgating the other vehicle at a time corresponding to the data point if the respective physical distance is outside of the tailgating distance criteria; and identifying, by the at least one processor, that the subject vehicle is tailgating the other vehicle at a time corresponding to the data point if tailgating criteria are met, wherein the tailgating criteria includes the respective physical distance being within the tailgating distance criteria. The method may further comprise, in response to an identification that the subject vehicle is tailgating the other vehicle, outputting a tailgating indication. Outputting the tailgating indication may comprise transmitting the tailgating indication to a management device. The method may further comprise: receiving, at a management device, respective simplified following distance data for a plurality of subject vehicles; receiving, at the management device, location data for the plurality of subject vehicles; accessing, by the management device, respective tailgating indications for the plurality of subject vehicles; and for each tailgating indication, associating a location of the respective subject vehicle as indicated in the location data, at a time of the tailgating indication, with a roadway segment corresponding to the location of the subject vehicle. The method may further comprise: quantifying, by at least one processor of the management device, an amount of tailgating indications for at least one roadway segment of interest; and identifying, by the at least one processor of the management device, at least one high-risk roadway segment, by identifying at least one roadway segment where an amount of associated tailgating indications exceeds a tailgating risk threshold.
The method may further comprise: receiving, by a management device, respective simplified following distance data for a plurality of subject vehicles; receiving, by the management device, respective location data for the plurality of subject vehicles; for each data point in the simplified following distance data for each subject vehicle of the plurality of subject vehicles: associating a location of the respective subject vehicle at a respective time of the data point with a roadway segment corresponding to the location of the respective vehicle at the respective time; and quantifying following distance for the plurality of subject vehicles for at least one roadway segment of interest.
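Purely as an illustrative sketch, associating tailgating indications with roadway segments and flagging high-risk segments at a management device could look like the following; the map-matching function is assumed to exist and is only a placeholder.

```python
from collections import Counter

def high_risk_segments(tailgating_events, locate_segment, risk_threshold):
    """tailgating_events: iterable of (vehicle_id, timestamp, location) tuples for
    a plurality of subject vehicles; locate_segment: placeholder callable that
    maps a location to a roadway segment identifier (e.g. via map matching)."""
    counts = Counter(locate_segment(location)
                     for _vehicle_id, _timestamp, location in tailgating_events)
    # Roadway segments where the amount of associated tailgating indications
    # exceeds the tailgating risk threshold.
    return {segment for segment, amount in counts.items() if amount > risk_threshold}
```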
According to yet another broad aspect, the present disclosure describes a system for generating following distance data over time for a subject vehicle, the system comprising: at least one processor; at least one non-transitory processor-readable storage medium storing processor-executable instructions which when executed by the at least one processor cause the system to: access a set of images, each image of the set of images being from a perspective of the subject vehicle and including a respective representation of another vehicle positioned in front of the subject vehicle, and each image of the set of images being associated with a respective time; generate following distance data based on at least a subset of images in the set of images, by: for each image in the subset of images, determining a respective physical distance between the subject vehicle and the other vehicle; and compiling the following distance data over time as a plurality of following distance data points, each following distance data point representing the determined physical distance for a respective image in the subset of images and the time associated with the respective image in the subset of images; generate simplified following distance data, by: identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data; and compiling the select data points as the simplified following distance data, excluding data points which are not identified as select data points; and output the simplified following distance data.
Identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may comprise, for each data point of the following distance data: determining a minimum difference between the data point and a corresponding reference line of the iteratively-defined reference lines; and identifying the data point as a select data point if the minimum difference between the data point and the corresponding reference line exceeds a difference threshold.
Identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may comprise: defining a reference line through the following distance data; identifying a candidate data point of the following distance data, where a minimum difference between the candidate data point and the reference line is greater than a minimum difference between other data points and the reference line; and if the minimum difference between the candidate data point and the reference line exceeds a difference threshold: identifying the candidate data point as a select data point for inclusion in the simplified following distance data; and defining new reference lines which intersect the candidate data point through the following distance data. Identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may further comprise: defining an additional reference line through the following distance data which intersects a select data point previously identified for inclusion in the simplified following distance data; identifying an additional candidate data point of the following distance data, where a minimum difference between the additional candidate data point and the additional reference line is greater than a minimum difference between other data points and the additional reference line; and if the minimum difference between the additional candidate data point and the additional reference line exceeds the difference threshold: identifying the additional candidate data point as an additional select data point for inclusion in the simplified following distance data.
The following distance data may comprise at least one data series of following distance data points; identifying select data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data, may comprise, for each data series in the following distance data, until all data points in each data series are within a distance threshold of a corresponding reference line: identifying a first data point in the data series and a last data point in the data series as select data points for inclusion in the simplified following distance data; defining a reference line between the first data point in the data series and the last data point in the data series; identifying a candidate data point between the first data point in the data series and the last data point in the data series, where a minimum difference between the candidate data point and the reference line is greater than a minimum difference between other data points in the data series and the reference line; if the minimum difference between the candidate data point and the reference line exceeds a difference threshold: identifying the candidate data point as a select data point for inclusion in the simplified following distance data; defining a component data series between the first data point in the data series and the candidate data point; and defining another component data series between the candidate data point and the last data point in the data series; and if the minimum difference between the candidate data point and the reference line does not exceed the difference threshold: excluding each data point of the data series between the first data point and the last data point from inclusion in the simplified following distance data.
The processor-executable instructions may further cause the system to: access a plurality of images captured by an image capture device positioned at the subject vehicle; and identify the set of images as images of the plurality of images which include the other vehicle positioned in front of the subject vehicle.
The processor-executable instructions may further cause the system to: access a plurality of images captured by an image capture device positioned at the subject vehicle; and identify the set of images as images of the plurality of images which include the other vehicle positioned in front of the subject vehicle and in a common lane of travel with the subject vehicle.
For each image in the subset of images, determining the respective physical distance between the subject vehicle and the other vehicle may comprise: applying a following distance determination model to the respective image, the following distance determination model being a machine learning model trained based on minimization of a following distance loss function for a training set of images representing a respective lead vehicle from a perspective of a following vehicle associated with a label indicating following distance between the lead vehicle and the following vehicle.
For each image in the subset of images, determining the respective physical distance between the subject vehicle and the other vehicle may comprise: determining, by the at least one processor, at least one image attribute based on a positional measure between the other vehicle as represented in the respective image and at least one boundary of the image; and applying, by the at least one processor, a following distance determination model to determine a following distance based on the determined at least one image attribute.
For each image in the subset of images, determining the respective physical distance between the subject vehicle and the other vehicle may comprise: determining, by the at least one processor, a first vertical position in the image representing a bottom of the other vehicle; accessing, by the at least one processor, a second vertical position in the image representing a static physical distance from the subject vehicle; determining a transformed first vertical position by applying, by the at least one processor, an image transformation matrix to the first vertical position; determining a transformed second vertical position by applying, by the at least one processor, the image transformation matrix to the second vertical position; determining an image distance between the transformed first vertical position and the transformed second vertical position; and determining the following distance as a physical distance between the other vehicle and the subject vehicle based on the determined image distance and the static physical distance.
The system may further comprise a communication interface positioned at the subject vehicle, and the processor-executable instructions which cause the system to output the simplified following distance data may cause the communication interface to transmit the simplified following distance data to a device remote from the subject vehicle. The system may further comprise a user interface of the device remote from the subject vehicle to present at least a portion of the following distance data.
The processor-executable instructions may further cause the system to, for each data point in the simplified following distance data: determine, by the at least one processor, whether the respective physical distance between the subject vehicle and the other vehicle for the data point is within a tailgating distance criteria; identify, by the at least one processor, that the subject vehicle is not tailgating the other vehicle at a time corresponding to the data point if the respective physical distance is outside of the tailgating distance criteria; and identify, by the at least one processor, that the subject vehicle is tailgating the other vehicle at a time corresponding to the data point if tailgating criteria are met, wherein the tailgating criteria includes the respective physical distance being within the tailgating distance criteria. The processor-executable instructions may further cause the system to, in response to an identification that the subject vehicle is tailgating the other vehicle, output a tailgating indication. The processor-executable instructions which cause the system to output the tailgating indication may cause a communication interface of the system to transmit the tailgating indication to a management device. The processor-executable instructions may further cause the system to: receive, at a management device of the system, respective simplified following distance data for a plurality of subject vehicles; receive, at the management device, location data for the plurality of subject vehicles; access, by the management device, respective tailgating indications for the plurality of subject vehicles; and for each tailgating indication, associate a location of the respective subject vehicle as indicated in the location data, at a time of the tailgating indication, with a roadway segment corresponding to the location of the subject vehicle. The at least one processor may include at least one processor at the management device, and the processor-executable instructions may further cause the system to: quantify, by the at least one processor of the management device, an amount of tailgating indications for at least one roadway segment of interest; and identify, by the at least one processor of the management device, at least one high-risk roadway segment, by identifying at least one roadway segment where an amount of associated tailgating indications exceeds a tailgating risk threshold.
The system may further comprise a management device; the at least one processor may include at least one processor at the management device; and the processor-executable instructions may further cause the system to: receive, by a management device, respective simplified following distance data for a plurality of subject vehicles; receive, by the management device, respective location data for the plurality of subject vehicles; for each data point in the simplified following distance data for each subject vehicle of the plurality of subject vehicles: associate, by the at least one processor of the management device, a location of the respective subject vehicle at a respective time of the data point with a roadway segment corresponding to the location of the respective vehicle at the respective time; and quantify, by the at least one processor of the management device, following distance for the plurality of subject vehicles for at least one roadway segment of interest.
Exemplary non-limiting embodiments are described herein with reference to the accompanying drawings.
The present disclosure details systems and methods for creating training data, for training machine learning models, and for applying machine learning models to identify vehicle movement and positioning. The present disclosure is particularly directed to detecting the travel lane of vehicles, determining the distance between vehicles, and identifying when a vehicle is tailgating another vehicle.
Throughout this disclosure, a “following” situation refers to a situation where a “following vehicle” is travelling behind a “lead vehicle”, in the same direction as the lead vehicle. In this context, “following” does not necessarily mean that the following vehicle is actively pursuing the lead vehicle (e.g. to the destination of the lead vehicle), but rather that the following vehicle is travelling behind the lead vehicle, for at least a moment in time. Lead vehicles and following vehicles are commonly referred to as first and second vehicles throughout this disclosure.
“Tailgating” generally refers to a situation involving two vehicles travelling in the same direction, where one vehicle is following the other vehicle at an unsafe distance (e.g. too close for the following vehicle to reliably stop safely if needed). In particular, if a following vehicle is tailgating a lead vehicle, sudden braking of the lead vehicle may result in an accident where the following vehicle hits the lead vehicle from behind. For instance, delayed reaction time of the driver of the following vehicle may prevent the following vehicle from decelerating at a sufficient rate to avoid rear-ending the lead vehicle. However, if the driver of the following vehicle were alerted to this dangerous circumstance, an accident may be avoided by prompting the driver of the following vehicle to alter operation of the following vehicle so as to increase the following distance from the lead vehicle.
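One commonly used notion of an unsafe following distance is a time-headway rule (for example, the familiar "two-second rule"). The sketch below shows such a criterion purely for illustration; it is an assumption for this example and is not necessarily the tailgating criteria applied in the implementations described later.

```python
def is_tailgating(following_distance_m, speed_mps, headway_s=2.0, min_gap_m=5.0):
    """One possible tailgating distance criterion (an illustrative assumption):
    the gap is considered unsafe when it is shorter than the distance the
    following vehicle travels in `headway_s` seconds at its current speed,
    subject to a static minimum gap."""
    threshold_m = max(speed_mps * headway_s, min_gap_m)
    return following_distance_m < threshold_m
```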
Models (e.g. artificial intelligence and/or machine learning models) for identifying vehicle positioning and movement, based on data captured by one or more image capture devices (e.g. video cameras or smart video cameras), are disclosed herein. Generally, a machine learning model is trained based on a set of training data, after which the model becomes able to analyze input data and reliably detect features or make determinations based on the input data. In some implementations, a trained model is deployed to an image capture device or a proximate device communicatively coupled to the image capture device, and captured image data is analyzed by the trained model. Such implementations are well suited to alerting the driver to dangerous situations, as analysis can be performed quickly, without the need for communication with a remote server. In alternative implementations, captured image data is analyzed in accordance with the trained model remote from the image capture device (e.g. at a central server or processing station). Such implementations are useful for identifying dangerous situations after the fact, such as for driver evaluation or collision reconstruction. However, such implementations could also be used to alert the driver to dangerous situations as they happen, albeit after communication of image data to the central server, followed by a message from the server to a device at the vehicle to output an alert to the driver. In yet other implementations, captured image data can be analyzed at an image capture device or a proximate device communicatively coupled to the image capture device, and results can be sent to a remote device (e.g. a central server or processing station), such as for driver evaluation or collision reconstruction. In yet other implementations, captured image data can be analyzed at an image capture device or a proximate device communicatively coupled to the image capture device, for immediate driver feedback, and captured image data can also be analyzed at a remote device, such as for driver evaluation or collision reconstruction.
Communication network 100 may include one or more computing systems and may be any suitable combination of networks or portions thereof to facilitate communication between network components. Some examples of networks include Wide Area Networks (WANs), Local Area Networks (LANs), Wireless Wide Area Networks (WWANs), data networks, cellular networks, and voice networks, among other networks, which may be wired and/or wireless. Communication network 100 may operate according to one or more communication protocols, such as General Packet Radio Service (GPRS), Universal Mobile Telecommunications Service (UMTS), GSM®, Enhanced Data Rates for GSM Evolution (EDGE), LTE™, CDMA, LPWAN, Wi-Fi®, Bluetooth®, Ethernet, HTTP/S, TCP, and CoAP/DTLS, or other suitable protocols. Communication network 100 may take other forms as well.
Mobile image system 101A includes a plurality of image capture devices 108, which can comprise (and be referred to herein as) smart video cameras (SVCs), though they are not strictly limited as such. The plurality of image capture devices 108 are positioned at (e.g. mounted in/on, or placed within or on) a plurality of vehicles 110. Mobile image system 101A also includes cloud server 106, client device 104 and local server 118. Client device 104 is communicatively coupled to local server 118 via communication link 120. Client device 104 is also shown as including at least one processor 104a and at least one non-transitory processor-readable storage medium 104b. The at least one processor 104a can perform acts such as determinations, identification, data analysis, processing, and other appropriate acts, such as acts in the methods described herein. The at least one non-transitory processor-readable storage medium 104b can store any appropriate data, including processor-executable instructions which when executed by the at least one processor 104a cause the client device 104 to perform acts, such as acts of the methods described herein. An exemplary client device may include a personal computer, a server, a system, or a combination of subsystems and devices. Specific and non-limiting examples of an image capture device or smart video camera include a Netradyne® video camera and a Nauto® video camera. Reference to a “camera” in this disclosure can include a smart video camera, but may also include a more basic camera. In this regard, the term “camera” can be used interchangeably with “image capture device”. Each image capture device 108 is communicatively coupled to cloud server 106 in cloud 112 via a respective communication link 116. For example, each image capture device 108 and the cloud server 106 are configured to communicate wirelessly with each other. Cloud server 106 is also shown as including at least one processor 106a and at least one non-transitory processor-readable storage medium 106b. The at least one processor 106a can perform acts such as determinations, identification, data analysis, processing, and other appropriate acts, such as acts in the methods described herein. The at least one non-transitory processor-readable storage medium 106b can store any appropriate data, including processor-executable instructions which when executed by the at least one processor 106a cause the cloud server 106 to perform acts, such as acts of the methods described herein. Cloud server 106 is communicatively coupled to client device 104 via communication link 114. For example, cloud server 106 and client device 104 are configured to communicate wirelessly with each other. As another example, cloud server 106 and client device 104 are configured to communicate with each other over a wired connection. In some implementations, local server 118 may be a remote server from client device 104. Local server 118 is also shown as including at least one processor 118a and at least one non-transitory processor-readable storage medium 118b. The at least one processor 118a can perform acts such as determinations, identification, data analysis, processing, and other appropriate acts, such as acts in the methods described herein. The at least one non-transitory processor-readable storage medium 118b can store any appropriate data, including processor-executable instructions which when executed by the at least one processor 118a cause the local server 118 to perform acts, such as acts of the methods described herein.
Mobile image system 101B in
Specific and non-limiting examples of vehicle types which each of vehicles 110 can be include: a government owned and operated vehicle, (e.g., as a vehicle for snow clearing, infrastructure maintenance, police enforcement), a public transportation vehicle, (e.g., bus, train), and a privately owned vehicle, (e.g., taxi, courier vehicle), among others.
An image capture device 108 may be mounted to or positioned at a vehicle 110 in a manner such that the image capture device 108 captures image data of the environment outside the vehicle 110, e.g., towards the windshield, towards a window, atop the vehicle, etc. Additionally, and/or optionally, an image capture device 108 may be mounted to or positioned at a vehicle 110 in a manner such that the image capture device 108 captures image data of the interior of the vehicle. Interior-facing image capture devices 108 may be useful for detecting an event, including detecting a person or persons of interest.
Alternatively, and/or optionally, mobile image systems 101A, 101B further include one or more image capture devices 108 coupled to a person and/or to an object that is not a vehicle. For example, an image capture device 108 can be coupled to a person, e.g., to a helmet of a motorcycle driver.
Now referring to
Further optionally, the image capture device 108A is shown as including at least one output interface 218, which can provide output to a driver of a vehicle in which the image capture device 108A is installed. For example, output interface 218 can be an audio output device (a speaker) which outputs audio for the driver to hear. As another example, output interface 218 can be a visual output device, such as a light or plurality of lights, or a display which outputs visual signals for the driver to see. As yet another example, output interface 218 can be a haptic output device, such as a vibration device, which outputs haptic signals for the driver to feel. Output interface 218 can be used to provide alerts, warnings, or feedback directly to the driver (such as tailgating indications), as discussed in detail later.
Now referring to
Collectively, reference to an image capture device 108 or a plurality of image capture devices 108 can include image capture device 108A in
Reference to “at least one processor” or “a processor” performing acts of any of the methods herein can refer to any appropriate processor. Further, at least one non-transitory processor-readable storage medium can store processor-executable instructions, which when executed by a respective at least one processor cause the corresponding system or device to perform a given act of any of the methods discussed herein.
At 402, image data is accessed. The image data is captured by an image capture device (such as image capture device 108A or 108B discussed with reference to
At 404, at least one processor (e.g. processor 206, 104a, 118a, 106a, or 312 as appropriate) analyzes the image data to identify vehicles represented in the image data (if any). For example, the at least one processor can run an object detection model (such as a vehicle detection model) trained to detect vehicles in image data. YOLO models are exemplary models which are effective for this task.
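As a non-limiting illustration, vehicle detection with a pretrained YOLO model could look like the sketch below, which assumes the ultralytics Python package and its pretrained weights are available; the specific package, weights file, and class names are assumptions of this example rather than requirements of the present disclosure.

```python
from ultralytics import YOLO  # assumes the ultralytics package is installed

model = YOLO("yolov8n.pt")  # a small pretrained model; weights name is illustrative
VEHICLE_CLASSES = {"car", "truck", "bus", "motorcycle"}

def detect_vehicles(image_path):
    """Return bounding boxes (x1, y1, x2, y2) for vehicles detected in the image."""
    results = model(image_path)[0]
    boxes = []
    for box in results.boxes:
        label = results.names[int(box.cls)]
        if label in VEHICLE_CLASSES:
            boxes.append(tuple(box.xyxy[0].tolist()))
    return boxes
```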
At 406, a following distance is determined between two vehicles based on the image data. In particular, a following distance is determined between a first vehicle (a lead vehicle) and a second vehicle (a following vehicle) where the image capture device is positioned. The perspective of the image data thus represents a perspective from the second vehicle. Act 406 in method 400 is shown as including sub-acts 420 and 422. Sub-acts 420 and 422 show one exemplary implementation for determining following distance between the two vehicles, and could be replaced by any other appropriate means for determining following distance.
At 420, a following situation is identified where the second vehicle is driving behind the first vehicle. In some implementations, this entails determining a lane of travel of the first vehicle and of the second vehicle, as shown in sub-act 430. If the determined lane of travel of the first vehicle is the same as the determined lane of travel of the second vehicle, the first and second vehicle are identified as travelling in the same lane. Alternatively, in some implementations, lanes do not need to be explicitly determined, nor does lane of travel of each vehicle need to be explicitly identified. Instead, a determination is made regarding whether the first vehicle and the second vehicle are travelling in a manner which indicates that the second vehicle is travelling behind the first vehicle. Such implementations are discussed later with reference to
At 422, a distance is determined between the first vehicle and the second vehicle. Throughout this disclosure, means for determining distance between two vehicles are discussed. For example,
At 408, a determination is made as to whether the second vehicle is tailgating the first vehicle. Generally, this entails determining whether the distance between the first vehicle and the second vehicle determined at 422 is within tailgating criteria. Such tailgating criteria can be static or dynamic, and are discussed in more detail later with reference to
At 410, an indication is output when the second vehicle is tailgating the first vehicle. For example, an alert can be output by a device in the second vehicle (e.g. output interface 218 in
To generate a predictive model which determines distance between vehicles based on image data, training image data is generated. Such predictive models are trained based on the generated training data. This disclosure describes generation of simulated training data (in the form of computer-rendered image data). Image data representing real-world tailgating situations, as captured by real-world image capture devices, would be good training data; however, such real-world image data is dangerous to collect. In particular, vehicles would need to engage in tailgating in order to capture image data representing tailgating, and thus said vehicles would need to engage in dangerous situations which the present disclosure aims to avoid. Generating simulated training data avoids such dangerous situations, while still providing reasonably accurate data for training predictive models.
In method 500, a plurality of instances are simulated where a first vehicle has a respective first position, and a second vehicle is simulated as following the first vehicle. In 510, a number of sub-acts (illustrated as sub-acts 511, 512, 513, 514, and 515) are performed for each instance in a plurality of instances.
At 511, parameter data indicating the first position of the first vehicle, and the second position of a virtual camera representing a perspective from a second vehicle positioned behind the first vehicle is received. The first position and the second position are specific to the instance, such that each instance represents a specific scenario (within the overall context of the dataset) where the first vehicle is being followed by the second vehicle at a specific distance, and each instance is generally different from other instances. Differences between instances can include differences in the following distance between the first vehicle and the second vehicle (due to differences between the first position and the second position for the instance). However, following distance between the first vehicle and the second vehicle is not necessarily unique to the instance in the data set. In particular, the first position and the second position represent positions in space within a virtual environment. Even if the first position and the second position for different instances are the same distance apart (same following distance), respective first positions and second positions can be at different locations within the virtual environment. As a result, a perspective represented by the virtual camera for an instance will be different from other instances, even if following distance is the same.
Positions of vehicles (or cameras in vehicles), such as the first position, second position, and the third position discussed later with reference to
At 512, for each instance, the first vehicle at the first position, and the virtual camera at the second position, are simulated in a virtual environment. An example of this is illustrated in
For the first instance, the first position of vehicle 610 and the second position of the virtual camera of the second vehicle 612 are such that the second vehicle 612 is following the first vehicle 610 at a close distance. For the second instance, the first position of vehicle 620 and the second position of the virtual camera of the second vehicle 622 are such that the second vehicle 622 is following the first vehicle 620 at a distance which is greater than the following distance in the first instance. For the third instance, the first position of vehicle 630 and the second position of the virtual camera of the second vehicle 632 are such that the second vehicle 632 is following the first vehicle 630 at a distance which is equal to the following distance in the second instance. However, because the respective second position for the second instance and the third instance is different, a perspective of the virtual camera for the second instance is different from a perspective of the virtual camera for the third instance.
Consequently, image data generated for the second instance is different from image data generated for the third instance.
Although
Returning to method 500 in
Image 700 is rendered from the perspective of a virtual camera positioned in a second vehicle which is following a first vehicle 790. Rendered image 700 shows a horizon 702 and a roadway delineated by road edges 710 and 712. The illustrated roadway includes two separate lanes 720 and 722, separated by dashed center line 714. In the illustrated example, vehicle 790 and the following vehicle are represented as driving in the right-hand lane, though image data can be rendered with vehicles travelling in any appropriate lane.
The virtual environment can be modelled and rendered using any appropriate technology. In some implementations, autonomous vehicle operation software is used to model and render the image data. Such software can include, for example, CARLA™, Simulator for Urban Driving in Massive Mixed Traffic (SUMMIT), Procedural Generation Drive (PGDrive), LG Silicon Valley Lab (LGSVL), and NVIDIA DRIVE Sim™.
Based on the first position and second position for an instance, as well as the surrounding environment features, exactly what appears in rendered image data such as image 700 varies per instance. Generally, the closer the first vehicle and the second vehicle are, the larger the first vehicle (vehicle 790 in
Returning to method 500 in
At 515, the at least one image for the instance (as rendered in act 513) is output. The output at least one image is also associated with a label indicative of the distance between the first vehicle and the second vehicle for the instance. In implementations where the distance between the first position and the second position is determined in act 514, the at least one image is output associated with a label indicative of the determined distance. In implementations where the distance between first position and the second position is included in the parameter data accessed in act 511, the at least one image is output associated with a label which indicates the distance between the first vehicle and the second vehicle (first position and second position) as included in the parameter data for the instance.
Acts 520 and 522 are shown in dashed lines, to illustrate that these acts are optional. Acts 520 and 522 are discussed in detail later, with reference to
At 524, a first plurality of images are stored at a non-transitory processor-readable storage medium, which includes each at least one image output for each instance of the plurality of instances. Each of the stored images is stored associated with the respective label indicating distance between the first vehicle and the second vehicle for the respective instance. As a result, the first plurality of images includes a plurality of images representing different instances where a second vehicle is positioned behind a first vehicle, labelled with a distance between the first vehicle and the second vehicle for each instance. In this way, the first plurality of images is an effective set of training data where distance between the first vehicle and the second vehicle is known, so that a machine learning model can be trained to determine distance between the first vehicle and the second vehicle, using the associated distance labels as validation.
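For illustration only, the following Python sketch shows one possible way to store the rendered images of the first plurality together with their distance labels; the directory layout and manifest format are assumptions for this sketch.

```python
# Hedged sketch: store rendered instance images with their following-distance labels.
# The directory layout and manifest format are illustrative assumptions.
import csv
from pathlib import Path

def store_labeled_images(instances, out_dir="training_data"):
    """instances: iterable of (image_bytes, distance_m) pairs, one per simulated instance."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    with open(out / "labels.csv", "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["image_file", "following_distance_m"])
        for i, (image_bytes, distance_m) in enumerate(instances):
            name = f"instance_{i:06d}.png"
            (out / name).write_bytes(image_bytes)       # the rendered image for the instance
            writer.writerow([name, distance_m])          # the associated distance label
```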
Method 500 in
In method 800, a plurality of instances are simulated where a third vehicle has a third position, and is not following another vehicle (or is not following at a distance close enough to be accurately determinable or important). In 810, a number of sub-acts (illustrated as sub-acts 811, 812, 813, and 815) are performed for each instance in a plurality of instances.
At 811, parameter data indicating the third position of a virtual camera representing a perspective from the third vehicle is received. The third position is specific to the instance, such that each instance represents a specific scenario (within the overall context of the dataset). As a result, a perspective represented by the virtual camera for an instance will be different from other instances.
At 812, for each instance, the virtual camera at the third position is simulated in a virtual environment. An example of this is illustrated in
For the first instance, the third vehicle 912 is not positioned behind (following) another vehicle. For the second instance, the third vehicle 922 is positioned behind first vehicle 920, but at a great distance. The distance between first vehicle 920 and third vehicle 922 is so great that it is not important for detection of tailgating, and further is possibly not accurately determinable based on image data.
Although
Returning to method 800 in
Image 1000A is rendered from the perspective of a virtual camera positioned in a third vehicle (e.g. third vehicle 912 in
The virtual environment can be modelled and rendered using any appropriate technology, as discussed above with reference to
Based on the third position for an instance, as well as the surrounding environment features, exactly what appears in rendered image data such as image 1000A varies per instance.
Image 1000B is rendered from the perspective of a virtual camera positioned in a third vehicle (e.g. third vehicle 922 in
The virtual environment can be modelled and rendered using any appropriate technology, as discussed above with reference to
Based on the third position for an instance, as well as the surrounding environment features, exactly what appears in rendered image data such as image 1000B varies per instance.
Returning to method 800 in
Act 814 is drawn in dashed lines, to illustrate that act 814 is optional. In some implementations, instead of act 814, the parameter data accessed at 811 includes an indication of distance between the third position and another vehicle as corresponding to a non-following situation for the instance.
At 815, the at least one image for the instance (as rendered in act 813) is output. The output at least one image is also associated with a label indicative of the non-following situation (whether this value is determined in act 814, or is included in the parameter data at 811).
Acts 820 and 822 are shown in dashed lines, to illustrate that these acts are optional. Acts 820 and 822 are discussed in detail later, with reference to
At 824, a second plurality of images are stored at a non-transitory processor-readable storage medium, which includes each at least one image output for each instance of the plurality of instances. Each of the stored images is stored associated with the respective label indicating the non-following situation for distance between the third vehicle and another vehicle for each instance. As a result, the second plurality of images includes a plurality of images representing different instances where a third vehicle is positioned a great distance from (or not even within sight of) another vehicle, labelled with a non-following situation value indicating the third vehicle is not (within the context of the models to be trained) following another vehicle. In this way, the second plurality of images is an effective set of training data where it is known that the third vehicle is not (within the context of models to be trained) following another vehicle, so that a machine learning model can be trained to account for such scenarios, using the associated non-following situation labels as validation.
With reference to acts 511 in method 500 and 811 in method 800, in some implementations accessed parameter data is provided/received as user input. For example, in device 300 in
In other implementations, with reference to acts 511 in method 500 and 811 in method 800, accessed parameter data is autonomously generated by at least one processor. For example, for each instance in the first plurality of instances (as discussed with reference to method 500 in
In an exemplary scenario, the at least one processor can autonomously determine a random position within the virtual environment, and randomly determine another position within the virtual environment which is within the distance threshold from the random position. These two randomly generated positions are the first and second positions, and can be determined in either order (i.e. first position is determined first, or second position is determined first). Further, autonomous determination of positions can be constrained based on features of the virtual environment. For example, random determination of positions can be limited to positions which are on roadways of the virtual environment. Further, random determinations of first and second positions of two vehicles can be constrained to positions in a same lane of a roadway of the virtual environment.
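For illustration only, the following Python sketch shows one way the first and second positions could be generated autonomously for an instance; a single straight lane is assumed for simplicity, whereas a real implementation would constrain positions to the road network of the virtual environment.

```python
# Hedged sketch: autonomously generate first/second vehicle positions for one instance.
# A single straight lane along the x axis is assumed for illustration only.
import random

def generate_instance_positions(lane_length_m=1000.0, lane_y=0.0,
                                min_gap_m=2.0, max_gap_m=40.0):
    # Random position of the lead (first) vehicle along the lane.
    first_x = random.uniform(max_gap_m, lane_length_m)
    # Following distance constrained to be within the distance threshold.
    gap = random.uniform(min_gap_m, max_gap_m)
    second_x = first_x - gap              # second (following) vehicle is behind the first
    first_position = (first_x, lane_y)
    second_position = (second_x, lane_y)  # same lane: same lateral coordinate
    return first_position, second_position, gap
```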
In another example, for each instance in the second plurality of instances (as discussed with reference to method 800 in
With further reference to acts 511 in method 500 and 811 in method 800, in some implementations accessed parameter data further indicates a resolution for the virtual camera. For example, a user can input a camera resolution via input devices 324, or a resolution may be stored in the system based on known camera hardware which the training data is being created for. Regardless, in acts 513 in method 500 and 813 in method 800, rendering an image for a particular instance entails rendering the image at the resolution of the virtual camera specified in the parameter data.
With further reference to acts 511 in method 500 and 811 in method 800, in some implementations accessed parameter data further indicates vehicle types (type of the first vehicle, type of the second vehicle, and/or type of the third vehicle). Such vehicle types can be specific to a particular instance, but are not necessarily unique to an instance. Alternatively or additionally, accessed parameter data further indicates vehicle dimensions or properties (e.g. size and/or weight of the first, second, or third vehicles). Vehicle type, properties (particularly weight), and dimensions can all have an impact on distance determination and tailgating detection. As one example, different vehicles have different distances between where a dashcam is mounted and the front of the vehicle (e.g., different vehicles have different hood lengths). As another example, a heavier vehicle will typically take longer to stop than a lighter vehicle, and thus unsafe tailgating distance is different between vehicles. By including these vehicle parameters in the respective parameter data, training data can be created which covers a broader range of vehicles and circumstances, and thus when said training data is used to train models, the resulting models should be more robust.
With further reference to acts 511 in method 500 and 811 in method 800, in some implementations accessed parameter data further indicates position and/or orientation of the virtual camera relative to the second vehicle. Position and orientation of a camera impact a resulting image captured by said camera, and thus including such information with the parameter data results in more accurate training data, and thus more accurate models trained based on said data. This principle is illustrated with reference to
With further reference to acts 511 in method 500 and 811 in method 800, in some implementations accessed parameter data further indicates attributes of the virtual camera. Attributes impact a resulting image captured by said camera, and thus including such information with the parameter data results in more accurate training data, and thus more accurate models trained based on said data. Exemplary camera attributes indicated in the parameter data could include, for example, resolution, lens focal length, lens type, or any other appropriate attributes. This principle is illustrated with reference to
When rendering images (as in act 513 of method 500 or act 813 of method 800), the properties of the camera can be accounted for, and images rendered as if they were captured by such a camera. Alternatively, subsets of images can be selected, and camera distortion effects applied after the image data is rendered, as discussed later with reference to
With further reference to acts 511 in method 500 and 811 in method 800, in some implementations accessed parameter data further indicates environmental conditions, or information from which environmental conditions can be derived. Environmental conditions impact a resulting image captured by said camera, and thus including such information with the parameter data results in more accurate training data, and thus more accurate models trained based on said data. This principle is illustrated in
In some implementations, the parameter data indicates weather conditions and/or lighting conditions. For example, the parameter data could indicate rain, snow, sleet, fog, sun, clouds, or any other appropriate weather conditions. In some implementations, the parameter data indicates time of day and/or date, from which weather conditions and/or lighting conditions can be generated, estimated, or retrieved (e.g. from a weather service). When simulating vehicles in the virtual environment and rendering images (as in acts 512 and 513 of method 500 or acts 812 and 813 of method 800), the environment and environment conditions can also be simulated, which results in visible changes in the image data. Alternatively, subsets of images can be selected, and environmental distortion effects applied after the image data is rendered, as discussed later with reference to
Other attributes of the environment can also be indicated in the respective parameter data, to be rendered in image data. As an example, atmospheric light scattering properties (scattering intensity, Rayleigh Scattering scale, Mie Scattering scale, etc.) can be indicated in the respective parameter data. Rendered image data can account for such scattering properties (e.g. by rendering sky with appropriate hue, saturation, and gradient). As another example, properties of vehicle 1620 can be indicated in the respective parameter data, such as vehicle color, vehicle dimensions, and/or vehicle model. Rendered image data can account for such vehicle properties (e.g. by rendering the vehicle of appropriate size, shape, and color).
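For illustration only, the following Python sketch groups the kinds of per-instance parameters discussed above (positions, camera attributes, vehicle properties, environmental conditions) into a single record; the field names and default values are assumptions for this sketch, not a schema required by the disclosure.

```python
# Hedged sketch: one way to group per-instance parameter data. Field names and defaults
# are illustrative assumptions only.
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass
class InstanceParameters:
    first_position: Tuple[float, float, float]            # lead vehicle position in the virtual environment
    camera_position: Tuple[float, float, float]            # virtual camera (following vehicle) position
    camera_resolution: Tuple[int, int] = (1280, 720)       # rendered image resolution
    camera_focal_length_mm: float = 3.6                    # lens attribute
    camera_mount_offset: Tuple[float, float, float] = (0.0, 1.3, 0.5)  # position/orientation relative to vehicle
    lead_vehicle_type: str = "sedan"                        # vehicle type / dimensions affect appearance
    lead_vehicle_color: str = "white"
    weather: str = "clear"                                  # e.g. rain, snow, fog
    time_of_day: str = "noon"                               # lighting can be derived from this
    scattering_intensity: Optional[float] = None            # atmospheric light scattering, if modelled
```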
Beyond environmental effects, image data can also show rendered technical effects.
When rendering images (as in act 513 of method 500 or act 813 of method 800), technical effects can also be simulated, which results in visible changes in the image data. Alternatively, subsets of images can be selected, and technical distortion effects applied after the image data is rendered, as discussed below with reference to
Respective parameter data for each generated image can be input manually; that is, an operator could input values for a number of properties for each image to be generated via a user input device such as those discussed earlier. However, in order to generate a large library of training images (which will result in a more robust model trained based on the training images), parameter data can be autonomously generated by at least one processor. To this end, instructions can be provided (e.g. by a user via a user input device) regarding ranges of parameters for image generation.
In some implementations, prior to act 511 in method 500 or act 811 in method 800 (that is, prior to accessing respective parameter data, for subsequent generation of image data for a particular instance), individual parameter data is generated for the particular instance. That is, specific values for each parameter of interest are provided, and method 500 or method 800 proceed to generate image data for the instance based on the specific values provided.
In other implementations, in act 511 in method 500 or act 811 in method 800, general parameter data is accessed (e.g. such as the list shown in
In some implementations, image data can be generated, and image distortion effects (whether environmental or technical) can be applied afterwards to the generated image data. In particular, from the plurality of instances for which image data is generated (the first plurality of instances in method 500 and/or the second plurality of instances in method 800), a subset of instances is selected. For each instance of the subset of instances, at least one distortion effect (e.g. camera attributes as discussed with reference to
Synthetic Image 1 is run through a distortion module, which applies distortion effects thereto. In the illustrated example, four different distortion “schemes” are applied to Synthetic Image 1, to generate four respective distorted images, labelled “Distorted Image 1”, “Distorted Image 2”, “Distorted Image 3”, and “Distorted Image 4”. Distortion “scheme” refers to a specific type, collection, magnitude, or other properties of distortion, which result in different distorted images. For example, a 25% downscale-upscale distortion (described later) can be applied to Synthetic Image 1 to generate Distorted Image 1; a 50% downscale-upscale distortion can be applied to Synthetic Image 1 to generate Distorted Image 2; a raindrop effect can be applied to Synthetic Image 1 to generate Distorted Image 3; a combination of a raindrop effect and a 50% downscale-upscale distortion can be applied to Synthetic Image 1 to generate Distorted Image 4.
Generally, a greater variety of distorted images, with varying distortions applied thereto, to varying degrees, and in combination with other distortions, will result in a large data set which simulates many different scenarios, camera configurations, and data pipelines, and thus will generally result in training data which, when used to train a machine learning model, will result in the machine learning model being more robust. Every possible type of distortion, and every possible combination of distortion effects, is not listed herein for brevity. However, generally the disclosure can be applied to arrive at a set of distortion effects (and combinations of distortion effects) which result in meaningful training data representative of real-world effects. Some example distortions are discussed below.
A downscale-upscale distortion refers to a process where an image is downscaled by a certain amount (e.g. to 25%, 50%, 75%, or any other appropriate resolution), and then upscaled back to the original resolution. Such a process simulates blurring effects, artifacting, motion, low sensor quality, compression loss, and other image data effects. Generally, the lower the resolution which the image is downscaled to, the greater the resulting distortions.
A compress-decompress distortion refers to a process where an image is compressed using a lossy compression technology (i.e. a technology where some original data is lost), and subsequently decompressed. Such a process simulates compression artifacting.
A noise distortion refers to a process where random noise is introduced into an image. For example, a noise filter can be applied over the image. An adversarial effect distortion similarly introduces noise into an image, but said noise is very specific and designed to cause a trained model to produce false output when analyzing such an image.
Blur distortion refers to applying a blur filter to an image.
Pixel value distortion refers to distorting pixel values within an image (e.g. to oversaturate, undersaturate, discolor an image, brighten, or darken an image).
Motion distortion refers to applying a motion filter to an image, e.g. by blurring or skewing the image in a direction to simulate movement during image capture.
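For illustration only, the following Python sketch implements several of the distortions described above (downscale-upscale, compress-decompress, noise, and blur) using OpenCV and NumPy; the parameter values are illustrative assumptions, not values prescribed by the disclosure.

```python
# Hedged sketch: a few of the distortion "schemes" described above, implemented with
# OpenCV and NumPy. Parameter values are illustrative only.
import cv2
import numpy as np

def downscale_upscale(img, scale=0.5):
    """Downscale to `scale` of the original resolution, then upscale back."""
    h, w = img.shape[:2]
    small = cv2.resize(img, (max(1, int(w * scale)), max(1, int(h * scale))),
                       interpolation=cv2.INTER_AREA)
    return cv2.resize(small, (w, h), interpolation=cv2.INTER_LINEAR)

def compress_decompress(img, jpeg_quality=30):
    """Lossy JPEG encode/decode to simulate compression artifacting."""
    ok, buf = cv2.imencode(".jpg", img, [int(cv2.IMWRITE_JPEG_QUALITY), jpeg_quality])
    return cv2.imdecode(buf, cv2.IMREAD_COLOR)

def add_noise(img, sigma=10.0):
    """Additive Gaussian noise over the whole image."""
    noise = np.random.normal(0.0, sigma, img.shape)
    return np.clip(img.astype(np.float32) + noise, 0, 255).astype(np.uint8)

def blur(img, kernel=5):
    """Simple blur filter (kernel size must be odd)."""
    return cv2.GaussianBlur(img, (kernel, kernel), 0)
```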
A lens-obstruction effect distortion refers to a process where effects of external substances on the camera lens are simulated. For example, water spots such as those illustrated in
Environmental distortions can be applied to simulate environmental effects: for example, lens flare can be added, image levels (brightness, contrast, etc.) can be adjusted, pixel values can be modified, and environmental filters (e.g. precipitation filters) can be applied or overlaid.
Camera property distortions could also be applied, such as warping areas of the image to simulate focal length effects.
Applying effects to image data (whether the effects are related to camera properties, environmental conditions, technical effects, or other effects) can be referred to as domain randomization.
The example images rendered in acts 513 in method 500 and 813 in method 800 have thus far been illustrated as being single images rendered for each respective instance. These are valid implementations of methods 500 and 800, but in alternative implementations, a plurality of images can be rendered for respective single instances.
In image 2000A, vehicle 2020 is shown driving in lane 2002, relatively close to the vehicle in which the camera is positioned (relative to images 2000B and 2000C). In image 2000A, tree 2012 is shown relatively far from the vehicle in which the camera is positioned (relative to images 2000B and 2000C).
Image 2000B represents a moment in time after the moment shown in image 2000A. In image 2000B, vehicle 2020 is in the process of changing lanes, from lane 2002 to lane 2004, and is thus driving over dividing line 2006. Further, the vehicle in which the camera is positioned has moved forward, such that stationary tree 2012 appears closer to the camera than in image 2000A. Further still, vehicle 2020 is moving faster than the vehicle in which the camera is positioned, and consequently distance between the two vehicles has grown, such that vehicle 2020 appears further from the camera in image 2000B than in image 2000A.
Image 2000C represents a moment in time after the moment shown in image 2000B. In image 2000C, vehicle 2020 has finished changing lanes, and is now travelling in lane 2004. Further, the vehicle in which the camera is positioned has moved even further forward, such that stationary tree 2012 appears even closer to the camera than in image 2000B. Further still, vehicle 2020 is moving faster than the vehicle in which the camera is positioned, and consequently distance between the two vehicles has grown even more, such that vehicle 2020 appears even further from the camera in image 2000C than in image 2000B.
More or fewer images could be rendered, as appropriate for a given application. By rendering a plurality of images for each instance, training data is more detailed, such that a model trained based on such data will be capable of analyzing vehicle movement over time, instead of trying to understand a situation based on a static image.
In order to render such data, acts 512 and 812 in methods 500 and 800 entail simulating movement of the first vehicle (vehicle 2020 in
Returning to method 400 in
At 2102, image data is accessed by at least one processor of the device performing method 2100. The image data includes at least a first set of images, such as the first plurality of images output at 524 in method 500 discussed with reference to
At 2110, a following distance loss function is minimized over the first set of images. Equation (1) below shows the loss function for this exemplary implementation:
L = P * |D − d| + (P − p)^2    (1)
In Equation (1), L represents loss. P is the vehicle presence label, where a label of 0 indicates the first vehicle is not within the vehicle presence threshold, and a label of 1 indicates the first vehicle is within the vehicle presence threshold. Vehicle presence as determined by the model is indicated by p, and is a decimal number between 0 and 1 which represents confidence by the model that the first vehicle is within the vehicle presence threshold (where a higher value means greater confidence, and vice-versa). D is the value for distance indicated in the distance label, and d is the value for distance as determined by the model.
The first term in Equation (1), P*|D−d|, represents the distance regression loss. That is, the difference between the distance as indicated in the label and the distance determined by the model. Where P=1, (vehicle presence label for a particular image indicates that the first vehicle is within the vehicle presence threshold), the first term becomes |D−d|, which represents difference between the distance label and the distance determined by the model (i.e., how accurately the model determined distance, where a higher value indicates greater inaccuracy than a low value). Where P=0, (vehicle presence label for a particular image indicates that the first vehicle is not within the vehicle presence threshold), the first term becomes 0, such that loss L becomes only the second term.
The second term in Equation (1), (P − p)^2, represents classification loss. That is, the difference between the vehicle presence as indicated in the vehicle presence label and as determined by the model (i.e., how inaccurately the model classifies whether a vehicle is within the vehicle presence threshold).
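For illustration only, Equation (1) can be transcribed directly into Python as below; the numeric check uses the values from the first worked example discussed later.

```python
# Equation (1) transcribed directly: L = P * |D - d| + (P - p)^2
def following_distance_loss(P: float, D: float, p: float, d: float) -> float:
    """P, D: vehicle presence and distance labels; p, d: model outputs."""
    return P * abs(D - d) + (P - p) ** 2

# Check against the first worked example below: P=1, D=3 m, p=0.9, d=2.5 m gives L=0.51
assert abs(following_distance_loss(1, 3.0, 0.9, 2.5) - 0.51) < 1e-9
```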
In the process of generating training data as discussed earlier with reference to
In some exemplary implementations, the vehicle presence threshold is set to 40 meters. However, any vehicle presence threshold could be used, as appropriate for a given application.
In the example of
At 2114, the determined loss L is compared to a maximum loss threshold by the at least one processor. If determined loss L is not within the maximum loss threshold, method 2100 proceeds to act 2116 where the model is adjusted (e.g. by adjusting weights and biases of the model with the aim of reducing loss). In one exemplary implementation, backpropagation is implemented to adjust weights and biases of the model. One skilled in the art can implement any appropriate model structure and means for adjusting the model, as appropriate for a given application. After the model is adjusted at 2116, method 2100 returns to act 2112, where the following distance function is evaluated for at least one image of the first set of images. The at least one image for which the following distance loss function is evaluated can be the same at least one image as before, such that the adjustments to the model are “tested” against the same image data. Alternatively, the at least one image for which the following distance loss function is evaluated can be a different at least one image, such that the model is adjusted by moving through the first set of images.
Acts 2112, 2114, and 2116 can be iterated any appropriate number of times, until loss is within the maximum loss threshold at 2114, in which case method 2100 proceeds to 2118. At 2118, auxiliary criteria for the model are evaluated. If the auxiliary criteria are not satisfied, method 2100 returns to act 2112, where the following distance loss function is evaluated. Auxiliary criteria can include various criteria. As one example, auxiliary criteria can require that the loss function be within a maximum loss threshold for each image in the first set of images. That is, even if the loss function is within a maximum loss threshold for a first image, the auxiliary criteria can require that each image be evaluated prior to outputting the trained model. As another example, auxiliary criteria can require that the loss function be within a maximum loss threshold for at least a defined amount of images in the first set of images. That is, even if the loss function is within a maximum loss threshold for a first image, the auxiliary criteria can require that the loss function be within the maximum loss threshold for a defined amount (e.g. 90%) of the images in the first set of images. As another example, auxiliary criteria can require that the loss function be evaluated for at least a defined amount of images (e.g. 90%) in the first set of images.
Act 2118 is optional. In one exemplary implementation, evaluating the following distance loss function for at least one image of the first set of images in act 2112 comprises evaluating the following distance loss function for each image of the first set of images (or for a defined amount of images in the first set of images), such that criteria regarding quantity of images to be evaluated are inherently satisfied.
If the auxiliary criteria are satisfied at 2118 (or if act 2118 is not included), method 2100 proceeds to act 2120. At 2120, the model is considered as a “trained” model, and is output for use. For example, the trained model can be sent to another device for storage, distribution, and/or application, or can be stored at a non-transitory processor-readable storage of the device which performed the training.
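For illustration only, the following Python sketch outlines the iterate-evaluate-adjust loop of acts 2112 through 2120, with a maximum loss threshold and an auxiliary criterion requiring a defined fraction of images to be within that threshold; the model and the adjustment step are placeholders, since the disclosure does not fix a particular architecture.

```python
# Hedged sketch of acts 2112-2120: evaluate the loss for images, adjust the model when
# loss exceeds the maximum loss threshold, and stop once auxiliary criteria are met.
# `model` and `adjust_model` are placeholders; no particular architecture is implied.
def train(model, adjust_model, dataset, max_loss=0.25, required_fraction=0.9, max_iters=100_000):
    """dataset: list of (image, P_label, D_label); model(image) -> (p, d)."""
    for _ in range(max_iters):
        losses = []
        for image, P, D in dataset:
            p, d = model(image)                        # act 2112: evaluate the loss function
            L = P * abs(D - d) + (P - p) ** 2          # Equation (1)
            losses.append(L)
            if L > max_loss:                           # act 2114: compare to maximum loss threshold
                adjust_model(model, image, P, D, L)    # act 2116: e.g. backpropagation
        within = sum(1 for L in losses if L <= max_loss) / len(losses)
        if within >= required_fraction:                # act 2118: auxiliary criteria
            return model                               # act 2120: output trained model
    return model
```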
Exemplary implementations and usage scenarios for method 2100 (in particular act 2110) are discussed below.
In a first example, at 2112 the distance loss function is determined for a first image. The first image is associated with vehicle presence label P1=1 and distance label D1=3 m. In this case, the model determines vehicle presence p1=0.9 and distance as d1=2.5 m. With these values, evaluating Equation (1) results in a distance loss L1=0.51. At 2114, loss L1 is compared to a maximum loss threshold, which in this example is 0.25. Since 0.51 is greater than 0.25, loss L1 is not within the maximum loss threshold, and method 2100 proceeds to act 2116. At 2116, the model is adjusted per a machine learning adjustment process, after which method 2100 proceeds to a second iteration of act 2112. In this first example, the second iteration of act 2112 is run again on the first image. As a result of the adjustments to the model at 2116, the model now determines vehicle presence p2=0.95 and distance as d2=2.9 m. As a result, Equation (1) evaluates to loss L2=0.1025. In a second iteration of act 2114, loss L2 is compared to the maximum loss threshold of 0.25. Since 0.1025 is less than 0.25, loss L2 is within the maximum loss threshold. If no auxiliary criteria are specified (i.e. act 2118 is not included), method 2100 proceeds to act 2120, where the trained model is output.
For a case where auxiliary criteria are specified in the first example, which require that the loss be within the maximum loss threshold for each image in the first set of images, at 2118 the method returns to 2112. The following distance loss function is evaluated for a second image at 2112, and method 2100 proceeds to sub-act 2114 (and 2116 if appropriate) similar to as discussed regarding the first image. This cycle is repeated for each image in the first set of images.
In the first example, the model is trained by repeating evaluation of the distance loss function for a first image. As discussed above, this can be performed for each image in the first set of images, until the distance loss function as evaluated for each image is within the maximum loss threshold. Alternatively, this can be performed until the distance loss function as evaluated for a threshold amount of images, such as 90% of the images, is within the maximum loss threshold. In this way, loss can be minimized for each image (or a satisfactory amount of images) in the first set of images.
In a second example, at 2112 the distance loss function is determined for the first image similarly as discussed above for the first example. As above, evaluating Equation (1) results in a distance loss L1=0.51. At 2114, loss L1 is compared to a maximum loss threshold, which in this example is 0.25. Since 0.51 is greater than 0.25, loss L1 is not within the maximum loss threshold, and method 2100 proceeds to act 2116. At 2116, the model is adjusted per a machine learning adjustment process, after which method 2100 proceeds to a second iteration of act 2112. In this second example, the second iteration of act 2112 is run instead on a second image. The second image is associated with vehicle presence label P2=1 and distance label D2=2 m. In this case, the model determines vehicle presence p2=0.93 and distance as d2=1.7 m. With these values, evaluating Equation (1) results in a distance loss L2=0.3049. At 2114, loss L2 is compared to a maximum loss threshold, which in this example is 0.25. Since 0.3049 is greater than 0.25, loss L2 is not within the maximum loss threshold, and method 2100 proceeds to act 2116. At 2116, the model is again adjusted per a machine learning adjustment process, after which method 2100 proceeds to a third iteration of act 2112. In this second example, the third iteration of act 2112 is run instead on a third image. The third image is associated with vehicle presence label P3=1 and distance label D3=3.5 m. In this case, the model determines vehicle presence p3=0.95 and distance as d3=3.3 m. With these values, evaluating Equation (1) results in a distance loss L3=0.2025. In a third iteration of act 2114, loss L3 is compared to the maximum loss threshold of 0.25. Since 0.2025 is less than 0.25, loss L3 is within the maximum loss threshold. If no auxiliary criteria are specified (i.e. act 2118 is not included), method 2100 proceeds to act 2120, where the trained model is output.
For a case where auxiliary criteria are specified in the second example, which require that the loss be within the maximum loss threshold for each image in the first set of images, at 2118 the method returns to 2112. The following distance loss function is evaluated for a fourth image at 2112, and method 2100 proceeds to sub-act 2114 (and 2116 if appropriate) similar to as discussed regarding the first image. This cycle is repeated for each image in the first set of images. Further, because the loss function for the first and second images was determined as being greater than the maximum loss threshold, sub-acts 2112, 2114, and 2116 (as appropriate) are performed again for the first and second images.
In the second example, the model is trained by iteratively evaluating the distance loss function, on different images. In this way, loss can be minimized for a plurality of images (or a satisfactory amount of images) in the first set of images.
Once the model is trained, it can be used in detection of tailgating. In this regard,
At 2202, image data including at least one image is accessed. Each image in the image data represents a perspective from a second vehicle (which may be following a first vehicle, as is to be determined by method 2200).
In some implementations, accessing the image data in act 2202 comprises accessing stored image data (e.g. simulated or captured image data which is stored at a non-transitory processor-readable storage medium). In other implementations, accessing the image data in act 2202 comprises capturing the image data (e.g. image data from an image capture device is provided directly to at least one processor which applies the tailgating detection algorithm). In yet other implementations, accessing the image data comprises receiving the image data by a communication interface of the system or device which is performing method 2200 (e.g. from a remote device or datastore).
At 2204, the at least one processor determines whether a first vehicle is represented in an image of the image data. For example, the at least one processor can run a feature or object detection model (such as a YOLO model) to detect vehicles in the image data. If a first vehicle is not represented in an image of the image data, then method 2200 proceeds to act 2220, where the at least one processor determines that the second vehicle is not tailgating the first vehicle (since there is no first vehicle to tailgate). If a first vehicle is detected in the image data at 2204 (or if act 2204 is not performed), method 2200 proceeds to act 2206.
At 2206, the at least one processor applies a following distance determination model to determine a distance between the first vehicle and the second vehicle. The applied model can be any of the models discussed herein, such as a model trained as discussed with reference to
At 2210, the at least one processor determines whether tailgating criteria are met. In method 2200 in
At 2212, the at least one processor determines whether the distance determined at 2206 is within tailgating distance criteria. That is, the at least one processor determines whether the distance between the first vehicle and the second vehicle is an unsafe distance.
Generally, tailgating distance criteria is dependent on speed of travel of the vehicles. Equation (2) below illustrates an exemplary tailgating threshold distance. For implementations which use tailgating threshold distance as tailgating distance criteria, when a distance between two vehicles is less than the tailgating threshold distance, the tailgating distance criteria is satisfied.
DT = 0.4 * v    (2)
In Equation (2), DT represents tailgating threshold distance, and v represents speed (typically of the second vehicle) in kilometers per hour (km/h). Stated differently, in Equation (2), a safe following distance is approximately four meters for every 10 km/h of speed of the vehicle. In the example, v represents speed of the second vehicle. This is because the speed of the second vehicle is more readily available (e.g. is collected by a telematics monitoring device installed in the second vehicle, or by hardware associated with the image capture device in the second vehicle). Speed of the first vehicle is often more difficult to obtain, because the first vehicle may not be associated with the same telematics system as the second vehicle, and therefore data may not be collected from the first vehicle. However, in some implementations the speed of the first vehicle could be determined relative to the speed of the second vehicle, by determining a difference in distance between the first vehicle and the second vehicle over time. In other implementations, the speed of the first vehicle could be determined by a machine learning model trained to estimate vehicle speed. In yet other implementations, the first vehicle can be part of the same telematics system as the second vehicle, and therefore speed data for the first vehicle may be accessible.
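For illustration only, the following Python sketch expresses the "four meters per 10 km/h" rule of Equation (2) and the resulting distance comparison.

```python
# Tailgating threshold distance per Equation (2): roughly 4 m per 10 km/h of speed.
def tailgating_threshold_m(speed_kmh: float) -> float:
    return 0.4 * speed_kmh

def within_tailgating_distance(following_distance_m: float, speed_kmh: float) -> bool:
    """True when the measured following distance is less than the threshold distance."""
    return following_distance_m < tailgating_threshold_m(speed_kmh)

# Example: at 100 km/h the threshold is 40 m, so a 25 m gap satisfies the distance criteria.
assert within_tailgating_distance(25.0, 100.0)
```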
Equation (2) above is a generalization. More factors can be taken into account to arrive at a more specific tailgating distance criteria, thus reducing false positive and false negative tailgating determinations.
Tailgating when: DS2 > (DF + DS1)    (3)
In Equation (3), if vehicle 2320 is not able to stop within the distance presently between the vehicles, plus the distance for vehicle 2310 to stop, then the following distance DF is considered as within tailgating distance criteria.
Stopping distances for vehicles (such as DS1 and DS2 above, collectively DS) can take into account a number of factors, such as vehicle speed, vehicle weight (weight of vehicle itself, possibly including load), road coefficient of friction (e.g. based on weather data indicating good weather, rain, snow, etc.), driver response time (e.g. based on a historical driver profile), or any other appropriate factors.
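For illustration only, the following Python sketch combines a kinematic stopping-distance estimate (reaction distance plus braking distance) with the comparison of Equation (3); the stopping-distance formula and the direction of the safety bias factors are assumptions consistent with the safety discussion that follows, not values given by the disclosure.

```python
# Hedged sketch: stopping-distance-based tailgating check per Equation (3).
# The kinematic stopping-distance estimate and bias factors are illustrative assumptions.
G = 9.81  # gravitational acceleration, m/s^2

def stopping_distance_m(speed_kmh: float, reaction_time_s: float = 1.5,
                        friction_coeff: float = 0.7, bias: float = 1.0) -> float:
    """Reaction distance plus braking distance, scaled by a safety bias factor."""
    v = speed_kmh / 3.6                                   # convert km/h to m/s
    distance = v * reaction_time_s + v ** 2 / (2 * friction_coeff * G)
    return distance * bias

def is_tailgating(following_distance_m: float, lead_speed_kmh: float,
                  follower_speed_kmh: float) -> bool:
    # Bias toward safety: assume the lead vehicle stops quickly (shorter DS1) and the
    # following vehicle stops slowly (longer DS2), per the discussion below.
    ds1 = stopping_distance_m(lead_speed_kmh, bias=0.8)       # lead vehicle stopping distance
    ds2 = stopping_distance_m(follower_speed_kmh, bias=1.2)   # following vehicle stopping distance
    return ds2 > following_distance_m + ds1                   # Equation (3)
```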
In the interests of safety, it is generally preferred to bias determination of a stopping distance for a lead vehicle (e.g. vehicle 2310 in
Similarly in the interests of safety, it is generally preferred to bias determination of a stopping distance for a following vehicle (e.g. vehicle 2320 in
Returning to method 2200 in
In accordance with act 2204 of method 2200, vehicle 2420 is identified in images 2400A. In
In one exemplary implementation, a feature detection model is applied to identify road lanes (e.g. based on road edges 2408, 2409, and dividing line 2406). Once lanes are identified, the at least one processor determines a lane of travel of the vehicle carrying the image capture device (in the illustrated example, lane 2402). Vehicles travelling in the same lane as the vehicle with the image capture device are considered to be “in front” of the second vehicle in the context of sub-act 2214 in method 2200. In this exemplary implementation, vehicle 2420 in image 2400A and vehicle 2420 in image 2400B are considered “in front” of the second vehicle, whereas vehicle 2420 in image 2400C is not considered “in front” of the second vehicle.
In another exemplary implementation, distances from the edges of captured images are used to determine whether the lead vehicle (vehicle 2420 in the illustrated example) is travelling “in front” of the second vehicle in the context of method 2200.
In this regard,
In the example of
In some scenarios, the first vehicle driving in front of a second vehicle does not necessarily result in image data where the first car appears horizontally centered in the image data. For example, as discussed above with reference to
In view of the above, calibration can be performed such that the horizontal distance threshold accounts for non-centered bias of the image data (e.g. due to the image capture device being positioned away from a horizontal center of the second vehicle in the context of method 2200 of
In the case of
Generally, an optimal horizontal distance threshold is determined as appropriate for a specific application or implementation. This is because different camera hardware, different camera positioning, different vehicle features, or any number of other factors can influence optimal horizontal distance threshold.
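For illustration only, the following Python sketch shows one plausible formulation of the horizontal distance check, using the bounding-box center, a calibration offset, and a threshold; the exact edge-based measure used in a given implementation may differ.

```python
# Hedged sketch: decide whether a detected lead vehicle is travelling "in front" of the
# camera vehicle using a horizontal distance threshold with a calibration offset.
# This is one plausible formulation; the exact edge-based measure may differ.
def is_in_front(box_xyxy, image_width: int,
                horizontal_threshold_px: float, calibration_offset_px: float = 0.0) -> bool:
    x1, _, x2, _ = box_xyxy
    box_center_x = (x1 + x2) / 2.0
    # Expected horizontal position of a directly-leading vehicle, accounting for a camera
    # that is not mounted at the horizontal center of the second vehicle.
    expected_center_x = image_width / 2.0 + calibration_offset_px
    return abs(box_center_x - expected_center_x) <= horizontal_threshold_px
```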
Returning to method 2200 in
In some implementations, outputting the indication of tailgating comprises outputting an alert to a driver of the second vehicle (such as by output interface 218 in
In some implementations, outputting the indication of tailgating comprises transmitting an alert, notification, or report of the tailgating situation to a management device (such as any of client device 104, cloud server 106, or local server 118 discussed with reference to
Outputting an indication of tailgating as in act 2224 is not limited to outputting a single indication of tailgating. In some implementations, an indication of tailgating can be output to the driver (e.g. as discussed with reference to
Method 2700 can be implemented for example within the context of detecting tailgating. For example, method 2700 in
Method 2700 is discussed below in the context of a first example scenario illustrated in
Returning to method 2700, at 2702, an image is accessed including a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle. In the example of
At 2704, at least one processor of the system or device performing method 2700 determines at least one image attribute based on a positional measure between the first vehicle as represented in the image and at least one boundary of the image. Such a positional measure can be based on physical features of the vehicle represented in the image (e.g. pixels of the image representing edge features of the vehicle), or by identifying delineations of the vehicle. For example, a feature detection model (such as the YOLO detection model) can be run on the image, to identify the first vehicle in the image. A bounding box (such as bounding box 2822 in image 2800 shown in
In some exemplary implementations, a feature detection model can be applied to identify road lanes, such as discussed for example with reference to
In the example of
Optionally, in the example of
At 2706 in method 2700, the at least one processor applies a following distance determination model to determine the following distance based on the at least one image attribute determined at 2704. This model is trained to predict or determine following distance based on the at least one image attribute, as opposed to by analysis of the image itself. Detailed discussion of such a model and how it can be trained can be found later with reference to
Generally, the further a leading vehicle is from a tailgating vehicle (the greater the physical distance between the first and second vehicle in method 2700), the larger distance H1 will be, and the smaller the distance H2 will be. This is because a further away vehicle will typically appear higher up in images captured by a front-facing camera in a vehicle. As such, in the example of
At 2708, the determined following distance is output. In an exemplary implementation where method 2700 is used to determine following distance in act 2206 of method 2200, outputting the following distance can include preparing the following distance for a next act in the method 2200 (e.g. by storing the determined following distance in at least one non-transitory processor-readable storage medium). In another exemplary implementation, outputting the following distance can include outputting the following distance to a driver of the second vehicle, such as via a display or audio output device. In yet another exemplary implementation, outputting the following distance can include providing the determined following distance to a database (e.g. at a device or server such as client device 104, cloud server 106, or local server 118 in
d1A^2 = H1^2 + W1^2    (4)
d1B^2 = H1^2 + W2^2    (5)
With reference to method 2700 in
MC = (d1A^2 + d1B^2) / (k1k2)    (6)
In Equation (6), MC is a cumulative measure usable as the at least one image attribute in act 2704 of method 2700. The denominator k1k2 (product of image height and width) represents size of the image. By summing d1A^2 and d1B^2, and dividing by the image size (k1k2), a normalized measure is determined which produces similar results regardless of image size. In this way, the measure can be applied to different sizes of images (e.g. from different image capture devices, or preprocessed/cropped in different ways), and still produce predictable results. However, dividing by k1k2 is not necessary. For example, in some implementations image resolution can be controlled (e.g. by only providing images of a certain resolution, scaling the image to have a certain resolution, and/or cropping the image to have a certain resolution).
Determining the at least one image attribute as including a cumulative measure of d1A and d1B advantageously compensates for features which are not indicative of following distance. For example, if vehicle 2820 is positioned towards the left side of image 2810, W2 will be smaller, whereas W1 will be larger. This in turn results in a larger d1A compared to a smaller d1B, even for the same following distance. On the other hand, if vehicle 2820 is positioned towards the right side of image 2810, W2 will be larger, whereas W1 will be smaller. This in turn results in a larger d1B compared to a smaller d1A, even for the same following distance. By combining d1B with d1A as a cumulative measure, horizontal positioning of the vehicle 2820 within the image is compensated for, such that the cumulative measure is more indicative of following distance.
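For illustration only, the following Python sketch computes the cumulative measure of Equation (6) from a vehicle bounding box. The geometric assumptions (H1 as the vertical gap from the image bottom to the bottom of the bounding box, W1 and W2 as the horizontal gaps to the right and left image boundaries, pixel y coordinates increasing downward) are inferred from Equations (4) through (6) and may differ from the figures.

```python
# Hedged sketch: compute the cumulative measure MC of Equation (6) from a bounding box.
# The mapping of H1, W1, W2 onto box/image edges is an inferred assumption.
def cumulative_measure(box_xyxy, image_w: int, image_h: int, normalize: bool = True) -> float:
    x1, y1, x2, y2 = box_xyxy
    h1 = image_h - y2            # vertical gap from image bottom to box bottom
    w1 = image_w - x2            # horizontal gap from right boundary to box right side
    w2 = x1                      # horizontal gap from left boundary to box left side
    d1a_sq = h1 ** 2 + w1 ** 2   # Equation (4)
    d1b_sq = h1 ** 2 + w2 ** 2   # Equation (5)
    mc = d1a_sq + d1b_sq         # numerator of Equation (6)
    return mc / (image_w * image_h) if normalize else mc
```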
Efficacy of the cumulative measure can be determined as discussed with reference to Equations (7) and (8) below. First, d1A^2 and d1B^2 in Equation (6) can be substituted with their equivalents in Equations (4) and (5), which results in Equation (7) below:
MC = (2*H1^2 + W1^2 + W2^2) / (k1k2)    (7)
By differentiating MC with respect to H1, a quantity of change in MC per unit change of H1 can be determined, as shown in Equation (8) below:
dMC/dH1 = 4*H1 / (k1k2)    (8)
As can be seen in Equation (8), MC (as normalized by dividing by k1k2) changes at a rate of 4 times H1 (also as normalized by dividing by k1k2). If normalization is omitted (the division by k1k2 is not performed, for both MC and H1 as the image attribute), MC still changes at a rate of 4 times H1. That is, MC is 4 times more sensitive than H1. Thus, MC can be useful for determining following distance more accurately than relying on H1 as discussed above with reference to
For plot 3000 in
As can be seen in both plots 3000 and 3050, the image attribute changes significantly with True Distance from a distance of 0 meters to a distance of approximately 10 to 15 meters. At true distances greater than this, change in H1 and MC becomes notably smaller. That is, at following distances greater than 15 meters, change in position of the first vehicle in the image due to change in following distance becomes low.
For the size and quality of images used in the example of
In some implementations, multiple cumulative measures can be included in the at least one image attribute used in act 2704 of method 2700. Examples are discussed below with reference to
The determined first cumulative measure MC1 and second cumulative measure MC2 can be used as the at least one image attribute in act 2704 of method 2700. The inclusion of an additional cumulative measure further increases accuracy of the following distance determination.
Any number of cumulative measures can be determined as appropriate for a given application, as discussed further with reference to
A third cumulative measure MC3 can be determined using Equation (6) as discussed above, substituting d3A for d1A and d3B for d1B. A fourth cumulative measure MC4 can be determined using Equation (6) as discussed above, substituting d4A for d1A and d4B for d1B.
The determined first cumulative measure MC1, second cumulative measure MC2, third cumulative measure MC3, and fourth cumulative measure MC4, can be used as the at least one image attribute in act 2704 of method 2700. The inclusion of additional cumulative measures further increases accuracy of the following distance determination. Alternatively, any subset of cumulative measures could be used as the at least one image attribute; there is no requirement that all or specific cumulative measures be used.
A fifth cumulative measure MC5 can be determined using Equation (6) as discussed above, substituting d5A for d1A and d5B for d1B. A sixth cumulative measure MC6 can be determined using Equation (6) as discussed above, substituting d6A for d1A and d6B for d1B.
A seventh cumulative measure MC7 can be determined using Equation (6) as discussed above, substituting d7A for d1A and d7B for d1B. An eighth cumulative measure MC8 can be determined using Equation (6) as discussed above, substituting d8A for d1A and d8B for d1B.
The determined first cumulative measure MC1, second cumulative measure MC2, third cumulative measure MC3, fourth cumulative measure MC4, fifth cumulative measure MC5, sixth cumulative measure MC6, seventh cumulative measure MC7, and eighth cumulative measure MC8, can be used as the at least one image attribute in act 2704 of method 2700. The inclusion of additional cumulative measures further increases accuracy of the following distance determination. In the illustrated example, if all eight cumulative measures are used, the at least one image attribute is based on respective positional measures between each corner of the image boundary 2810 and each corner of the first vehicle as represented in the image. Alternatively, any subset of cumulative measures could be used as the at least one image attribute; there is no requirement that all or specific cumulative measures be used.
In pipeline 3700, image data 3702 is accessed as described earlier (e.g. received, retrieved from storage, or captured by an image capture device). At 3704, at least one processor optionally preprocesses the data as appropriate. For example, the image data can be cropped to a defined resolution, or image correction can be applied such as distortion to compensate for skewing in the image due to properties of the image capture device.
At 3706, a vehicle detection model is run on the image. The vehicle detection model can be a model specifically trained to detect vehicles, or can be a general object detection model which is capable of detecting vehicles. As an example, the YOLO 5 model can be run on the image data. If no vehicle is detected, or no vehicle is detected as being in front of a vehicle having a perspective which the image data represents, an indication of not following a vehicle is output at 3714.
If a vehicle is detected, a feature extractor 3708 (e.g. a feature extraction model) is run on the image data (e.g. by any appropriate at least one processor) to identify specific features, such as those discussed above with reference to
By using a feature extractor to identify specific (numerical) features, which are subsequently used by a trained model, efficiency of the pipeline is increased compared to providing the entire image data to a following distance detection model. This is because the specifically identified features are much smaller in data size than the hundreds, thousands, or even millions of pixels in an image.
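A compact Python sketch of such an inference pipeline is given below for illustration. The callable names (vehicle_detector, feature_extractor, distance_model) and the select_lead_vehicle helper are hypothetical placeholders for the components described above, not components of any particular library.

def select_lead_vehicle(detections):
    # Hypothetical helper: pick the detection, if any, of a vehicle positioned
    # ahead in the common lane; here simply the first detection for illustration.
    return detections[0] if detections else None

def run_following_distance_pipeline(image, vehicle_detector, feature_extractor, distance_model):
    detections = vehicle_detector(image)         # 3706: vehicle detection model
    lead = select_lead_vehicle(detections)
    if lead is None:
        return {"following": False}              # not-following indication, as at 3714
    features = feature_extractor(image, lead)    # 3708: numerical feature extraction
    return {"following": True, "distance_m": distance_model(features)}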
In pipeline 3800, image data 3802 is accessed as described earlier (e.g. received, retrieved from storage, or captured by an image capture device). Image data 3802 includes a plurality of images representing following situations (situations where a first vehicle is in front of a second vehicle). Image data 3802 includes distance labels or metadata which indicates a true distance between a vehicle represented in each image (first vehicle) and a vehicle having a perspective which the image data represents (second vehicle). Such data can be labelled real-world data, or simulated data as discussed earlier.
At 3804, at least one processor optionally preprocesses the image data as appropriate. For example, the image data can be cropped to a defined resolution, or image correction, such as distortion correction, can be applied to compensate for skewing in images due to properties of the image capture device.
At 3806, a vehicle detection model is run on at least one image of the plurality of images. The vehicle detection model can be a model specifically trained to detect vehicles, or can be a general object detection model which is capable of detecting vehicles. As an example, the YOLO 5 model can be run on at least one image in the plurality of images.
A feature extractor 3808 (e.g. a feature extraction model) is run on the at least one image to identify specific features related to detected vehicles, such as those discussed above with reference to
For the at least one image, the following distance detection model predicts following distance between the first vehicle and the second vehicle. The predicted following distance is input to a loss function 3812. In this example, the loss function represents a difference between the predicted following distance and the true distance as indicated in the distance label or metadata for the image. Results from the loss function are provided back to the following distance detection model (e.g. by back propagation), so that the following distance detection model can be trained and refined. The process of determining a following distance, evaluating loss function 3812, and providing feedback to train the model, can be repeated as necessary until a satisfactory level of accuracy is achieved (loss function 3812 produces a low enough loss). For example, the process can be repeated for each image in the plurality of images, can be repeated multiple times for any number of images, or can follow any other appropriate scheme. The model as trained can be output at 3814 (e.g. stored at a non-transitory processor-readable storage medium for later use, or transmitted to another device). Further examples are discussed below with reference to
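One non-limiting way such a training loop could be implemented, assuming a PyTorch-style model and optimizer, is sketched below in Python; the mean-squared-error loss and the single-scalar model output are illustrative assumptions rather than requirements of pipeline 3800.

import torch

def train_following_distance_model(model, optimizer, labelled_images, feature_extractor, epochs=10):
    # labelled_images: iterable of (image, true_distance) pairs, where the true
    # distance comes from the distance label or metadata described above.
    loss_fn = torch.nn.MSELoss()
    for _ in range(epochs):
        for image, true_distance in labelled_images:
            features = feature_extractor(image)          # features 3808
            predicted = model(features)                  # predicted following distance
            target = torch.as_tensor([true_distance], dtype=torch.float32)
            loss = loss_fn(predicted.view(-1), target)   # loss function 3812
            optimizer.zero_grad()
            loss.backward()                              # feedback by backpropagation
            optimizer.step()
    return model                                         # trained model output at 3814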
At 3902, image data is accessed by at least one processor of the device performing method 3900. The image data includes at least a first set of images, such as the first plurality of images output at 524 in method 500 discussed with reference to
At 3904, for at least one image in the first set of images, at least one image attribute is determined by the at least one processor. The at least one image attribute is based on a positional measure between the first vehicle as represented in the image, and at least one boundary of the image. The at least one image attribute can be determined for examples as discussed earlier with reference to
At 3910, a following distance loss function is minimized over the first set of images. In the example of
At 3914, the determined loss is compared to a maximum loss threshold by the at least one processor. If the determined loss is not within the maximum loss threshold, method 3900 proceeds to act 3916 where the model is adjusted (e.g. by adjusting weights and biases of the model with the aim of reducing loss). In one exemplary implementation, backpropagation is implemented to adjust weights and biases of the model. One skilled in the art can implement any appropriate model structure and means for adjusting the model, as appropriate for a given application. After the model is adjusted at 3916, method 3900 returns to act 3912, where the following distance loss function is evaluated for at least one image of the first set of images. The at least one image for which the following distance loss function is evaluated can be the same at least one image as before, such that the adjustments to the model are “tested” against the same image data. Alternatively, the at least one image for which the following distance loss function is evaluated can be a different at least one image, such that the model is adjusted by moving through the first set of images.
Acts 3912, 3914, and 3916 can be iterated any appropriate number of times, until loss is within the maximum loss threshold at 3914, in which case method 3900 proceeds to 3918. At 3918, auxiliary criteria for the model are evaluated. If the auxiliary criteria are not satisfied, method 3900 returns to act 3912, where the following distance loss function is evaluated. Auxiliary criteria can include various criteria. As one example, auxiliary criteria can require that the loss function be within a maximum loss threshold for each image in the first set of images. That is, even if the loss function is within a maximum loss threshold for a first image, the auxiliary criteria can require that each image be evaluated prior to outputting the trained model. As another example, auxiliary criteria can require that the loss function be within a maximum loss threshold for at least a defined number of images in the first set of images. That is, even if the loss function is within a maximum loss threshold for a first image, the auxiliary criteria can require that the loss function be within the maximum loss threshold for a defined proportion (e.g. 90%) of the images in the first set of images. As another example, auxiliary criteria can require that the loss function be evaluated for at least a defined proportion of images (e.g. 90%) in the first set of images.
Act 3918 is optional. In one exemplary implementation, evaluating the following distance loss function for at least one image of the first set of images in act 3912 comprises evaluating the following distance loss function for each image of the first set of images (or for a defined number of images in the first set of images), such that criteria regarding quantity of images to be evaluated are inherently satisfied.
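As a small illustration of one form of such auxiliary criteria, the Python sketch below checks that the per-image loss is within the maximum loss threshold for at least a defined fraction of the evaluated images; the function name and the 90% default are assumptions for the sketch.

def auxiliary_criteria_satisfied(per_image_losses, max_loss_threshold, required_fraction=0.9):
    # per_image_losses: loss values for images of the first set that have been evaluated.
    if not per_image_losses:
        return False
    within = sum(1 for loss in per_image_losses if loss <= max_loss_threshold)
    return within >= required_fraction * len(per_image_losses)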
If the auxiliary criteria are satisfied at 3918 (or if act 3918 is not included), method 3900 proceeds to act 3920. At 3920, the model is considered a “trained” model, and is output for use. For example, the trained model can be sent to another device for storage, distribution, and/or application, or can be stored at a non-transitory processor-readable storage medium of the device which performed the training.
Exemplary scenarios for training models (and how auxiliary criteria are implemented) are discussed with reference to
Method 4000 can be implemented for example within the context of detecting tailgating. For example, method 4000 in
Method 4000 is discussed below in the context of a first exemplary scenario illustrated in
Returning to method 4000, at 4002, an image is accessed by a system or device performing method 4000. The image includes a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle. The image further represents a common lane of travel of the first vehicle and the second vehicle. Image 4100 shown in
At least one processor of the system or device which performs method 4000 can optionally preprocess the accessed image data as appropriate. For example, the image data can be cropped to a defined resolution, or image correction, such as distortion correction, can be applied to compensate for skewing in the image due to properties of the image capture device. As examples, radial distortion and/or tangential distortion of the image data can be compensated for. In some implementations, the accessed image data is already pre-processed to be of a desired resolution and/or to have distortion corrected, prior to access and utilization in method 4000.
Further, in some implementations preliminary analysis can be performed to determine whether an image includes a representation of a first vehicle from a perspective of a second vehicle behind the first vehicle. An example of such analysis is discussed earlier with reference to
At 4004, at least one processor determines a first vertical position in the image representing a bottom of the first vehicle. In this regard,
In order to determine the first vertical position, a variety of techniques could be used. In one example, an object detection model (such as a YOLO model) can be run on image 4100, to output a bounding box which surrounds vehicle 4120 (similar to bounding box 2422 shown in
At 4006, a second vertical position 4142 in the image is accessed (e.g. by the at least one processor). The second vertical position 4142 represents a static physical distance from the second vehicle. In this regard, the second vertical position 4142 can be determined during a calibration of the image capture device installed in the second vehicle, where a particular image distance (e.g. number of pixels) from a bottom boundary of images captured by the image capture device is correlated to a particular physical distance from the second vehicle. In the example of
In some implementations, the static physical distance can be 0, such as by placing marker 4212 immediately adjacent vehicle 4220. This can simplify distance calculations, but may not be possible in all configurations, particularly if the marker 4212 cannot be seen in the field of view of image capture device 4224.
Due to perspective, in ordinary image data from an image capture device in a vehicle, the relationship between following distance in the real world (physical distance between the first and second vehicles in image 4100) and image distance in the image data (e.g. quantity of pixels between the first vehicle and the second vehicle as represented in image 4100) is not fixed. That is, the higher up a pixel is vertically in the image (the further forward in physical space the pixel represents), the greater the distance represented by the pixel. Consequently, it is challenging to determine following distance between vehicles based on image data. To address this, method 4000 in
In act 4008, the at least one processor determines a transformed first vertical position by applying an image transformation matrix to the first vertical position determined in act 4004. In act 4010, the at least one processor determines a transformed second vertical position by applying the image transformation matrix to the second vertical position. The image transformation matrix is a matrix which, when applied to image data for a particular image capture device setup, transforms the image data to a bird's eye view of the image. That is, the image transformation matrix transforms the image data to a top-down view, where there is a fixed relationship between physical following distance and image distance (e.g., image distance can be converted to physical distance by applying a fixed ratio which relates image distance to physical distance). That is, in the transformed image, a pixel represents a set physical distance, regardless of the position of the pixel in the image. This is discussed in detail below with reference to
The transformed image data (such as shown in
In some implementations, acts 4008 and 4010 of method 4000 can entail transforming a significant portion of the image, such as shown in
Returning to method 4000 in
At 4014, the at least one processor determines following distance as a physical distance between the first vehicle and the second vehicle based on the image distance determined at 4012 and the static physical distance discussed with reference to
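A condensed Python sketch of acts 4008 through 4014 is given below for illustration, assuming the image transformation matrix and the vertical ratio of image distance to physical distance (pixels per meter in the transformed image) have already been determined as described herein; the function and parameter names are placeholders.

import numpy as np

def transformed_vertical_position(H, x, y):
    # Apply the image transformation matrix (homography) H to point (x, y)
    # and return the vertical coordinate in the transformed (top-down) image.
    p = H @ np.array([x, y, 1.0])
    return p[1] / p[2]

def following_distance(H, first_position, second_position, pixels_per_meter, static_distance_m):
    # first_position: (x, y) of the bottom of the first vehicle (act 4004).
    # second_position: (x, y) at the calibrated second vertical position (act 4006).
    v1 = transformed_vertical_position(H, *first_position)   # act 4008
    v2 = transformed_vertical_position(H, *second_position)  # act 4010
    image_distance = abs(v2 - v1)                            # act 4012
    # act 4014: convert image distance to physical distance and add the static
    # physical distance represented by the second vertical position.
    return image_distance / pixels_per_meter + static_distance_m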
At 4016, the determined following distance is output. In an exemplary implementation where method 4000 is used to determine following distance in act 2206 of method 2200, outputting the following distance can include preparing the following distance for a next act in the method 2200 (e.g. by storing the determined following distance in at least one non-transitory processor-readable storage medium). In another exemplary implementation, outputting the following distance can include outputting the following distance to a driver of the second vehicle, such as via a display or audio output device. In yet another exemplary implementation, outputting the following distance can include providing the determined following distance to a database (e.g. at a device or server such as client device 104, cloud server 106, or local server 118 in
Method 4400 is discussed below in the context of a first example scenario illustrated in
Returning to method 4400, at 4402 an image is accessed by a system or device performing method 4400. The image includes a representation of a lane of travel from a perspective of a vehicle. Image 4500 shown in
At least one processor of the system or device performing method 4400 can optionally preprocess the accessed image as appropriate. For example, the image data can be cropped to a defined resolution, or image correction, such as distortion correction, can be applied to compensate for skewing in the image due to properties of the image capture device. As examples, radial distortion and/or tangential distortion of the image data can be compensated for. In some implementations, the accessed image data is already pre-processed to be of a desired resolution and/or to have distortion corrected, prior to access and utilization in method 4400.
At 4404, at least one processor of the system performing method 4400 determines a first boundary of the lane of travel. At 4406, the at least one processor determines a second boundary of the lane of travel. In the example of image 4500 of
Determining the first boundary and the second boundary can entail determining a mathematical representation for each boundary, for example in accordance with Equation (21) below.
y=mx+b (21)
Equation (21) is a mathematical representation of a straight line, where y is a vertical value, m is slope of the line, x is a horizontal value, and b is the y intercept (where the line crosses the vertical axis when x=0). Determining the first boundary at 4404 can entail determining a first slope m1 and a first y intercept b1 for the first boundary. Likewise, determining the second boundary at 4406 can entail determining a second slope m2 and a second y intercept b2 for the second boundary. Equation (21) is merely exemplary, and alternative equations or mathematical representations for lines could be used instead.
In some scenarios, many lane markings for a plurality of lanes of travel may be visible in an image being analyzed. In such a case, where an origin of the image is at the top-left of the image, lines having negative slope can be considered as being left of the vehicle which carries the image capture device (vehicle having hood 4122 in image 4100), and lines having positive slope can be considered as being right of the vehicle which carries the image device, from the perspective of the image capture device. Further, from a midpoint of the image, the closest line having negative slope can be considered as a left boundary of a lane of travel of the vehicle carrying the image capture device (vehicle having hood 4122 in image 4100), and the closest line having positive slope can be considered as a right boundary of a lane of travel of the vehicle carrying the image capture device. This left boundary and right boundary can be the first boundary and the second boundary determined at 4404 and 4406.
At 4408, the at least one processor determines a vanishing point for the image as a point where the first boundary and the second boundary intersect each other. For example, the equation for the first boundary and the equation for the second boundary can be combined (e.g. one equation substituted into the other) to determine coordinates for the vanishing point. The coordinates for the vanishing point can be expressed as xvp and yvp. In the example of
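For illustration, the short Python snippet below computes the vanishing point from two boundaries expressed in the slope-intercept form of Equation (21); the slopes and intercepts are assumed to be provided by a lane-boundary detection step.

def vanishing_point(m1, b1, m2, b2):
    # Intersect y = m1*x + b1 with y = m2*x + b2.
    if m1 == m2:
        return None                 # parallel boundaries; no finite vanishing point
    x_vp = (b2 - b1) / (m1 - m2)
    y_vp = m1 * x_vp + b1
    return (x_vp, y_vp)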
At 4410, a region of interest in the image is accessed. This region of interest is shown in
The region of interest can be accessed in different ways. In some implementations, the region of interest may be received at least partially as user input. In particular, a user of the system or device performing method 4400 may interact with a user input device to provide at least some attributes of the region of interest. For example, the user may input a position of the top edge (e.g. 4544), the bottom edge (e.g. 4542), a point on the left edge (e.g. 4550), and a point on the right edge (e.g. 4552). As another example, the user may input only the position of the top edge and the bottom edge, and the left edge and right edge can be inferred by the at least one processor (e.g. as edges extending from boundaries 4510 and 4512 of the image towards the vanishing point 4540). In other implementations, the region of interest can be autonomously determined or generated by the at least one processor, by defining the edges of the region of interest within reasonable limits. In particular, the bottom edge of the region of interest should be above a hood of the vehicle as represented in the image data (e.g. hood 4522 in
Returning to method 4400, at 4412, the at least one processor determines a transformed region of interest where a left edge of the transformed region of interest is parallel to a right edge of the transformed region of interest. As can be seen in
To determine the transformed region of interest, coordinates of the four corners are determined by the at least one processor which will result in edges 4646 and 4648 being parallel. To achieve this, the width of top edge 4644 and the width of the bottom edge 4642 should be equal. Thus, setting one corner as an origin, e.g. by setting top-left corner 4654 to have coordinates of (0,0), the remaining corners can be defined relative thereto by an arbitrary size (e.g. 720, in this example). In particular, top-right corner 4656 can have coordinates of (719,0); bottom-left corner 4650 can have coordinates of (0,719), and bottom-right corner 4652 can have coordinates of (719,719). This results in a square transformed region of interest, but in some implementations the vertical size may be different from the horizontal size, which results in a rectangular transformed region of interest.
The at least one processor determines a transformation matrix H (which can also be referred to as a homography matrix), which when applied to points of the region of interest, results in the points of the transformed region of interest, as shown in Equation (22) below.
In Equation (22), u and v are coordinates in the input image, and ut and vt are corresponding coordinates in the transformed image. With reference to the example of
Based on the provided region of interest and transformed region of interest points, an image transformation matrix H can be determined, for example using a known function. The Open Source Computer Vision library (OpenCV) includes such a function, "getPerspectiveTransform". For the exemplary points listed above, H can be calculated as:
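The specific matrix values are not reproduced here; purely as an illustration of the computation, the Python snippet below determines an image transformation matrix using the OpenCV function named above, with made-up region of interest corner coordinates.

import cv2
import numpy as np

# Region of interest corners in the source image: top-left, top-right,
# bottom-left, bottom-right (illustrative pixel coordinates only).
roi_corners = np.float32([[420, 520], [860, 520], [160, 940], [1120, 940]])
# Corresponding corners of the transformed region of interest (a 720 by 720 square).
transformed_corners = np.float32([[0, 0], [719, 0], [0, 719], [719, 719]])

H = cv2.getPerspectiveTransform(roi_corners, transformed_corners)
# H can then be applied to individual points, or to the whole image, e.g.:
# top_view = cv2.warpPerspective(image, H, (720, 720))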
In addition to determining the image transformation matrix H, a ratio of image distance to physical distance for the vertical axis of the image (as transformed according to the image transformation matrix) can also be determined. This can be performed in multiple ways, some examples of which are discussed below with reference to
In another implementation, the at least one processor can first determine a ratio of image distance to physical distance for a horizontal axis of an image (as transformed according to the image transformation matrix), based on a received measurement for a lane of travel and an image distance width of the lane of travel as represented in image data captured by an image capture device in a vehicle.
To determine following distance, a ratio of image distance to physical distance for a vertical axis of the image can be determined by the at least one processor, based on the ratio of image distance to physical distance for the horizontal axis of the image as discussed with reference to
For a particular image capture device (camera), a set of camera matrices can be used which converts real-world coordinates with respect to the camera to pixel coordinates for an image captured by the image capture device, in accordance with Equation (23) below.
Generally, when the image capture device captures image data, the image data is a two-dimensional projection of the three-dimensional world in the field of view of the camera. In this regard, Xw, Yw, and Zw represent an x value, a y value, and a z value for a real-world coordinate in a world coordinate system (i.e. in the Xw axis, Yw axis, and Zw axis discussed above). up and vp represent a horizontal value (in the Up axis) and a vertical value (in the Vp axis), respectively, in a two-dimensional image (e.g. image Ip) captured by the camera, which corresponds to the projected position of the real-world coordinate.
The matrix [r1 r2 r3 t] represents an Extrinsic Matrix for the camera, which converts world coordinates for the world coordinate system to coordinates in a camera coordinate system, and depends on position and orientation of the camera. The matrix [r1 r2 r3 t] is a 4 by 4 matrix where r1, r2, and r3 are rotational vectors and t is a translation vector. Vectors r1, r2, and r3 have the convenient property that the length of each vector is 1, which will be utilized later. In some implementations the extrinsic matrix is not strictly necessary (or is an identity matrix), in particular if the origin of the world coordinate system is set as the camera position in the world and corresponding axes are aligned (such that the world coordinate system and the camera coordinate system are equivalent). This is not the case in the example of
In the example of
To convert a real-world coordinate to a coordinate in a two-dimensional image transformed in accordance with image transformation matrix H discussed earlier, we can combine Equations (22) and (24), such as by using the matrix containing up and vp from Equation (24) as the matrix containing input coordinates u and v in Equation (22), to arrive at Equation (25) below.
As discussed earlier, the image as transformed by image transformation matrix H represents a top view (bird's eye view) of the scene.
A vertical image distance for image It (e.g. number of pixels along axis vt) of a point from the center of the two-dimensional image, when the real-world coordinate which the point is a projection of is 1 meter (or some other appropriate set distance) offset from a center axis of the virtual image capture device 5120 (i.e. 1 meter offset from the Zw axis along the Yw axis) can be represented as ry. Further, from the property of similar triangles (the angle to a real-world coordinate from a camera is equal to the angle from the camera to a two-dimensional projection of the coordinate as captured by the camera), a position of a projected coordinate in image It is proportional to the yw position of the coordinate. Consequently, for a real-world coordinate D a distance d along the Yw axis, the vertical image distance of a projection of coordinate D from a center of the image It is ry*d. That is, ry represents a ratio of vertical image distance in the transformed image It to physical distance along the Yw axis (or pixels in the image per meter in the real world).
Further, the relationship between image distance and position of the real-world coordinate is not necessarily the same in the horizontal direction and the vertical direction of the transformed image. To this end, an image distance in the horizontal direction (Ut axis) of a point from the center of the two-dimensional image It, when the coordinate which the point is a projection of is offset horizontally from the Zw axis (i.e. along the Xw axis) by 1 meter (or some other appropriate set distance) can be represented as rx. The above relationships can be expressed as in Equations (26) and (27) below.
ut = rx·xw + Cx   (26)
vt = ry·yw + Cy   (27)
In the scenario 5100 the virtual image capture device 5120 is positioned with an optical center aligned with axis Zw. As a result, coordinates in virtual images which could be captured by virtual image capture device 5120 (or images transformed in accordance with matrix H) are offset from a center of said image (due to the convention that an origin of image data is at a top-left corner of the image data, though other conventions could be used as appropriate). Consequently, Equation (26) includes an offset for the optical center Cx (in the Ut axis) of virtual image capture device 5120, and Equation (27) includes an offset for the optical center Cy (in the Vt axis) of virtual image capture device 5120. Thus, Equation (26) expresses that a horizontal coordinate in the image It can be determined based on the horizontal image distance to physical distance ratio rx multiplied by the real-world horizontal distance (in the Xw direction), accounting for horizontal offset from the origin of the image data (top left corner of the image It). Similarly, Equation (27) expresses that a vertical coordinate in the image It can be determined based on the vertical image distance to physical distance ratio ry multiplied by the real-world distance in the Yw direction, accounting for offset from the origin of the image data (top left corner of the image It).
Putting Equations (26) and (27) in matrix form results in Equation (28) below:
Comparing Equations (25) and (28) results in Equation (29) below.
Equation (29) can be simplified to Equation (30) below.
As mentioned earlier, vectors r1 and r2 each have a length of 1. Consequently, solving Equation (30) using known linear algebra techniques (row and column operations) results in Equation (31) below.
In Equation (31), h1 is the first column of (HM)−1, and h2 is the second column of (HM)−1. Equation (31) expresses a relationship between a ratio of image distance to physical distance for a vertical axis of the image (ry) to a ratio of image distance to physical distance for a horizontal axis of the image (rx), for an image captured by image capture device 5110. Using this relationship, a ratio of image distance to physical distance for a horizontal axis of the image, such as determined as discussed with reference to
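Since Equation (31) itself is not reproduced here, the Python sketch below implements only an assumed form of that relationship: because r1 and r2 are unit length, the ratio of ry to rx is taken as the ratio of the norms of the first and second columns of (HM)−1. This should be read as an assumption consistent with the description above, not as a restatement of Equation (31).

import numpy as np

def vertical_ratio_from_horizontal(H, M, r_x):
    # H: image transformation matrix; M: camera intrinsic matrix; r_x: ratio of
    # image distance to physical distance for the horizontal axis of the image.
    HM_inv = np.linalg.inv(H @ M)
    h1 = HM_inv[:, 0]       # first column of (HM)^-1
    h2 = HM_inv[:, 1]       # second column of (HM)^-1
    # Assumed relation: ||r1|| = ||r2|| = 1 implies r_y / r_x = ||h1|| / ||h2||.
    return r_x * np.linalg.norm(h1) / np.linalg.norm(h2)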
The procedures described above and with reference to
In the context of many of the methods discussed herein (such as method 2200 in
At least one processor of any of the systems or devices discussed herein can identify vehicle 5220, such as by executing an object detection model. Vehicle 5220 is shown as being identified with a bounding box 5240 in the example of
Further, at least one processor of any of the systems or devices discussed herein can identify boundaries 4134 and 4136 using any of the techniques discussed earlier (for example as discussed with reference to acts 4404 and 4406 in
In the illustrated implementation, vehicle 5220 can be determined as travelling in the same lane as the second vehicle having hood 5222 if both points 5244 and 5246 lie within the boundaries of the lane of travel of the second vehicle (the lane bounded by boundaries 4134 and 4136 in the example of
In some implementations, vehicle 5220 may be determined as travelling in the same lane as the second vehicle having hood 5222 if either of points 5244 and 5246 lies within the boundaries of the lane of travel of the second vehicle (the lane bounded by boundaries 4134 and 4136 in the example of FIG. 52). Such an implementation favors detection of vehicles which can still pose a tailgating risk, even as they are transitioning into or out of the lane of travel, as an example. In such an implementation, vehicle 5220 in
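A small Python sketch of this same-lane check is given below; each boundary is assumed to be available in the slope-intercept form of Equation (21), and the require_both flag switches between the two implementations described above (both bottom corners within the lane, or either bottom corner within the lane).

def x_on_boundary(m, b, y):
    # Horizontal position of a boundary line y = m*x + b at vertical position y.
    return (y - b) / m

def in_same_lane(bbox, left_boundary, right_boundary, require_both=True):
    # bbox: (x_min, y_min, x_max, y_max) of the detected vehicle; the bottom
    # corners (x_min, y_max) and (x_max, y_max) correspond to points such as
    # 5244 and 5246. Each boundary is a (slope, intercept) pair.
    x_min, _, x_max, y_bottom = bbox
    lane_left = x_on_boundary(*left_boundary, y_bottom)
    lane_right = x_on_boundary(*right_boundary, y_bottom)
    left_corner_in = lane_left <= x_min <= lane_right
    right_corner_in = lane_left <= x_max <= lane_right
    return (left_corner_in and right_corner_in) if require_both else (left_corner_in or right_corner_in)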
At 5402, a set of images is accessed. Each image in the set of images is from a perspective of a subject vehicle, and includes a representation of another vehicle positioned in front of the subject vehicle. That is, each image in the set of images represents a following situation, where the subject vehicle is “following” a lead vehicle (another vehicle). Further, each image in the set of images is associated with a respective time. That is, each image can be timestamped (e.g. visually on the image itself, or as metadata associated with the image). Such a timestamp typically corresponds to a time when the image was captured.
Each image in the set of images does not need to represent the same “other” vehicle (or the same lead vehicle). Rather, the set of images can represent a plurality of different following situations where the subject vehicle is positioned behind any other vehicle. It is an objective of method 5400 to monitor following distance over time, as pertinent to the subject vehicle, regardless of what other vehicle the subject vehicle is actually following.
The set of images accessed in act 5402 can for example include images akin to those illustrated in
At least one processor of the system or device performing method 5400 (e.g. of the vehicle device, such as processor 206) can optionally preprocess the accessed set of images as appropriate. For example, the images can be cropped to a defined resolution, or image correction, such as distortion correction, can be applied to compensate for skewing in the image due to properties of the image capture device. As examples, radial distortion and/or tangential distortion of the set of images can be compensated for. In some implementations, the accessed set of images is already pre-processed to be of a desired resolution and/or to have distortion corrected, prior to access and utilization in method 5400.
Further, prior to method 5400 (or as a first step of method 5400), the set of images can be identified, extracted, or filtered from a plurality of images as captured by an image capture device positioned at the subject vehicle. In particular, at least one processor which performs method 5400 can access a plurality of images captured by the image capture device, and identify the set of images as images of the plurality of images which include another vehicle positioned in front of the subject vehicle. That is, the at least one processor can analyze the plurality of images to identify images which represent a following situation where the subject vehicle is positioned behind another vehicle (lead vehicle), and only the identified images can be included in the set of images. Such identification of a following situation can be performed by running an object detection model on the plurality of images, as discussed in detail earlier with reference to at least
Further still, identifying the set of images as images of the plurality of images which include another vehicle positioned in front of the subject vehicle can further comprise identifying the set of images as images of the plurality of images which include another vehicle positioned in front of the subject vehicle AND in a common lane of travel with the subject vehicle, as discussed earlier with reference to at least
At 5410, following distance data is generated (by the at least one processor of the device which performs method 5400), based on at least a subset of images in the set of images. In particular, the set of images may include a large number of images at a high frequency. For example, an image capture device may capture image data at 15, 30, or 60 frames per second (as non-limiting examples). That is, the set of images may include 15, 30, 60, or any other appropriate number of images for each second of time. Following distance over time likely does not need to be determined or logged at this frequency. Instead, following distance over time can be determined at a lower frequency (e.g. 1 Hz, 0.2 Hz, 0.1 Hz; that is, once per second, once per 5 seconds, or once per 10 seconds, though any other appropriate frequency could be utilized instead for a given application). Consequently, the subset of images based on which the following distance data is generated could include only images from the set of images at a preferred frequency (e.g. every 15th, 30th, 60th, 150th, 300th, or 600th image, as non-limiting examples). This advantageously reduces processing burden of method 5400.
In some implementations, the set of images as accessed can already be reduced to include only images at a desired frequency, or accessing the set of images may comprise selectively accessing images at the desired frequency. In such implementations, generating following distance data based on at least a subset of images in the set of images comprises generating the following distance data based on all of the images in the set of images.
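As a trivial illustration, the subset could be selected by stepping through the captured frames to approximate a target frequency, as in the Python sketch below; the frame rate and target frequency values are examples only.

def select_subset(images, capture_fps=30, target_hz=1.0):
    # Keep roughly one image per 1/target_hz seconds of captured data.
    step = max(1, round(capture_fps / target_hz))
    return images[::step]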
Act 5410 in
At 5412, for each image in the subset of images, the at least one processor determines a respective physical distance between the subject vehicle and the other vehicle. This can be performed in accordance with any of the following distance determination techniques or models described herein, including the techniques described with reference to
In some implementations act 5412 comprises applying a following distance determination model to the respective image, the following distance determination model being a machine learning model trained based on minimization of a following distance loss function for a training set of images, each image of the training set representing a respective lead vehicle from a perspective of a following vehicle and being associated with a label indicating following distance between the lead vehicle and the following vehicle. Such an implementation is described in detail earlier with reference to at least
In some implementations act 5412 comprises determining, by the at least one processor, at least one image attribute based on a positional measure between the other vehicle as represented in the respective image and at least one boundary of the image; and applying, by the at least one processor, a following distance determination model to determine a following distance based on the determined at least one image attribute. Such an implementation is described in detail earlier with reference to at least
In some implementations act 5412 comprises determining, by at least one processor, a first vertical position in the image representing a bottom of the other vehicle; accessing, by the at least one processor, a second vertical position in the image representing a static physical distance from the subject vehicle; determining a transformed first vertical position by applying, by the at least one processor, an image transformation matrix to the first vertical position; determining a transformed second vertical position by applying, by the at least one processor, the image transformation matrix to the second vertical position; determining an image distance between the transformed first vertical position and the transformed second vertical position; and determining the following distance as a physical distance between the other vehicle and the subject vehicle based on the determined image distance and the static physical distance. Such an implementation is described in detail earlier with reference to at least
Returning to method 5400 in
Following distance can change relatively quickly. For example, a lead vehicle (other vehicle in method 5400) can slow down abruptly, such that a following vehicle (subject vehicle in method 5400) will also have to adjust its speed accordingly, which commonly results in a change in following distance. As a result, it is desirable to determine following distance over time at a reasonably high frequency (though not necessarily as high frequency as captured image data, as described above). However, when driving there are also commonly long periods where following distance does not change (e.g. highway driving where vehicles are moving at similar speeds), or where a following situation does not occur (e.g. open road). Reporting following distance at a high frequency over such periods is a waste of transmission bandwidth. In view of this, it is desirable to determine and report following distance data at a reasonably high frequency when there are changes in following distance, while avoiding excessive reporting of following distance data for periods where following distance does not change. This is the focus of act 5420 in method 5400. In this context, what is meant by “reporting” following distance data is where a vehicle device determines following distance, and transmits an indication of following distance to another device (e.g. a management device or central server).
At 5420, the at least one processor generates simplified following distance data. This includes sub-acts 5422 and 5424 discussed below.
At 5422, the at least one processor selects data points from the following distance data for inclusion in the simplified following distance data, based on differences between the data points and iteratively-defined reference lines through portions of the following distance data. Examples of how sub-act 5422 can be performed are discussed later with reference to
At 5424, the at least one processor compiles the select data points as the simplified following distance data, and excludes data points which are not identified as select data points.
At 5430, the simplified following distance data is output. As an example, at least one communication interface of the vehicle device transmits the simplified following distance data, to be received by a server or management device (e.g. cloud server 106, client device 104, or local server 118 in
In the example of
Each point of data in
In
In the exemplary implementation, the simplified following distance data is generated by selecting data points from the following distance data, based on differences of the data points to corresponding reference lines. In the scenario of
Further, a candidate data point of the following distance data is identified, where a minimum difference between the data point and the reference line 5530 is greater than a minimum difference between other data points being compared (sequential points between points 5510 and 5512) and the reference line 5530. That is, a candidate data point of the following distance data between points 5510 and 5512 is identified which is the most different from the reference line 5530. In
Following identification of point 5520, a determination is made as to whether the minimum difference between candidate data point 5520 and reference line 5530 exceeds a difference threshold. In practical applications, such a difference threshold could for example be 1 meter, though other difference thresholds are possible as appropriate for a given application. In
Further, the iteratively-defined reference lines are updated to include reference lines which intersect select data point 5520, as is shown in
As discussed earlier with reference to act 5422 in method 5400, the simplified following distance data is generated by selecting data points from the following distance data, based on differences of the data points to reference lines. In the scenario of
Following identification of candidate data point 5521, a determination is made as to whether the minimum difference between candidate data point 5521 and reference line 5535 exceeds a difference threshold. In
Further in the scenario of
Further, a candidate data point of the following distance data is identified, where a minimum difference between the candidate data point and the reference line 5540 is greater than a minimum difference between other data points being compared (sequential data points between points 5510 and 5520) and the reference line 5540. That is, a candidate data point of the following distance data between points 5510 and 5520 is determined which is the most different from the reference line 5540. In
Following determination of point 5522, a determination is made as to whether the minimum difference between point 5522 and reference line 5540 exceeds a difference threshold. In
Further, the iteratively-defined reference lines are updated to include reference lines which intersect select data points 5521 and 5522, as is shown in
As mentioned earlier, the simplified following distance data is generated by selecting data points from the following distance data, based on differences of the data points to reference lines. In the scenario of
Further, a candidate data point of the following distance data is identified, where a minimum difference between the candidate data point and the reference line 5545 is greater than a minimum difference between other data points being compared (sequential points between points 5520 and 5521) and the reference line 5545. That is, a candidate data point of the following distance data between points 5520 and 5521 is identified which is the most different from the reference line 5545. In
Following determination of candidate data point 5523, a determination is made as to whether the minimum difference between candidate data point 5523 and reference line 5545 exceeds a difference threshold. In
Further in the scenario of
A candidate data point of the following distance data is identified, where a minimum difference between the candidate data point and the reference line 5550 is greater than a minimum difference between other data points being compared (sequential points between points 5521 and 5512) and the reference line 5550. That is, a candidate data point of the following distance data between points 5521 and 5512 is identified which is the most different from the reference line 5550. In
Following determination of candidate data point 5524, a determination is made as to whether the minimum difference between candidate data point 5524 and reference line 5550 exceeds a difference threshold. In
Further in the scenario of
Further, a candidate data point of the following distance data is identified, where a minimum difference between the candidate data point and the reference line 5555 is greater than a minimum difference between other data points being compared (sequential points between points 5522 and 5520) and the reference line 5555. That is, a candidate data point of the following distance data between points 5522 and 5520 is identified which is the most different from the reference line 5555. In
Following determination of candidate data point 5525, a determination is made as to whether the minimum difference between candidate data point 5525 and reference line 5555 exceeds a difference threshold. In
Further in the scenario of
A candidate data point of the following distance data is identified, where a minimum difference between the candidate data point and the reference line 5560 is greater than a minimum difference between other data points being compared (sequential points between points 5510 and 5522) and the reference line 5560. That is, a data point of the following distance data between points 5510 and 5522 is identified which is the most different from the reference line 5560. In
Following determination of candidate data point 5526, a determination is made as to whether the minimum difference between candidate data point 5526 and reference line 5560 exceeds a difference threshold. In
Further, the iteratively-defined reference lines are updated to include reference lines which intersect select data points 5525 and 5523, as is shown in
As mentioned earlier, the simplified following distance data is generated by selecting data points from the following distance data, based on differences of the data points to reference lines. In the scenario of
Further, a candidate data point of the following distance data is identified, where a minimum difference between the candidate data point and the reference line 5565 is greater than a minimum difference between other data points being compared (sequential points between points 5522 and 5525) and the reference line 5565. That is, a candidate data point of the following distance data between select data points 5522 and 5525 is identified which is the most different from the reference line 5565. In
Following identification of candidate data point 5526, a determination is made as to whether the minimum difference between candidate data point 5526 and reference line 5565 exceeds a difference threshold. In
Further in the scenario of
Following identification of candidate data point 5527, a determination is made as to whether the minimum difference between candidate data point 5527 and reference line 5570 exceeds a difference threshold. In
Further in the scenario of
Following identification of candidate data point 5528, a determination is made as to whether the minimum difference between candidate data point 5528 and reference line 5575 exceeds a difference threshold. In
Further in the scenario of
Following identification of candidate data point 5529, a determination is made as to whether the minimum difference between candidate data point 5529 and reference line 5580 exceeds a difference threshold. In
In
The process of simplification of following distance data discussed with reference to
The acts of method 5600 are performed for each data series in the following distance data, until all data points are within a difference threshold of a corresponding reference line. In this context, what is meant by "data series" is a sequence of data points, between a first chronological data point of the data series and a last chronological data point of the data series. The following distance data includes at least one data series of following distance data points. With reference to the example of
At 5602, a first data point in the data series and a last data point in the data series are identified as select data points for inclusion in the simplified following distance data. In a first iteration of method 5600, with reference to the example data series of
At 5604, a reference line is defined between the first data point and the last data point. In a first iteration of method 5600, with reference to the example data series of
At 5606, a candidate data point is identified between the first data point and the last data point. A minimum difference between the candidate data point and the reference line is greater than a minimum difference between other data points in the data series and the reference line. In a first iteration of method 5600, with reference to the example data series of
At 5608, a determination is made as to whether the minimum difference between the candidate data point and the reference line is within a difference threshold. If the minimum difference is within the difference threshold, the method proceeds to act 5610, where each data point of the data series between the first data point and the last data point is excluded from inclusion in the simplified following distance data (and eventually to act 5620, where the respective iteration is ended). In a first iteration of method 5600, with reference to the example data series of
At 5612, the candidate data point is identified as a select data point for inclusion in the simplified following distance data. In a first iteration of method 5600, with reference to the example data series of
At 5614, a component data series is defined between the first data point in the data series and the select data point. In a first iteration of method 5600, with reference to the example data series of
The component data series and the other component data series as defined in acts 5614 and 5616 are treated as data series in the context of method 5600, and thus method 5600 is iterated again for the component data series and the other component data series. This is illustrated by way of example in
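The iterative selection described above and in method 5600 resembles a Ramer-Douglas-Peucker style line simplification. A recursive Python sketch is given below for illustration; it measures the difference between a data point and a reference line as the vertical (distance-axis) deviation in meters, which is one reasonable interpretation of the comparison described above.

def deviation_from_line(point, start, end):
    # Vertical deviation (in meters) of a (time, distance) data point from the
    # reference line through the start and end data points.
    t, d = point
    t1, d1 = start
    t2, d2 = end
    if t2 == t1:
        return abs(d - d1)
    d_on_line = d1 + (d2 - d1) * (t - t1) / (t2 - t1)
    return abs(d - d_on_line)

def simplify(series, difference_threshold):
    # series: chronologically ordered (time, distance) data points;
    # difference_threshold: e.g. 1 meter.
    if len(series) < 3:
        return list(series)
    start, end = series[0], series[-1]
    deviations = [deviation_from_line(p, start, end) for p in series[1:-1]]
    candidate = max(range(len(deviations)), key=deviations.__getitem__) + 1
    if deviations[candidate - 1] <= difference_threshold:
        return [start, end]                              # exclude intermediate points (act 5610)
    # The candidate becomes a select data point (act 5612); recurse on the
    # component data series (acts 5614 and 5616).
    left = simplify(series[:candidate + 1], difference_threshold)
    right = simplify(series[candidate:], difference_threshold)
    return left[:-1] + right                             # the select data point is kept once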
Following distance over time is a useful metric for evaluating driver behavior over time, and taking appropriate action based thereon. In particular, a driver can be associated with a vehicle or vehicles over time, and a following distance record can be established for the driver. In an example, a driver may consistently leave significant and safe following distance from lead vehicles, and thus may not trigger tailgating warnings, and thus may not be flagged by an automated driver coaching system. Such a diligent record may be identified over time, and the driver may be rewarded or recognized for their excellent driving habits. In another example, a driver may historically leave significant and safe following distance from lead vehicles, but over time this following distance may decrease, even if no or few specific tailgating instances are identified (for example as discussed below with reference to
In column 5710, a plurality of driver identifiers are listed. While three drivers are illustrated, any appropriate number of drivers can be listed. Further, any appropriate driver identifiers can be used, such as name, identification number or code, nickname, or any other identifier.
In column 5720, a respective following distance history for each driver is indicated. The respective following distance histories are based on following distance data (simplified or not simplified) as output for a plurality of vehicles, such as in accordance with act 5430 of method 5400. In the illustrated example, the respective following distance is indicated per driver. For example, each vehicle can be associated with a particular driver, such that following distance data for the vehicle indicates following distance for the driver. In other examples, drivers may not consistently use a single vehicle, in which case following distance for vehicles can be compared (e.g. by at least one processor of a management device or user terminal) to a vehicle schedule, to compile following distance data per driver. In the illustrated example, following distance history is shown as a visual trend, but in alternative implementations following distance can be indicated in any appropriate way, such as numerically or with other visualizations like a bar graph.
In column 5730, an evaluation is indicated for each driver, which can be used by an operator or manager to take appropriate action to optimize safety. In the illustrated example, while John Smith is shown in column 5720 to have previously maintained a high (safe) following distance, in recent times this following distance has decreased. John Smith is thus evaluated as being At Risk in column 5730. In response, corrective action can be taken, such as coaching, reminders, or training, to help John Smith return to former safe driving habits. Further in the illustrated example, Jane Doe is shown in column 5720 to have previously maintained a low (unsafe) following distance. However, Jane Doe's following distance habits improved after a previous coaching event (shown as a sudden increase in following distance in column 5720). Thus, Jane Doe's Evaluation in column 5730 is shown as having Improved. Jane Doe can be rewarded for this improvement. Further still in the illustrated example, Bubba Jones is shown in column 5720 to have consistently maintained a high (safe) following distance, in the past and as far as shown in the present data. Bubba Jones can be rewarded for excellent driving practices.
In some implementations, tailgating events can be identified based on the simplified following distance data output at 5430 in method 5400. In particular, for any (or for each) data point in the simplified following distance data, at least one processor (e.g. a processor at a management device or server where the simplified following distance data is received) can determine whether a respective physical distance between the subject vehicle and the other vehicle for the data point is within tailgating distance criteria. If the respective physical distance is outside of the tailgating distance criteria, the at least one processor identifies that the subject vehicle is not tailgating the other vehicle at a time corresponding to the data point (e.g. at a timestamp associated with the data point). On the other hand, if the tailgating criteria are met (e.g. the respective physical distance is within the tailgating distance criteria), the at least one processor identifies that the subject vehicle is tailgating the other vehicle at a time corresponding to the data point (e.g. at a timestamp associated with the data point).
Such determination of tailgating events and tailgating criteria are discussed earlier with reference to
Data points where tailgating is identified can be logged, labelled, or otherwise flagged as tailgating events. In some implementations, in response to identification that the subject vehicle is tailgating the other vehicle, a tailgating indication is output. For example, an alert or notification can be presented which indicates the tailgating event (such as shown for example in
Identifying tailgating events based on the simplified following distance data generated at 5420 (as opposed to full following distance data generated at 5410) advantageously reduces processing burden associated with identifying tailgating events. In some implementations, identification of tailgating events as discussed above can be performed at (by at least one processor of) a vehicle device positioned at a vehicle, where processing resources tend to be particularly limited. In such implementations, outputting a tailgating indication by the vehicle device can entail transmitting the tailgating indication, by a communication interface of the vehicle device, to a remote management device or server. In other implementations, identification of tailgating events as discussed above can be performed remote from the vehicle device (e.g. at a management device or server which receives the simplified following distance data output at 5430). In this way, processing burden at the vehicle device can be further alleviated, and the management device or server can still reasonably identify tailgating events based on the simplified following distance data generated at 5420 without the transmission burden of transmitting the full following distance data generated at 5410.
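A minimal Python sketch of this identification step is shown below, assuming a fixed distance threshold as the tailgating distance criteria; in practice the criteria may also depend on speed or other factors discussed elsewhere herein, and the threshold value is illustrative only.

def identify_tailgating_events(simplified_points, tailgating_distance_m=10.0):
    # simplified_points: (timestamp, distance_m) data points of the simplified
    # following distance data output at 5430.
    events = []
    for timestamp, distance_m in simplified_points:
        if distance_m <= tailgating_distance_m:      # within tailgating distance criteria
            events.append(timestamp)                 # subject vehicle tailgating at this time
    return events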
At 5802, respective simplified following distance data is received for a plurality of subject vehicles. That is, for each subject vehicle in the plurality of subject vehicles, method 5400 in
At 5804, location data is received by a communication interface for the plurality of subject vehicles. Such location data is captured by a location sensor positioned at each vehicle (e.g. as a location sensor integrated in each vehicle, or included in a telematics device installed to each vehicle, or any other appropriate implementation, such as location module 208 discussed earlier with reference to
At 5806, respective tailgating indications are accessed for the plurality of subject vehicles. Such tailgating indications can be generated as discussed above. In some implementations, tailgating indications for each subject vehicle are generated at each respective subject vehicle, and transmitted to the device performing method 5800. In such implementations, accessing the respective tailgating indications can comprise receiving the respective tailgating indications by a communication interface of the device performing method 5800. In some implementations, respective tailgating indications are determined or generated at the device performing method 5800. In such implementations, accessing the respective tailgating indications can comprise generating the respective tailgating indications by at least one processor of the device performing method 5800 as discussed earlier. In some implementations, tailgating indications for the plurality of vehicles (whether generated at vehicle devices or at a management device or server) can be stored at a non-transitory processor-readable storage medium (such as at a management device or server, or remote database accessible to the management device or server). In such implementations, accessing the respective tailgating indication at 5806 can comprise retrieving or otherwise accessing the stored tailgating indications.
At 5808, for each tailgating indication, the at least one processor associates a location of the respective subject vehicle (to which a given tailgating indication applies) at a time of the tailgating event indicated by the tailgating indication, with a roadway segment corresponding to the location. In this context, a roadway segment can refer to any appropriate delineation of a roadway where vehicles travel, such as an intersection, a section of road between intersections, a highway, a particular lane of a road, etcetera.
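One simple way to perform this association is to match each event location to the nearest roadway segment reference point, as in the sketch below. Representing each segment by a single (segment_id, lat, lon) reference point is a deliberate simplification of real map matching against segment geometry, and both the data layout and the function names are illustrative assumptions.

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS-84 coordinates."""
    earth_radius_m = 6371000.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * earth_radius_m * math.asin(math.sqrt(a))

def associate_with_segment(event_lat, event_lon, segments):
    """Return the id of the roadway segment whose reference point is nearest
    to the event location. `segments` is an iterable of
    (segment_id, lat, lon) reference points."""
    return min(segments,
               key=lambda seg: haversine_m(event_lat, event_lon,
                                           seg[1], seg[2]))[0]
```

The event location itself would come from the location data received at 5804, for example by taking the location sample nearest in time to the tailgating event.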
At 5810, the at least one processor quantifies tailgating indications for roadway segments of interest. For example, for each roadway segment of interest, the at least one processor can sum a total quantity of tailgating indications. As another example, for each roadway segment of interest, the at least one processor can determine a quantity of tailgating indications that pertain to a certain period of time. As yet another example, for each roadway segment of interest, the at least one processor can determine a rate of tailgating indications per unit of time (e.g. determine number of tailgating indications per hour, or any other appropriate unit of time).
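The quantification at 5810 can be as simple as counting indications per segment, optionally restricted to a time window and normalized to a per-hour rate. The sketch below assumes `associated_events` is an iterable of (segment_id, event_time_s) pairs produced by the association at 5808; the function names are hypothetical.

```python
from collections import Counter

def count_indications_per_segment(associated_events):
    """Total tailgating indications per roadway segment.

    `associated_events` is an iterable of (segment_id, event_time_s) pairs."""
    return Counter(segment_id for segment_id, _ in associated_events)

def indications_per_hour(associated_events, window_start_s, window_end_s):
    """Rate of tailgating indications per hour within an observation window."""
    hours = max((window_end_s - window_start_s) / 3600.0, 1e-9)
    in_window = [(s, t) for s, t in associated_events
                 if window_start_s <= t <= window_end_s]
    return {segment_id: n / hours
            for segment_id, n in count_indications_per_segment(in_window).items()}
```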
Roadway segments of interest can be determined or indicated in any appropriate way. As one example, an operator can provide user input to the device performing method 5800 (e.g. using any appropriate hardware, such as that discussed with reference to
At 5812, the at least one processor identifies at least one high-risk roadway segment. The at least one high-risk roadway segment is identified where an amount of associated tailgating indications exceeds a tailgating risk threshold. That is, tailgating events for the high-risk roadway segment are identified as occurring to a noteworthy extent. The tailgating risk threshold can comprise any appropriate metric as suitable for a given application. In some implementations, the tailgating risk threshold can comprise a threshold amount of tailgating events within a certain time period. For example, the tailgating risk threshold could comprise 50 tailgating events per calendar day. That is, for a particular roadway segment, if more than 50 tailgating indications are quantified in a given calendar day at 5810, the particular roadway segment is identified as a high-risk roadway segment (at least for the particular calendar day). In some other implementations, the tailgating risk threshold can comprise a threshold amount of tailgating events per unit of time. For example, the tailgating risk threshold could comprise 10 tailgating events per hour. That is, for a particular roadway segment, if more than 10 tailgating indications are quantified within any 60 minute period at 5810, the particular roadway segment is identified as a high-risk roadway segment.
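The two example thresholds above (50 indications per calendar day, and 10 indications within any 60-minute window) can be checked as sketched below. Interpreting the calendar day in UTC and representing timestamps as Unix seconds are assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime, timezone

def high_risk_by_daily_count(associated_events, daily_threshold=50):
    """Segments with more than `daily_threshold` indications in any one
    calendar day (UTC calendar days assumed for illustration)."""
    per_day = defaultdict(int)
    for segment_id, event_time_s in associated_events:
        day = datetime.fromtimestamp(event_time_s, tz=timezone.utc).date()
        per_day[(segment_id, day)] += 1
    return {segment_id for (segment_id, _), n in per_day.items()
            if n > daily_threshold}

def high_risk_by_rolling_hour(associated_events, hourly_threshold=10):
    """Segments with more than `hourly_threshold` indications within any
    60-minute window, found with a sliding window over sorted event times."""
    by_segment = defaultdict(list)
    for segment_id, event_time_s in associated_events:
        by_segment[segment_id].append(event_time_s)
    high_risk = set()
    for segment_id, times in by_segment.items():
        times.sort()
        left = 0
        for right, t in enumerate(times):
            while t - times[left] > 3600.0:
                left += 1
            if right - left + 1 > hourly_threshold:
                high_risk.add(segment_id)
                break
    return high_risk
```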
Following distance data and tailgating indications may not be available for every vehicle which travels through a roadway segment of interest. For example, on public roads many vehicles may not collect or report following distance data, or may not be integrated with a system which collects such data. Consequently, quantification of tailgating indications as in act 5810 of method 5800 can be a representative quantification, based on available data. For example, if an estimated 10% of vehicles which travel through a roadway segment of interest collect and report following distance data and/or tailgating indications available to the device which performs method 5800, then a tailgating risk threshold can be adjusted under the assumption that ten times more vehicles actually travel through the roadway segment of interest. In such an example, tailgating events could be assumed to be ten times higher than as indicated by the tailgating indications available to the system which performs method 5800.
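The adjustment for partial reporting reduces to simple scaling, as in the sketch below; the 10% reporting fraction mirrors the example above, and the helper names are hypothetical.

```python
def extrapolated_event_count(observed_count, reporting_fraction=0.10):
    """Estimate total tailgating events from observed indications when only a
    fraction of vehicles report data; with 10% reporting, 7 observed
    indications suggest roughly 70 actual events."""
    return observed_count / reporting_fraction

def adjusted_risk_threshold(nominal_threshold, reporting_fraction=0.10):
    """Equivalently, scale the tailgating risk threshold down so it can be
    compared directly against the observed (under-reported) counts."""
    return nominal_threshold * reporting_fraction
```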
At least one indication of any identified high-risk roadway segments can be output, for example via a user interface to an operator or manager as shown in
Identification of high-risk roadway segments can be useful for a manager or other pertinent entities who determine, advise, and/or implement roadway changes and improvements. In particular, increased occurrence of tailgating in an area (compared to other areas or compared to historical tailgating occurrences) is a symptom of vehicle congestion and/or other impedances to smooth traffic flow, which in turn contribute to increased collisions or other dangerous situations. By identifying high-risk roadway segments, action can be taken to reduce risk in these roadway segments. For example, additional lanes could be added to a roadway segment to increase vehicle throughput, thereby reducing congestion and in turn reducing tailgating events. As another example, particular connections or relationships between roadway segments (e.g. intersections) may cause merging or intersecting traffic flows to impede vehicles. For example, an on-ramp to a highway may not be sufficiently long for some vehicles to accelerate to highway speed prior to merging into the flow of highway traffic. This can impede flow of the highway traffic and cause an increase in tailgating events. This could be addressed by increasing length of the on-ramp or increasing length of an acceleration lane adjacent the highway.
In the example of
At 6002, respective simplified following distance data is received for a plurality of subject vehicles. That is, for each subject vehicle in the plurality of subject vehicles, method 5400 in
At 6004, location data is received by a communication interface for the plurality of subject vehicles. Such location data is captured by a location sensor positioned at each vehicle (e.g. as a location sensor integrated in each vehicle, or included in a telematics device installed to each vehicle, or any other appropriate implementation such as location module 208 discussed earlier with reference to
At 6006, for each data point in the simplified following distance data for each subject vehicle of the plurality of subject vehicles, the at least one processor of the device which performs method 6000 associates a location of the respective subject vehicle at a respective time of the data point with a roadway segment corresponding to the location of the respective subject vehicle at the respective time. That is, location, time and following distance for each data point and each subject vehicle are associated, thus representing a dataset indicative of following distance by vehicles across locations and times. Similarly to as discussed earlier, a roadway segment can refer to any appropriate delineation of a roadway where vehicles travel, such as an intersection, a section of road between intersections, a highway, a particular lane of a road, etcetera.
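The association at 6006 can be expressed as grouping each vehicle's (timestamp, following distance) data points by the roadway segment matched to the vehicle's location at that time. In the sketch below, `locate` and `match_segment` are hypothetical helpers standing in for the location data received at 6004 and the segment matching discussed above.

```python
from collections import defaultdict

def group_points_by_segment(vehicle_points, locate, match_segment):
    """Group (timestamp, following distance) data points by roadway segment.

    `vehicle_points` maps vehicle_id -> [(timestamp_s, following_distance_m), ...];
    `locate(vehicle_id, timestamp_s)` returns an interpolated (lat, lon);
    `match_segment(lat, lon)` returns a segment id. All three are hypothetical
    structures used only to make the grouping step concrete."""
    segment_points = defaultdict(list)
    for vehicle_id, points in vehicle_points.items():
        for timestamp_s, following_distance_m in points:
            lat, lon = locate(vehicle_id, timestamp_s)
            segment_id = match_segment(lat, lon)
            segment_points[segment_id].append((timestamp_s, following_distance_m))
    return segment_points
```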
At 6008, the at least one processor of the device which performs method 6000 quantifies following distance for the plurality of vehicles for at least one roadway segment of interest. For example, for each roadway segment of interest, the at least one processor can determine an average or median following distance by subject vehicles of the plurality of subject vehicles which pass through the roadway segment of interest. As another example, for each roadway segment of interest, the at least one processor can determine an average or median following distance by subject vehicles of the plurality of subject vehicles which pass through the roadway segment of interest, within a certain period of time. As yet another example, for each roadway segment of interest, the at least one processor can determine a quantity of time that subject vehicles which pass through the roadway segment of interest spend at a following distance below a safe following distance threshold.
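The quantification at 6008 can then be computed from the grouped data points, for example as per-segment mean and median following distances, or as an approximate time spent below a safe following distance threshold. The sketch below assumes the grouping sketched above; the 25 m threshold and the interval-summing approximation are illustrative assumptions.

```python
import statistics

def summarize_following_distance(segment_points):
    """Per-segment mean and median following distance.

    `segment_points` maps segment_id -> [(timestamp_s, following_distance_m), ...]."""
    return {
        segment_id: {
            "mean_m": statistics.mean(d for _, d in points),
            "median_m": statistics.median(d for _, d in points),
        }
        for segment_id, points in segment_points.items()
    }

def time_below_threshold_s(points, safe_threshold_m=25.0):
    """Approximate time spent below a safe following distance by summing the
    intervals between consecutive data points that are both below the
    threshold; the 25 m value is illustrative only."""
    points = sorted(points)
    return sum(t1 - t0
               for (t0, d0), (t1, d1) in zip(points, points[1:])
               if d0 < safe_threshold_m and d1 < safe_threshold_m)
```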
Roadway segments of interest can be determined or indicated in any appropriate way, similarly to as discussed above regarding method 5800 in
Optionally, the at least one processor can identify at least one high-risk roadway segment. For example, the at least one high-risk roadway segment can be identified as high-risk when a following distance for the plurality of subject vehicles as quantified at 6008 is below a safe following distance threshold. That is, following distance is determined (on average or cumulatively) to be unsafe, and consequently the roadway segment is identified as high-risk. The safe following distance threshold can comprise any appropriate metric as suitable for a given application, such as those discussed earlier with reference to
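Building on the per-segment summary sketched above, this optional identification reduces to a threshold comparison; comparing against the median following distance and the 25 m value are illustrative assumptions only.

```python
def high_risk_by_following_distance(summary, safe_threshold_m=25.0):
    """Flag roadway segments whose median following distance, from the summary
    sketched above, falls below a safe following distance threshold."""
    return {segment_id for segment_id, stats in summary.items()
            if stats["median_m"] < safe_threshold_m}
```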
Following distance as associated with roadway segments can be presented to a user in any appropriate manner. For example, a user interface akin to that shown in
While the present invention has been described with respect to non-limiting embodiments, it is to be understood that the invention is not limited to the disclosed embodiments. Persons skilled in the art will understand that the disclosed invention is intended to cover various modifications and equivalent arrangements included within the scope of the appended claims. Thus, the present invention should not be limited by any of the described embodiments.
Throughout this specification and the appended claims, infinitive verb forms are often used, such as “to operate” or “to couple”. Unless context dictates otherwise, such infinitive verb forms are used in an open and inclusive manner, such as “to at least operate” or “to at least couple”.
The Drawings are not necessarily to scale and may be illustrated by phantom lines, diagrammatic representations, and fragmentary views. In certain instances, details that are not necessary for an understanding of the exemplary embodiments or that render other details difficult to perceive may have been omitted.
The specification includes various implementations in the form of block diagrams, schematics, and flowcharts. A person of skill in the art will appreciate that any function or operation within such block diagrams, schematics, and flowcharts can be implemented by a wide range of hardware, software, firmware, or a combination thereof. As non-limiting examples, the various embodiments herein can be implemented in one or more of: application-specific integrated circuits (ASICs), standard integrated circuits (ICs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), computer programs executed by any number of computers or processors, programs executed by one or more control units or processor units, firmware, or any combination thereof.
The disclosure includes descriptions of several processors. Said processors can be implemented as any hardware capable of processing data, such as application-specific integrated circuits (ASICs), standard integrated circuits (ICs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), logic circuits, or any other appropriate hardware. The disclosure also includes descriptions of several non-transitory processor-readable storage mediums. Said non-transitory processor-readable storage mediums can be implemented as any hardware capable of storing data, such as magnetic drives, flash drives, RAM, or any other appropriate data storage hardware. Further, mention of data or information being stored at a device generally refers to that data or information being stored at a non-transitory processor-readable storage medium of said device.