A system configured for tracking a human hand, e.g., in an interactive space, may comprise a distancing device, one or more physical processors, and/or other components. The distancing device may be configured to generate output signals conveying position information. The position information may include positions of surfaces of real-world objects, including surfaces of a human hand. A group of points may be identified that lie on a candidate surface which is a candidate for being the surface of a hand. The edges of the candidate surface may be detected. The points furthest away from the closest edge may be determined. A model may be created by connecting at least some of these points. sections of the model are identified that may be a digit of the hand. Per section, fingertips and metacarpophalangeal joints may be identified. The model may be analyzed to determine whether the candidate surface is the surface of a hand.
|
1. A system configured for identifying and tracking a human hand, the system comprising:
a distancing device configured to generate output signals conveying position information, the position information including positions of surfaces of real-world objects, wherein the position information comprises point cloud information, the point cloud information including positions of a set of points that lie on the surfaces of the real-world objects, such that the positions are on or near the surfaces of the real-world objects; and
one or more physical processors configured by machine-readable instructions to:
obtain the position information derived from the output signals of the distancing device;
identify a group of points that is a subset of the set of points such that the group of points lie on a candidate surface which is a candidate for being a surface of a hand;
detect a set of surface edges of the candidate surface, wherein detection of the set of edges is based on depth information of individual points in the group of points;
determine a subset of points that is a subset of the group of points, wherein the determination is based on a relation between individual points in the group of points and one or more surface edges from the set of surface edges;
create a model that connects neighboring points in the determined subset of points;
identify one or more sections of the model such that an individual section is a candidate section which is a candidate for being a finger or thumb of the hand;
identify tips and connective joints for individual candidate sections from the one or more candidate sections such that an individual tip is a candidate tip which is a candidate for being a fingertip of an individual finger or thumb and an individual connective joint is a candidate connective joint which is a candidate for being a connective joint that connects the individual finger or thumb to the hand; and
determine whether the candidate surface is the surface of the hand based on an analysis of the identified tips and connective joints.
16. A method of identifying and tracking a human hand, the method being implemented in a computer system comprising one or more physical processors and non-transitory storage media storing machine-readable instructions, the method comprising:
obtaining position information, the position information being derived from output signals of a distancing device, the position information including positions of surfaces of real-world objects, wherein the position information comprises point cloud information, the point cloud information including positions of a set of points that lie on the surfaces of the real-world objects, such that the positions are on or near the surfaces of the real-world objects;
identifying a group of points that is a subset of the set of points such that the group of points lie on a candidate surface which is a candidate for being a surface of a hand;
detecting a set of surface edges of the candidate surface, wherein detection of the set of edges is based on depth information of individual points in the group of points;
determining a subset of points that is a subset of the group of points, wherein the determination is based on a relation between individual points in the group of points and one or more surface edges from the set of surface edges;
creating a model that connects neighboring points in the determined subset of points;
identifying one or more sections of the model such that an individual section is a candidate section which is a candidate for being a finger or thumb of the hand;
identifying tips and connective joints for individual candidate sections from the one or more candidate sections such that an individual tip is a candidate tip which is a candidate for being a fingertip of an individual finger or thumb and an individual connective joint is a candidate connective joint which is a candidate for being a connective joint that connects the individual finger or thumb to the hand; and
determining whether the candidate surface is the surface of the hand based on an analysis of the identified tips and connective joints.
2. The system of
determine, for individual points in the group of points, which individual surface edges in the set of surface edges are closest to the individual points such that individual points are associated with closest individual surface edges; and
determine, for individual points in the group of points, distances to the associated closest individual surface edges;
wherein determination of the subset of points is performed such that individual points in the subset of points have equal or greater distances to the associated closest individual surface edges than neighboring points in the group of points.
3. The system of
4. The system of
determine one or more distances between the identified tips and/or between the identified connective joints;
wherein the analysis of the identified tips and connective joints is based on the determined one or more distances.
5. The system of
determine directional vectors for the individual candidate sections,
wherein the analysis of the identified tips and connective joints is based on comparisons of the determined directional vectors.
6. The system of
7. The system of
8. The system of
9. The system of
10. The system of
11. The system of
12. The system of
a light source configured to emit light; and
an optical element the optical element being configured to reflect light emitted from the light source in a perceived three-dimensional light field within a field-of-view of a user of the interactive space.
13. The system of
generate views of virtual content to be perceived within the field-of-view of the user; and
facilitate user interaction with the virtual content based on one or more of a position of the hand, a gesture of the hand, and a movement of the hand.
15. The system of
17. The method of
determining, for individual points in the group of points, which individual surface edges in the set of surface edges are closest to the individual points such that individual points are associated with closest individual surface edges; and
determining, for individual points in the group of points, distances to the associated closest individual surface edges,
wherein determination of the subset of points is performed such that individual points in the subset of points have equal or greater distances to the associated closest individual surface edges than neighboring points in the group of points.
18. The method of
determining directional vectors for the individual candidate sections, wherein the analysis of the identified tips and connective joints is based on comparisons of the determined directional vectors.
19. The method of
20. The method of
21. The method of
22. The method of
23. The method of
24. The method of
25. The method of
emitting light, by a light source; and
reflecting light emitted by the light source in a perceived three-dimensional light field within a field-of-view of a user of the interactive space.
26. The method of
generating views of virtual content to be perceived within the field-of-view of the user; and
facilitating user interaction with the virtual content based on one or more of a position of the hand, a gesture of the hand, and a movement of the hand.
27. The method of
28. The method of
29. The method of
determining one or more distances between the identified tips and/or between the identified connective joints;
wherein the analysis of the identified tips and connective joints is based on the determined one or more distances.
30. The method of
|
The system and methods described herein relate to identifying and tracking a human hand in an interactive space based on the approximated center-lines of the digits of a user's hand(s), as they are sensed within the interactive space.
There are many types of vision systems that provide hand-based and gesture-based human-computer interaction. In particular, wearable technology such as head mounted displays (HMD) are becoming more popular. Features of the human body may be tracked to facilitate gesture-based input into, e.g., an augmented reality environment.
The systems and methods described herein facilitate identifying and tracking a human hand, e.g., in an interactive space. In some implementations, a hand may be detected by identifying individual digits of the hand. A distancing device (e.g., a depth sensor) may be configured to generate output signals conveying position information of the hand. The position information may include positions of surfaces of real-world objects, including a user's hands. A group of points may be identified as a candidate surface, which is a candidate for being identified as the surface of a hand. Subsets of this group of points may be identified as a candidate for being a digit. Individual subsets of the group of points may be processed to determine which ends of the identified digits are the fingertips and which ends are metacarpophalangeal joints that connect a digit to the rest of the hand. Based on this information, the system may determine whether the candidate surface is a hand. The position information of the hand may be obtained based on a sampling rate of the distancing device.
In some applications, the presence of an arm is a useful feature for hand detection in the system described herein. For example, a set of points that are a candidate for being the surface of a hand will not float in the air but should be connected to points forming the surface of an arm. However, watches, jewelry, sleeves, and/or other wearable items may prevent accurate detection of the arm surface points. In such cases, hand detection may be performed by processing the group of points that is a hand candidate, and analyzing it without assuming or relying on the presence of an arm.
The interactive space may include one or more of a virtual reality environment, an augmented reality environment, and/or other interactive spaces. An “augmented reality environment,” as used herein, may refer to a space that represents a virtual environment that may be superimposed over a perspective of a physical real-world environment around a user. An augmented reality environment may include attributes of a virtual environment, including virtual objects superimposed over portions of the physical environment. A virtual reality environment may refer to a space that includes the virtual environment. The terms “space” and “environment” may be used interchangeably herein.
It is noted that while one or more features and/or functions of the system and method presented here may be directed to interactive spaces, this is for illustrative purposes only and is not to be considered limiting. For example, in some implementations, one or more features and/or functions of the system and method presented may be similarly applied to virtual environments, entertainment and infotainment systems, gesture control of mobile devices, in-vehicle computer systems, and/or other systems and/or environments.
These and other objects, features, and characteristics of the present disclosure, as well as the methods of operation and functions of the related components of structure and the combination of parts and economies of manufacture, will become more apparent upon consideration of the following description and the appended claims with reference to the accompanying drawings, all of which form a part of this specification, wherein like reference numerals designate corresponding parts in the various figures. It is to be expressly understood, however, that the drawings are for the purpose of illustration and description only and are not intended as a definition of the any limits. As used in the specification and in the claims, the singular form of “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise.
In some implementations, user input may comprise gesture-based input and/or other input. In some implementations, gesture-based input may be based on identifying and tracking of one or more human features of a user. By way of non-limiting illustration, gesture-based input may be based on tracking one or more of a hand, an arm, a torso, and/or other features of a user. By identifying and tracking a user's hands, gestures including one or more of reaching, grabbing, releasing, touching, swiping, pointing and/or other gestures may be identified. In some implementations, simulated manipulation of virtual content in an interactive space may be based on user input and/or other input. The identified gestures may be provided as input for simulating manipulation of virtual content within a user's field-of-view. For example, an identified gesture may be correlated to a perceived position of a virtual object within a user's field-of-view. The virtual object may be configured to react to the gesture in an interactive manner.
System 100 may include one or more of one or more physical processors 104, one or more distancing devices 120, one or more light sources 122, one or more optical elements 124, electronic storage 119, and/or other components. In some implementations, one or more components of system 100 may be incorporated in a head-mounted display (HMD), and/or other devices. By way of non-limiting illustration, one or more optical elements 124 may comprise at least part of a visor of an HMD. An optical element may comprise one or more reflective, or partially reflective surfaces on a visor. An optical element may be formed from a reflective or partially reflective material. A reflective and/or partially reflective material may comprise one or more of cyclic olefin copolymer (COC) such as ZEONEX®, polycarbonate, polymethyl methacrylate (PMMA), and/or other materials.
In some implementations, individual light sources of one or more light sources 122 may comprise one or more of a liquid crystal display (LCD), a light emitting diode (LED) display, surface-conduction electron-emitter display (SED), a plasma display, an organic light emitting diode (OLED) display, a Micro-Electro-Mechanical Systems (MEMS) mirror based display, liquid crystal on silicon (LCoS) display, digital light projector (DLP), a laser, an RGB laser, and/or other light sources. An individual light source may comprise an array of addressable optical components. Individual optical components may be individually controllable to impact one or more aspects of light to create digital images. By way of non-limiting illustration, individual optical components of a display may be individually controllable to impact one or more of transmissivity, brightness, generation, reflection, refraction, and/or other aspects of light to create images.
A set of components comprising at least one light source 122 and at least one optical element 124 may be referred to herein as an “image-forming component.” An image-forming component may be configured to emit light rays, via a light source, forming images of virtual content. When headset 102 is installed on the head of the user, the light rays may be provided, via an optical element 124, to an eye of the user. The images formed by the light rays may be superimposed over views of the real world to create an interactive space.
In some implementations, one or more light sources 122 may be positioned relative to one or more optical elements 124, such that light emitted from one or more light sources 122 may be reflected by one or more optical elements 124 to one or more eyes of the user. The user may perceive the reflected light as virtual content in a perceived three-dimensional light field within their field-of-view. In some implementations, control of position and/or generation of light from individual light sources 122 may impact one or more aspects of a perceived three-dimensional light field and/or virtual content within the perceived the three-dimensional light field. One or more aspects of a perceived three-dimensional light field may include one or more of a perceived distance of the three-dimensional light field from the user, a depth of the perceived three-dimensional light field, and/or other aspects of the three-dimensional light field. One or more aspects of virtual content within a three-dimensional light field may include one or more of a perceived distance of the virtual content from a user, a depth of the virtual content, a size of the virtual content, and/or other aspects of the virtual content. For example, in some implementations, moving a position of an individual light source closer to an optical element may cause virtual content to be perceived closer to the user; while moving the position of the individual light source farther from the optical element may cause the virtual content to be perceived farther from the user.
By way of non-limiting illustration,
Returning to
In some implementations, individual distancing devices of one or more distancing devices 120 may be configured to generate output signals such that the position information is organized in a grid, e.g., a regular grid. In some implementations, the position information may form a regular grid of points for which depth information has been captured and/or generated. By way of non-limiting example,
In some implementations, a position of a surface may be expressed as one or more of distance from an individual distancing device, an angle from reference line (e.g., a horizontal line), a three-dimensional spatial coordinate in a coordinate system, and/or other information. For example, in a three-dimensional Cartesian coordinate system, a spatial coordinate may be expressed as individual values of one or more of an x-component, y-component, z-component, and/or other components.
In some implementations, position information may comprise point cloud information. Point cloud information may include positions of points in individual sets of points, referred to as “point clouds.” In some implementations, individual points in a point cloud may have real-world spatial coordinates corresponding to individual positions of individual surfaces in the real-world. A set of points may correspond to positions of a set of surfaces within a range of vision of a distancing device. Individual points may be associated with individual time stamps. An individual time stamp may correspond to an individual point in time when an individual a position of an individual surface may have been measured (e.g., based on a sampling rate of the distancing device). It is noted that one or more references to “a surface” or “surfaces” may refer to “a point” or “points,” respectively, within a point cloud that lie on the surfaces and convey position of the surfaces in the real-world. In some implementations, a set of surfaces and/or a set of points may form a contiguous surface. In some implementations, a set of surfaces and/or a set of points may make up a contiguous object.
In some implementations, system 100 may use a single distancing device 120, which may generate output signals originating from a single point-of-view. In such a case, part of a hand may occlude one or more fingers. As a result, the point cloud for such a hand may appear to include fewer than five digits. In some implementations, system 100 may use multiple distancing devices 120, e.g. having different points-of-view. By combining the output signals generated from multiple distancing devices 120, a richer point cloud containing more information may be derived. In some implementations, such types of point clouds may have less occlusion than single-device single-point-of-view point clouds.
The one or more physical processors 104 may be configured by machine-readable instructions 106. Executing machine-readable instructions 106 may cause one or more physical processors 104 to facilitate tracking a human hand in an interactive space. The machine-readable instructions 106 may include one or more of a control component 108, an input component 110, a hand candidate component 111, an edge component 112, a distance component 113, a center point component 114, a model component 115, a digit candidate component 116, a digit orientation component 117, a hand analysis component 118, and/or other components.
In some implementations, control component 108 may be configured to control one or more components of system 100. In some implementations, control component 108 may be configured to control individual distancing devices of one or more distancing devices 120, and/or other components. In some implementations, control of individual distancing devices of one or more distancing devices 120 may include control of one or more of a sampling rate at which individual distancing devices of one or more distancing devices 120 generate output signals, a sampling period in which individual distancing devices of one or more distancing devices 120 generate output signals, a resolution and/or granularity of one or more distancing devices 120, a horizontal and/or vertical width of the angle of capture and/or generation of one or more distancing devices 120, and/or other control of individual distancing devices of one or more distancing devices 120.
In some implementations, control component 108 may be configured to control individual light sources of one or more light sources 122, and/or other components. In some implementations, control of individual light sources of one or more light sources 122 may include one or more of control of light emission in accordance with one or more instances of virtual content to be generated and/or presented in an interactive space, and/or other types of control. By way of non-limiting illustration, control component 108 may be configured to control a first light source to generate a view of an instance of a first virtual object.
Input component 110 may be configured to obtain position information and/or other information. Position information may be derived from output signals of individual distancing devices of one or more distancing devices 120, from electronic storage 119, from remote storage, and/or from other sources. In some implementations, position information may be obtained at a sampling rate of individual distancing devices of one or more distancing devices 120. Point cloud information may include positions, depths, and/or distances of a set of points that lie on the surfaces of one or more real-world objects. For example, the positions may be on or near surfaces of real-world objects, including but not limited to a user's hands. In some implementations, point cloud information may be three-dimensional positional information.
Referring to
For non-limiting illustrative purposes, first object 200 may comprise a soda cup; and second object 201 may comprise a human hand. System 100 may be configured to obtain position information derived from sensor output (see, e.g., input component 110). System 100 may be configured to determine clusters of points and/or surfaces. The clusters may include one or more of a first cluster 204, a second cluster 206, a third cluster 210, a fourth cluster 212, a fifth cluster 214, a sixth cluster 216, a seventh cluster 218, an eighth cluster 220, and/or other clusters.
System 100 may be configured to determine groups of clusters based on proximity, similarity in size and/or shape, similarity to predetermined patterns and/or templates, similarity and/or range of colors, and/or based on other characteristics. Groups of clusters may include one or more clusters of points and/or surfaces that may make up individual contiguous objects. The groups of clusters may include one or more of a first cluster group 208, a second cluster group 222, and/or other groups of clusters. The first cluster group 208 may include one or more of first cluster 204, second cluster 206, and/or other clusters. The second cluster group 222 may include one or more of third cluster 210, fourth cluster 212, fifth cluster 214, sixth cluster 216, seventh cluster 218, eighth cluster 220, and/or other clusters.
System 100 may be configured to associate, for the individual groups of clusters, the individual clusters in the individual cluster groups with individual anatomical features, including but not limited to features of a hand. By way of non-limiting illustration, first cluster 204, eighth cluster 220, and/or other clusters may be associated with a first hand-feature (i.e. a feature related to a human hand); second cluster 206, fourth cluster 212, fifth cluster 214, sixth cluster 216, seventh cluster 218, and/or other clusters may be associated with a second hand-feature; and third cluster 210 may be associated with a third hand-feature. The first hand-feature may include a palm of a hand; the second hand-feature may include a finger of a hand; and the third hand-feature may include a thumb of a hand.
Referring to
In some implementations, identifying the group of points may be based on one or more of an estimated surface area of the group of points, an estimated volume of the group of points, a distance from the distancing device to the group of points, a range of distances from the distancing device to the group of points, and a range of ratios of the estimated surface area of the group of points and the estimated volume of the group of points. For example, a typical range of surface areas for a human hand in a particular pose (say, a fist, or a hand with all digits pointing outwards like a star, etc.) may be known in advance, e.g., as based on statistical analysis and/or measurements. Similarly, a typical volume of a hand (or of a bounding box around a hand, with the hand in a particular pose), may be known in advance. Additionally, for a distancing device 120 worn in a predetermined manner (e.g., as part of a HMD) by a user of known and/or estimated dimensions, the maximum distance of the user's hands to the distancing device 120 may be known and/or estimated in advance. For example, the system can be configured to determine that objects at 5 or more feet from the user's head are not the user's hands.
In some implementations, identifying the group of points is based on a range of (skin) colors of the group of points. In some implementations, distancing device 120 may be configured to generate the output signals at a first moment in time. For example, identifying the group of points may be based on a location compared to a previously identified group of points derived from output signals generated at a second moment in time, the second moment in time occurring prior to the first moment in time. For example, a user's hands may be expected to move at typical human speeds and are, e.g., unlikely to travel across the field-of-view of a distancing device or user in a manner that suggests that the locations of the same hand in subsequently captured and/or generated frames (at, say, a 30 frames-per-second rate) are many feet apart in the real world.
Edge component 112 may be configured to detect a set of surface edges of the candidate surface, wherein detection of the set of edges is based on depth information of individual points in the group of points. By way of non-limiting example,
By way of non-limiting example,
Referring to
Referring to
In some implementations, determining the subset of points is performed such that the neighboring points of an individual point include one or more of a first point to the left of the individual point, a second point to the right of the individual point, a third point above the individual point, and a fourth point below the individual point, as viewed from a point of view originating at the distancing device. By way of non-limiting example,
Referring to
By way of non-limiting example,
Digit candidate component 116 may be configured to identify one or more sections of the model created by model component 115 such that an individual section is a candidate section which is a candidate for being a finger or thumb of the hand. In some implementations, identifications by digit candidate component 116 may be based on the size of a section, on the shape of a section, on the distance between ends of the section, and/or on other characteristics. By way of non-limiting example,
In some implementations, a model created by model component 115 includes one or more sections that are candidates for being a digit of a hand, without including a section for the palm of the hand, the back of the hand, and/or other parts of the hand connecting different digits. In other words, in some implementations, a model created by model component 115 may include only those sections which are candidates for being a digit of a hand.
Referring to
In some implementations, digit orientation component 117 may be configured to perform an individual identification based on a single (candidate) digit. In some implementations, digit orientation component 117 may be configured to perform an individual identification based on relative characteristics (e.g., position, direction, size, shape, etc.) of multiple (candidate) digits. For example, based on knowledge of human anatomy, the left-most end of section 86 in
In some implementations, hand analysis component 118 may be configured to determine whether the candidate surface is the surface of the hand based on an analysis of the identified tips and connective joints. Hand analysis component 118 may be configured to analyze the identified tips and connective joints is based on determined distances between the identified tips and/or between the identified connective joints.
In some implementations, the analysis of the identified tips and connective joints includes grouping up to five candidate sections together as part of the same hand.
In some implementations, digit orientation component 117 may be configured to determine one or more two-dimensional and/or three-dimensional directional vectors for individual digits (or candidate sections). For example, a directional vector may be based on the fingertip and the connective joint of a digit. In some implementations, a directional vector may be based on the connective joint and the estimated location of either the proximal interphalangeal joint (for a finger) or the interphalangeal joint of a thumb. In some implementations, hand analysis component 118 may be configured such that the analysis of the identified tips and connective joints is based on comparisons of directional vectors as determined by digit orientation component 117. Based on known ranges of motion for human hands, and known possible relative positions of the digits of a hand, hand analysis component 118 may determine whether the candidate digits are plausibly connected to the same hand.
In some implementations, comparisons by hand analysis component 118 may be based on dot products of pairs of the determined directional vectors, in particular for neighboring fingers. For example, a dot product of greater than 0.6, 0.7, 0.8, or another threshold may indicate that a pair of candidate digits appear to roughly point in the same direction. For example, a dot product of less than 0.5, 0.4, 0.3, or another threshold may indicate that a pair of candidate digits appear to roughly point in opposite directions, potentially in a manner that could be considered as anatomically unlikely or otherwise unusual.
In some implementations, known shapes, forms, sizes and/or other features of a human hand may be determined on a per-user-basis. For example, individual users may be prompted to position one or more hands within a field-of-view of a distancing device so that the distancing device may sample surfaces of the individual user's own hands. The position information derived from sensor output measuring the individual user's own hands may be used to determined one or more of an over shape, form, sizes, dimensions, and/or other features of the individual user's own hands. In some implementations, the user may be prompted to position their hands in various orientations (e.g., palm up, palm down, palm to the side, fingers in a grip, fingers released, etc.). Dimensions may include one or more of individual lengths of individual fingers, individual lengths of individual thumbs, a width of the hand at the knuckles, a width of the hand at the palm, a surface area of the palm, and/or other dimensions. The features of the individual user's own hands and/or averages (e.g., averaged over different orientations) may be used as one or more canonical hand models that one or more of identifications of a hand (including, for example, whether it may be a right hand or left hand), identifications of individual hand features, and/or determination of feature positions of individual hand features may be based on.
In some implementations, determining individual positions of individual hand features of a human hand may be based on anatomical relationships between hand features of a human hand, one or more canonical hand models, and/or other information. Anatomical relationships may comprise one or more of a directional relationship between hand features, a distance relationship between hand features, an orientation relationship between hand features, and/or other relationships. Anatomical relationships between hand features may follow one or more known conventions such that determining a feature position of one hand feature from a feature position of one or more other hand features may be possible.
By way of non-limiting illustration, given a feature position of a first hand-feature of a hand, a feature position of a second hand-feature of the hand may be determined based on one or more of a range of distances from the feature position of the first hand-feature that may be anatomically possible to correspond to the feature position of the second hand-feature, one or more directions from the feature position of the first hand-feature that may be anatomically possible to point to the feature position of the second hand-feature, and/or other information. By way of non-limiting illustration, given a feature position of a wrist of a hand, a feature position of a thumb of the hand may be determined based on one or more of a range of distances from the wrist position that may be anatomically possible to correspond to the feature position of the thumb, one or more directions from the feature position of the wrist that may be anatomically possible to point to the feature position of the thumb, and/or other information.
In some implementations, determining a feature position of a first hand-feature may include an initial determination of an estimated feature position of the first hand feature. An estimated feature position of the first hand-feature may be initially determined from position information derived from sensor output that conveys positions of surfaces of a hand.
Returning to
The external resources 126 may include sources of information, hosts, external entities participating with system 100, providers of virtual content, and/or other resources. In some implementations, some or all of the functionality attributed herein to external resources 126 may be provided by resources included in system 100.
Processor(s) 104 may include and/or have access to electronic storage 119, and/or other components. Processor(s) 104 may include communication lines or ports to enable the exchange of information with a network, electronic storage 119, and/or other components of system 100. Illustration of processor(s) 104 in
Electronic storage 119 may comprise electronic storage media that electronically stores information. The electronic storage media of electronic storage 119 may include one or both of system storage that is provided integrally (i.e., substantially non-removable) with processor(s) 104 and/or removable storage that is removably connectable to processor(s) 104 via, for example, a port or a drive. A port may include a USB port, a firewire port, and/or other port. A drive may include a disk drive and/or other drive. Electronic storage 119 may include one or more of optically readable storage media (e.g., optical disks, etc.), magnetically readable storage media (e.g., magnetic tape, magnetic hard drive, floppy drive, etc.), electrical charge-based storage media (e.g., EEPROM, RAM, etc.), solid-state storage media (e.g., flash drive, etc.), and/or other electronically readable storage media. Electronic storage 119 may include one or more virtual storage resources (e.g., cloud storage, a virtual private network, and/or other virtual storage resources). Electronic storage 119 may store software algorithms, information determined by processor(s) 104, information received from other components of system 100, and/or other information that enables processor(s) 104 to function as described herein.
Processor(s) 104 is configured to provide information-processing capabilities. As such, processor(s) 104 may include one or more of a digital processor, an analog processor, a digital circuit designed to process information, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information. Although processor(s) 104 is shown in
It should be appreciated that although components 108, 110-118 are illustrated in
In some implementations, method 900 may be implemented in a computer system comprising one or more of non-transitory electronic storage storing machine-readable instructions, one or more processing devices (e.g., a physical processor, a digital processor, an analog processor, a digital circuit designed to process information, an analog circuit designed to process information, a state machine, and/or other mechanisms for electronically processing information), one or more light sources, one or more distancing devices, one or more optical elements, and/or other components. The one or more processing devices may include one or more devices executing some or all of the operations of method 900 in response to instructions stored electronically on electronic storage media. The one or more processing devices may include one or more devices configured through hardware, firmware, and/or software to be specifically designed for execution of one or more of the operations of method 900.
At an operation 902, position information may be obtained. The position information may be derived from output signals of a distancing device. The position information may include positions of surfaces of real-world objects. The position information may comprise point cloud information. The point cloud information may include positions of a set of points that lie on the surfaces of the real-world objects, such that the positions are on or near the surfaces of the real-world objects. In some embodiments, operation 902 is performed by an input component the same as or similar to input component 110 (shown in
At an operation 904, a group of points is identified that is a subset of the set of points such that the group of points lie on a candidate surface which is a candidate for being a surface of a hand. In some embodiments, operation 904 is performed by a hand candidate component the same as or similar to hand candidate component 111 (shown in
At an operation 906, a set of surface edges of the candidate surface is detected. Detection of the set of edges is based on depth information of individual points in the group of points. In some embodiments, operation 906 is performed by an edge component the same as or similar to edge component 112 (shown in
At an operation 908, it is determined, for individual points in the group of points, which individual surface edges in the set of surface edges are closest to the individual points such that individual points are associated with closest individual surface edges. In some embodiments, operation 908 is performed by a distance component the same as or similar to distance component 113 (shown in
At an operation 910, it is determined, for individual points in the group of points, distances to the associated closest individual surface edges. In some embodiments, operation 910 is performed by a distance component the same as or similar to distance component 113 (shown in
At an operation 912, a subset of points is determined that is a subset of the group of points, wherein the determination is based on a relation between individual points in the group of points and one or more surface edges from the set of surface edges. In some implementations, a subset of points is determined that is a subset of the group of points such that individual points in the subset of points have equal or greater distances to the associated closest individual surface edges than neighboring points in the group of points. In some embodiments, operation 912 is performed by a center point component the same as or similar to center point component 114 (shown in
At an operation 914, a model is created that connects neighboring points in the determined subset of points. In some embodiments, operation 914 is performed by a model component the same as or similar to model component 115 (shown in
At an operation 916, one or more sections of the model are identified such that an individual section is a candidate section which is a candidate for being a finger or thumb of the hand. In some embodiments, operation 916 is performed by a digit candidate component the same as or similar to digit candidate component 116 (shown in
At an operation 918, tips and connective joints are identified for individual candidate sections from the one or more candidate sections such that an individual tip is a candidate tip which is a candidate for being a fingertip of an individual finger or thumb and an individual connective joint is a candidate connective joint which is a candidate for being a connective joint that connects the individual finger or thumb to the hand. In some embodiments, operation 918 is performed by a digit orientation component the same as or similar to digit orientation component 117 (shown in
At an operation 920, it is determined whether the candidate surface is the surface of the hand based on an analysis of the identified tips and connective joints. In some embodiments, operation 920 is performed by a hand analysis component the same as or similar to hand analysis component 118 (shown in
Although the disclosed technology has been described in detail for the purpose of illustration based on what is currently considered to be the most practical and preferred implementations, it is to be understood that such detail is solely for that purpose and that the disclosure is not limited to any particular implementations, but, on the contrary, is intended to cover modifications and equivalent arrangements that are within the spirit and scope of the appended claims. For example, it is to be understood that the present disclosure contemplates that, to the extent possible, one or more features of any embodiment can be combined with one or more features of any other embodiment.
Patent | Priority | Assignee | Title |
10698496, | Sep 12 2016 | Qualcomm Incorporated | System and method for tracking a human hand in an augmented reality environment |
11244154, | Dec 23 2019 | SENSETIME INTERNATIONAL PTE. LTD. | Target hand tracking method and apparatus, electronic device, and storage medium |
Patent | Priority | Assignee | Title |
8274535, | Jul 24 2000 | Qualcomm Incorporated | Video-based image control system |
8351651, | Apr 26 2010 | Microsoft Technology Licensing, LLC | Hand-location post-process refinement in a tracking system |
9304594, | Apr 12 2013 | Microsoft Technology Licensing, LLC | Near-plane segmentation using pulsed light source |
9469558, | Oct 01 2012 | D.C. Water & Sewer Authority; Hampton Roads Sanitation District | Method and apparatus for maximizing nitrogen removal from wastewater |
9665767, | Feb 28 2011 | AIC INNOVATIONS GROUP, INC | Method and apparatus for pattern tracking |
9772679, | Aug 14 2013 | Amazon Technologies, Inc.; Amazon Technologies, Inc | Object tracking for device input |
20030185445, | |||
20110080490, | |||
20110142353, | |||
20120319945, | |||
20130278501, | |||
20140106816, | |||
20140337786, | |||
20150253863, | |||
20150253864, | |||
20170154214, | |||
20170235143, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Oct 23 2017 | Meta Company | (assignment on the face of the patent) | / | |||
Nov 14 2017 | GRIBETZ, YISHAI | Meta Company | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 044478 | /0799 | |
Feb 15 2018 | Meta Company | VENTURE LENDING & LEASING VII, INC | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 045349 | /0313 | |
Feb 15 2018 | Meta Company | VENTURE LENDING & LEASING VIII, INC | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 045349 | /0313 | |
Oct 12 2018 | K&L GATES LLP | META VIEW, INC | RELEASE BY SECURED PARTY SEE DOCUMENT FOR DETAILS | 050110 | /0688 | |
Oct 12 2018 | Meta Company | K&L GATES LLP | CORRECTIVE ASSIGNMENT TO CORRECT THE INCORRECT APPL NO 15 587,105 PREVIOUSLY RECORDED AT REEL: 047240 FRAME: 0864 ASSIGNOR S HEREBY CONFIRMS THE SECURITY INTEREST | 048964 | /0953 | |
Oct 12 2018 | Meta Company | K&L GATES LLP | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 047240 | /0864 | |
Dec 21 2018 | VENTURE LENDING & LEASING VIII, INC | META VIEW, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 049038 | /0277 | |
Dec 21 2018 | VENTURE LENDING & LEASING VII, INC | META VIEW, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 049038 | /0277 | |
Nov 19 2021 | META VIEW, INC | CAMPFIRE 3D, INC | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 066422 | /0129 | |
Apr 11 2024 | CAMPFIRE 3D, INC | Qualcomm Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 067209 | /0066 |
Date | Maintenance Fee Events |
Oct 23 2017 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Nov 06 2017 | SMAL: Entity status set to Small. |
Aug 31 2022 | M2551: Payment of Maintenance Fee, 4th Yr, Small Entity. |
Date | Maintenance Schedule |
Mar 12 2022 | 4 years fee payment window open |
Sep 12 2022 | 6 months grace period start (w surcharge) |
Mar 12 2023 | patent expiry (for year 4) |
Mar 12 2025 | 2 years to revive unintentionally abandoned end. (for year 4) |
Mar 12 2026 | 8 years fee payment window open |
Sep 12 2026 | 6 months grace period start (w surcharge) |
Mar 12 2027 | patent expiry (for year 8) |
Mar 12 2029 | 2 years to revive unintentionally abandoned end. (for year 8) |
Mar 12 2030 | 12 years fee payment window open |
Sep 12 2030 | 6 months grace period start (w surcharge) |
Mar 12 2031 | patent expiry (for year 12) |
Mar 12 2033 | 2 years to revive unintentionally abandoned end. (for year 12) |