Three-dimensional (3D) pose estimation from a monocular camera
US10929654B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 1, 2019 |
| Grant date | Feb 23, 2021 |
| Priority date | — |
| Expiry date | Jul 25, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T2207/30196
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Estimating a three-dimensional (3D) pose of an object, such as a hand or body (human, animal, robot, etc.), from a 2D image is necessary for human-computer interaction. A hand pose can be represented by a set of points in 3D space, called keypoints. Two coordinates (x,y) represent spatial displacement and a third coordinate represents a depth of every point with respect to the camera. A monocular camera is used to capture an image of the 3D pose, but does not capture depth information. A neural network architecture is configured to generate a depth value for each keypoint in the captured image, even when portions of the pose are occluded, or the orientation of the object is ambiguous. Generation of the depth values enables estimation of the 3D pose of the object.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.