Patent · US Active

Three-dimensional (3D) pose estimation from a monocular camera

US10929654B2 · kind B2 · utility

10Cited by

1References

21Claims

0Family size

Assignee

NVIDIA Corporation · US

Inventors

Umar Iqbal · San Jose, US
Pavlo Molchanov · Santa Clara, US
Thomas Breuel · Sparks, US
Jan Kautz · Lexington, US

Key dates

Filing date	Mar 1, 2019
Grant date	Feb 23, 2021
Priority date	—
Expiry date	Jul 25, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG06T2207/30196
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Estimating a three-dimensional (3D) pose of an object, such as a hand or body (human, animal, robot, etc.), from a 2D image is necessary for human-computer interaction. A hand pose can be represented by a set of points in 3D space, called keypoints. Two coordinates (x,y) represent spatial displacement and a third coordinate represents a depth of every point with respect to the camera. A monocular camera is used to capture an image of the 3D pose, but does not capture depth information. A neural network architecture is configured to generate a depth value for each keypoint in the captured image, even when portions of the pose are occluded, or the orientation of the object is ambiguous. Generation of the depth values enables estimation of the 3D pose of the object.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.