Monocular 3D object localization from temporal aggregation
US11987236B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 23, 2021 |
| Grant date | May 21, 2024 |
| Priority date | — |
| Expiry date | Sep 13, 2042 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY02T10/84
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method provided for 3D object localization predicts pairs of 2D bounding boxes. Each pair corresponds to a detected object in each of the two consecutive input monocular images. The method generates, for each detected object, a relative motion estimation specifying a relative motion between the two images. The method constructs an object cost volume by aggregating temporal features from the two images using the pairs of 2D bounding boxes and the relative motion estimation to predict a range of object depth candidates and a confidence score for each object depth candidate and an object depth from the object depth candidates. The method updates the relative motion estimation based on the object cost volume and the object depth to provide a refined object motion and a refined object depth. The method reconstructs a 3D bounding box for each detected object based on the refined object motion and refined object depth.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.