Sensor data fusion using cross-modal transformer
US11921824B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 29, 2021 |
| Grant date | Mar 5, 2024 |
| Priority date | — |
| Expiry date | Mar 25, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V20/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques are generally described for fusing sensor data of different modalities using a transformer. In various examples, first sensor data may be received from a first sensor and second sensor data may be received from a second sensor. A first feature representation of the first sensor data may be generated using a first machine learning model and a second feature representation of the second sensor data may be generated using a second machine learning model. In some examples, a modified first feature representation of the first sensor data may be generated based at least in part on a self-attention mechanism of a transformer encoder. The modified first feature representation may be generated based at least in part on the first feature representation and the second feature representation. A computer vision task may be performed using the modified first feature representation.
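The fusion step in the abstract can be illustrated with a minimal sketch. It assumes the two feature representations are sequences of token vectors of equal width, that they are concatenated into one sequence, and that a single-head scaled dot-product self-attention stands in for the transformer encoder. None of these choices is stated in the abstract. The names `fuse`, `self_attention`, `random_matrix`, and all dimensions are hypothetical, and random matrices stand in for the outputs of the two machine learning models and for learned projection weights.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over a token sequence."""
    q, k, v = matmul(tokens, w_q), matmul(tokens, w_k), matmul(tokens, w_v)
    scale = math.sqrt(len(k[0]))
    k_t = [list(col) for col in zip(*k)]  # transpose keys
    scores = matmul(q, k_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


def fuse(first_feats, second_feats, w_q, w_k, w_v):
    """Assumed fusion scheme: attend over the tokens of both modalities,
    then keep only the rows that belong to the first modality."""
    tokens = first_feats + second_feats  # concatenate the two token lists
    attended, weights = self_attention(tokens, w_q, w_k, w_v)
    n_first = len(first_feats)
    return attended[:n_first], weights[:n_first]


def random_matrix(rng, rows, cols, scale=1.0):
    """Hypothetical stand-in for model outputs or learned weights."""
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
dim = 8
first_feats = random_matrix(rng, 4, dim)   # e.g. 4 tokens from the first sensor
second_feats = random_matrix(rng, 6, dim)  # e.g. 6 tokens from the second sensor
w_q, w_k, w_v = (random_matrix(rng, dim, dim, 0.3) for _ in range(3))

modified_first, weights = fuse(first_feats, second_feats, w_q, w_k, w_v)
print(len(modified_first), len(modified_first[0]))  # 4 8
print(len(weights[0]))  # 10: each first-modality token attends to all 10 tokens
```

Because every first-modality token attends over the combined sequence, its output row depends on both representations, which is the property the abstract describes for the modified first feature representation. A downstream computer vision head would then consume `modified_first`. That head is omitted here.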
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.