Patent · US Active

Sensor data fusion using cross-modal transformer

US11921824B1 · kind B1 · utility

Cited by: 7
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 29, 2021
Grant date: Mar 5, 2024
Priority date
Expiry date: Mar 25, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V20/10
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques are generally described for fusing sensor data of different modalities using a transformer. In various examples, first sensor data may be received from a first sensor and second sensor data may be received from a second sensor. A first feature representation of the first sensor data may be generated using a first machine learning model and a second feature representation of the second sensor data may be generated using a second machine learning model. In some examples, a modified first feature representation of the first sensor data may be generated based at least in part on a self-attention mechanism of a transformer encoder. The modified first feature representation may be generated based at least in part on the first feature representation and the second feature representation. A computer vision task may be performed using the modified first feature representation.
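The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes each modality has already been reduced to a list of feature vectors (standing in for the outputs of the first and second machine learning models), and it uses single-head scaled dot-product self-attention with identity query/key/value projections, whereas a real transformer encoder would use learned projections, multiple heads, and feed-forward layers. The names `self_attention` and `fuse` and the toy feature values are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity Q/K/V projections, so each token serves as its
    own query, key, and value. A trained encoder would learn these.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        # Similarity of this token to every token in the joint sequence.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        # Weighted sum of all tokens (values) for this query.
        out.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return out


def fuse(first_features, second_features):
    """Return a modified first feature representation.

    Both modalities are concatenated into one token sequence so that
    attention for each first-modality token also covers the second
    modality. Only the rows belonging to the first modality are returned.
    """
    joint = first_features + second_features
    attended = self_attention(joint)
    return attended[: len(first_features)]


# Toy stand-ins for the two feature representations (hypothetical values).
first = [[1.0, 0.0], [0.0, 1.0]]   # e.g. features from the first sensor
second = [[1.0, 1.0]]              # e.g. features from the second sensor

modified_first = fuse(first, second)
```

In this sketch the modified first representation keeps the shape of the original first representation, but each row now mixes in information from the second modality. A downstream computer vision task would consume `modified_first` in place of `first`.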

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.