Patent · US Active

Visual language processing modeling framework via an attention-on-attention mechanism

US12197877B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 2, 2022
Grant dateJan 14, 2025
Priority date
Expiry dateNov 2, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06T2207/30201
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed are various embodiments for a visual language processing modeling framework via an attention-on-attention mechanism, which may be employed for object identification, classification, and the like. In association with a display of a user interface, an eye tracking via images captured by an imaging device is performed to programmatically detect eye movement and fixation relative to sub-regions of the user interface. Eye fixations on at least one of the sub-regions from the eye tracking. Visual cues are extracted from the user interface based at least in part on the eye fixations, the visual cues being in a sequence of identification. A visual language sentence is generated based at least in part on the visual cues as extracted. The visual language sentence of the visual cues in the sequence of identification is correlated to at least one decision using a visual language understanding routine.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.