Visual language processing modeling framework via an attention-on-attention mechanism
US12197877B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Nov 2, 2022 |
| Grant date | Jan 14, 2025 |
| Priority date | — |
| Expiry date | Nov 2, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06T2207/30201
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed are various embodiments for a visual language processing modeling framework via an attention-on-attention mechanism, which may be employed for object identification, classification, and the like. In association with a display of a user interface, eye tracking is performed via images captured by an imaging device to programmatically detect eye movement and fixation relative to sub-regions of the user interface. Eye fixations on at least one of the sub-regions are identified from the eye tracking. Visual cues are extracted from the user interface based at least in part on the eye fixations, the visual cues being in a sequence of identification. A visual language sentence is generated based at least in part on the visual cues as extracted. The visual language sentence of the visual cues in the sequence of identification is correlated to at least one decision using a visual language understanding routine.
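The first steps of the abstract (fixation detection, cue extraction in sequence, and sentence generation) can be illustrated with a minimal Python sketch. This is not the patented implementation. The names `SubRegion`, `GazeSample`, `fixated_cues`, and `visual_language_sentence`, the rectangular region layout, and the 200 ms dwell threshold are all assumptions made for illustration. The final step, correlating the sentence to a decision with a visual language understanding routine, is not shown.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass(frozen=True)
class SubRegion:
    """A rectangular sub-region of the user interface (hypothetical layout)."""
    cue: str       # token naming the visual cue shown in this region
    x: float       # left edge, in pixels
    y: float       # top edge, in pixels
    width: float
    height: float

    def contains(self, px: float, py: float) -> bool:
        return (self.x <= px < self.x + self.width
                and self.y <= py < self.y + self.height)


@dataclass(frozen=True)
class GazeSample:
    """One gaze estimate from the eye tracker (assumed format)."""
    t_ms: int
    x: float
    y: float


def region_at(regions: Sequence[SubRegion], x: float, y: float) -> Optional[SubRegion]:
    """Return the first sub-region containing the point, or None."""
    for region in regions:
        if region.contains(x, y):
            return region
    return None


def fixated_cues(samples: Sequence[GazeSample],
                 regions: Sequence[SubRegion],
                 min_dwell_ms: int = 200) -> List[str]:
    """Return cue tokens in the order in which their regions were fixated.

    Assumption: a fixation is a run of consecutive samples inside the same
    sub-region that spans at least min_dwell_ms. Each run emits one cue.
    """
    cues: List[str] = []
    current: Optional[SubRegion] = None
    start_ms = 0
    emitted = False
    for s in samples:
        region = region_at(regions, s.x, s.y)
        if region != current:
            # Gaze moved to a different region (or off all regions): restart the run.
            current, start_ms, emitted = region, s.t_ms, False
        if current is not None and not emitted and s.t_ms - start_ms >= min_dwell_ms:
            cues.append(current.cue)
            emitted = True
    return cues


def visual_language_sentence(cues: Sequence[str]) -> str:
    """Join the cue tokens, in sequence of identification, into one sentence."""
    return " ".join(cues)


# Hypothetical two-region interface and gaze trace.
regions = [SubRegion("logo", 0, 0, 100, 100), SubRegion("price", 100, 0, 100, 100)]
samples = [
    GazeSample(0, 10, 10), GazeSample(100, 20, 20), GazeSample(250, 30, 30),
    GazeSample(300, 150, 50), GazeSample(350, 150, 50), GazeSample(400, 500, 500),
    GazeSample(450, 150, 50), GazeSample(700, 160, 60),
]
sentence = visual_language_sentence(fixated_cues(samples, regions))
print(sentence)  # prints "logo price"
```

In this sketch the brief glance at the price region between 300 ms and 350 ms is discarded because it falls short of the dwell threshold, so only sustained fixations contribute tokens to the sentence.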
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.