Patent · US Active

Visual language processing modeling framework via an attention-on-attention mechanism

US12197877B2 · kind B2 · utility

0Cited by

1References

20Claims

0Family size

Assignee

VIRGINIA TECH INTELLECTUAL PROPERTIES, INC. · US

Inventors

Ran Jin · Hubei, CN
Xiaoyu Chen · Nanhu, CN

Key dates

Filing date	Nov 2, 2022
Grant date	Jan 14, 2025
Priority date	—
Expiry date	Nov 2, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06T2207/30201
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed are various embodiments for a visual language processing modeling framework via an attention-on-attention mechanism, which may be employed for object identification, classification, and the like. In association with a display of a user interface, an eye tracking via images captured by an imaging device is performed to programmatically detect eye movement and fixation relative to sub-regions of the user interface. Eye fixations on at least one of the sub-regions from the eye tracking. Visual cues are extracted from the user interface based at least in part on the eye fixations, the visual cues being in a sequence of identification. A visual language sentence is generated based at least in part on the visual cues as extracted. The visual language sentence of the visual cues in the sequence of identification is correlated to at least one decision using a visual language understanding routine.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.