Patent · US Active

Using large generative models with improved grounding to improve image context queries

US12405995B2 · kind B2 · utility

0Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 4, 2023
Grant dateSep 2, 2025
Priority date
Expiry dateDec 4, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/90332
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The disclosure describes utilizing an image query system to improve response accuracy and reduce computational steps resources in responding to natural language queries of input images. In various implementations, the image query system utilizes grounding information from one or more sources to determine accurate information for an input image. For example, the image query system uses a single comprehensive image prompt to obtain extensive visual image grounding information for the input image from a visual-based large generative model. Additionally, or in alternative implementations, the image query system obtains reverse image search grounding information for the input image. The image query system then cleverly utilizes the grounding information with a large generative language model to generate text query responses to image-based queries of the input image more accurately and efficiently.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.