Patent · US Active

Generation of image corresponding to input text using multi-text guided image cropping

US12131406B2 · kind B2 · utility

1Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 4, 2022
Grant dateOct 29, 2024
Priority date
Expiry dateDec 10, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06T5/70
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods are provided that include a processor executing a program to receive an input from a user, where the input including a first input text and a second input text. The processor is further configured to provide an initial image and, for a predetermined number of iterations, define a first and second regions of the initial image associated with the first and second input texts, respectively, define a plurality of patches of the initial image, input the initial image into a diffusion process to generate a processed image, back-propagate the processed image through a text-image match gradient calculator by generating an image embedding based on the processed image, generating a text embedding based on the region and the input text that are associated with a patch, and calculating a differential between the image embedding and the text embedding.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.