Patent · US Active

Generation of image corresponding to input text using multi-text guided image cropping

US12131406B2 · kind B2 · utility

1Cited by

1References

20Claims

0Family size

Assignee

Lemon Inc. · KR

Inventors

Bingchen Liu · Los Angeles, US
Yizhe Zhu · Los Angeles, US
Xiao YANG · Singapore, SG

Key dates

Filing date	Nov 4, 2022
Grant date	Oct 29, 2024
Priority date	—
Expiry date	Dec 10, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06T5/70
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and methods are provided that include a processor executing a program to receive an input from a user, where the input including a first input text and a second input text. The processor is further configured to provide an initial image and, for a predetermined number of iterations, define a first and second regions of the initial image associated with the first and second input texts, respectively, define a plurality of patches of the initial image, input the initial image into a diffusion process to generate a processed image, back-propagate the processed image through a text-image match gradient calculator by generating an image embedding based on the processed image, generating a text embedding based on the region and the input text that are associated with a patch, and calculating a differential between the image embedding and the text embedding.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.