Generation of image corresponding to input text using multi-text guided image cropping
US12131406B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 4, 2022 |
| Grant date | Oct 29, 2024 |
| Priority date | — |
| Expiry date | Dec 10, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T5/70
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods are provided that include a processor executing a program to receive an input from a user, where the input including a first input text and a second input text. The processor is further configured to provide an initial image and, for a predetermined number of iterations, define a first and second regions of the initial image associated with the first and second input texts, respectively, define a plurality of patches of the initial image, input the initial image into a diffusion process to generate a processed image, back-propagate the processed image through a text-image match gradient calculator by generating an image embedding based on the processed image, generating a text embedding based on the region and the input text that are associated with a patch, and calculating a differential between the image embedding and the text embedding.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.