Prompt-to-prompt image editing with cross-attention control
US12430830B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 31, 2023 |
| Grant date | Sep 30, 2025 |
| Priority date | — |
| Expiry date | Jan 6, 2044 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N3/096
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Some implementations are directed to editing a source image that was generated by processing a source natural language (NL) prompt using a large-scale language-image (LLI) model. Those implementations edit the source image based on user interface input that indicates an edit to the source NL prompt, optionally without any user interface input that specifies a mask in the source image and/or without any other user interface input. Some implementations of the present disclosure are additionally or alternatively directed to applying prompt-to-prompt editing techniques to a source image that was generated from a real image and that approximates the real image.
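The abstract does not spell out the mechanism, but the title refers to cross-attention control. As a rough illustration only, the sketch below shows one way such control could work for a prompt edit that swaps a word: the attention maps computed for the source prompt are reused while the values come from the edited prompt, so the spatial layout is kept and only the content changes. Every name here (`attention_maps`, `attend`, `edited_cross_attention`, the `inject` flag) is hypothetical, the vectors are toy data, and the sketch assumes both prompts have the same number of tokens and share the same pixel queries. It is not taken from the patent claims.

```python
import math

def softmax(row):
    # Numerically stable softmax over one row of scores.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention_maps(queries, keys):
    # One row per pixel query; each row is a distribution over prompt tokens.
    scale = math.sqrt(len(keys[0]))
    return [
        softmax([sum(a * b for a, b in zip(q, k)) / scale for k in keys])
        for q in queries
    ]

def attend(maps, values):
    # Weighted sum of token values for each pixel query.
    dim = len(values[0])
    return [
        [sum(w * v[d] for w, v in zip(row, values)) for d in range(dim)]
        for row in maps
    ]

def edited_cross_attention(queries, src_keys, edit_keys, edit_values, inject):
    # Assumption: with inject=True, the source prompt's maps replace the
    # edited prompt's maps, while values still come from the edited prompt.
    src_maps = attention_maps(queries, src_keys)
    edit_maps = attention_maps(queries, edit_keys)
    maps = src_maps if inject else edit_maps
    return attend(maps, edit_values), maps

# Toy data: two pixel queries, two prompt tokens, one value channel.
queries = [[1.0, 0.0], [0.0, 1.0]]
src_keys = [[1.0, 0.0], [0.0, 1.0]]
edit_keys = [[0.0, 1.0], [1.0, 0.0]]
edit_values = [[1.0], [2.0]]

injected_out, injected_maps = edited_cross_attention(
    queries, src_keys, edit_keys, edit_values, inject=True
)
plain_out, plain_maps = edited_cross_attention(
    queries, src_keys, edit_keys, edit_values, inject=False
)
```

In this toy setup, `injected_maps` equals the maps of the source prompt, so each pixel keeps attending to the same token position as before the edit. A diffusion pipeline would typically apply such an injection only for some of the denoising steps, which is a further assumption not stated in the abstract.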
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.