Patent · US Active

Prompt-to-prompt image editing with cross-attention control

US12430830B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 31, 2023
Grant date: Sep 30, 2025
Priority date:
Expiry date: Jan 6, 2044

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/096
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Some implementations are directed to editing a source image that was generated by processing a source natural language (NL) prompt using a large-scale language-image (LLI) model. Those implementations edit the source image based on user interface input that indicates an edit to the source NL prompt, optionally independent of any user interface input that specifies a mask in the source image and/or independent of any other user interface input. Some implementations of the present disclosure additionally or alternatively apply prompt-to-prompt editing techniques to a source image that was generated based on a real image and that approximates the real image.
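The abstract does not spell out the mechanism, but the title points to cross-attention control. As a rough illustration only, the Python sketch below shows one common way such control can work for a word-swap edit: the attention maps computed for the source prompt are reused while the values come from the edited prompt, so the spatial layout follows the source image and no mask is needed. Everything here is an assumption for illustration, not the patented method: the function names (`softmax`, `cross_attention`), the toy shapes, the random stand-ins for pixel queries and token embeddings, and the choice to share the same queries between both passes.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def cross_attention(queries, keys, values, injected_maps=None):
    """Toy cross-attention layer (hypothetical helper).

    queries: (pixels, d) image features; keys, values: (tokens, d) prompt features.
    If injected_maps is given, it replaces the freshly computed attention maps.
    Returns the layer output and the attention maps that were actually used.
    """
    d = queries.shape[-1]
    maps = softmax(queries @ keys.T / np.sqrt(d))  # (pixels, tokens)
    if injected_maps is not None:
        maps = injected_maps  # assumption: word-swap edit, same token count
    return maps @ values, maps


# Random stand-ins for real model activations (illustrative only).
rng = np.random.default_rng(0)
pixels, tokens, d = 16, 5, 8
queries = rng.normal(size=(pixels, d))
src_keys = rng.normal(size=(tokens, d))
src_values = rng.normal(size=(tokens, d))
edit_keys = rng.normal(size=(tokens, d))
edit_values = rng.normal(size=(tokens, d))

# Pass 1: source prompt; keep its attention maps.
src_out, src_maps = cross_attention(queries, src_keys, src_values)

# Pass 2: edited prompt; layout comes from the source maps, content from the edit.
edit_out, edit_maps = cross_attention(
    queries, edit_keys, edit_values, injected_maps=src_maps
)
```

In a real diffusion model this substitution would be applied inside the denoising network's cross-attention layers over some subset of sampling steps; the sketch reduces that to a single layer and a single step.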

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.