Image segmentation using text embedding
US12008698B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 3, 2023 |
| Grant date | Jun 11, 2024 |
| Priority date | — |
| Expiry date | Mar 3, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T2207/20084
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, using a model, a learned image representation of a target image. The operations further include generating, using a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image based on the convolving of the learned image representation of the target image with the text embedding.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.