Patent · US Active

Image segmentation using text embedding

US11615567B2 · kind B2 · utility

6Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 18, 2020
Grant dateMar 28, 2023
Priority date
Expiry dateNov 18, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06T2207/20084
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A non-transitory computer-readable medium includes program code that is stored thereon. The program code is executable by one or more processing devices for performing operations including generating, by a model that includes trainable components, a learned image representation of a target image. The operations further include generating, by a text embedding model, a text embedding of a text query. The text embedding and the learned image representation of the target image are in a same embedding space. Additionally, the operations include generating a class activation map of the target image by, at least, convolving the learned image representation of the target image with the text embedding of the text query. Moreover, the operations include generating an object-segmented image using the class activation map of the target image.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.