Embedding space for images with multiple text labels
US10026020B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 15, 2016 |
| Grant date | Jul 17, 2018 |
| Priority date | — |
| Expiry date | Jan 15, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/274
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Embedding space for images with multiple text labels is described. In the embedding space both text labels and image regions are embedded. The text labels embedded describe semantic concepts that can be exhibited in image content. The embedding space is trained to semantically relate the embedded text labels so that labels like “sun” and “sunset” are more closely related than “sun” and “bird”. Training the embedding space also includes mapping representative images, having image content which exemplifies the semantic concepts, to respective text labels. Unlike conventional techniques that embed an entire training image into the embedding space for each text label associated with the training image, the techniques described herein process a training image to generate regions that correspond to the multiple text labels. The regions of the training image are then embedded into the training space in a manner that maps the regions to the corresponding text labels.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.