Patent · US Active

Image description generation method, apparatus and system, and medium and electronic device

US12073639B2 · kind B2 · utility

0Cited by
1References
18Claims
0Family size

Assignees

Inventors

Key dates

Filing dateMar 2, 2021
Grant dateAug 27, 2024
Priority date
Expiry dateAug 9, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V2201/07
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present disclosure relates to the technical field of image processing, and in particular to an image description generation method, apparatus and system, and a medium and an electronic device. The method comprises: acquiring one or more image region features in a target image, and obtaining a current input vector by performing a mean pooling on the image region features; obtaining respective outer product vectors of the image region features by respectively linearly fusing the current input vector and each of the image region features; calculating, based on the respective outer product vectors of the image region features, an attention distribution of the image region features in a spatial dimension and an attention distribution of the image region features in a channel dimension; and generating an image description of the target image based on the attention distribution of the image region features in the spatial dimension and the attention distribution of the image region features in the channel dimension.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.