Patent · US Active

Dense captioning with joint interference and visual context

US11361489B2 · kind B2 · utility

Cited by: 0
References: 27
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 17, 2020
Grant date: Jun 14, 2022
Priority date
Expiry date: Jun 17, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06T2210/12
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A dense captioning system and method is provided for analyzing an image to generate proposed bounding regions for a plurality of visual concepts within the image, generating a region feature for each proposed bounding region to generate a plurality of region features of the image, and determining a context feature for the image using a proposed bounding region that is a largest in size of the proposed bounding regions. For each region feature of the plurality of region features of the image, the dense captioning system and method further provides for analyzing the region feature to determine for the region feature a detection score that indicates a likelihood that the region feature comprises an actual object, and generating a caption for a visual concept in the image using the region feature and the context feature when a detection score is above a specified threshold value.
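The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. All names (`Region`, `box_area`, `dense_captions`, `score_fn`, `caption_fn`) are hypothetical, and the detection scorer and caption generator are passed in as plain callables because the abstract does not specify the models. The sketch assumes region proposals and their features have already been computed, and it takes the context feature to be the feature of the largest proposed region.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), assumed layout
Feature = Sequence[float]                # stand-in for a region feature vector


@dataclass
class Region:
    """A proposed bounding region and its (precomputed) region feature."""
    box: Box
    feature: Feature


def box_area(box: Box) -> float:
    """Area of an axis-aligned box; degenerate boxes have area 0."""
    x1, y1, x2, y2 = box
    return max(0.0, x2 - x1) * max(0.0, y2 - y1)


def dense_captions(
    regions: List[Region],
    score_fn: Callable[[Feature], float],
    caption_fn: Callable[[Feature, Feature], str],
    threshold: float,
) -> List[Tuple[Box, str]]:
    """Caption every region whose detection score exceeds `threshold`.

    `score_fn` and `caption_fn` are hypothetical placeholders for the
    detection scorer and the caption generator.
    """
    if not regions:
        return []

    # Context feature: taken from the largest proposed bounding region.
    context = max(regions, key=lambda r: box_area(r.box)).feature

    results = []
    for region in regions:
        # Detection score: likelihood that the region holds an actual object.
        score = score_fn(region.feature)
        if score > threshold:
            # The caption uses both the region feature and the context feature.
            results.append((region.box, caption_fn(region.feature, context)))
    return results
```

With toy stand-ins for the two callables, for example `score_fn=lambda f: f[0]` and a `caption_fn` that formats both features into a string, only regions scoring above the threshold produce a caption, and every caption is conditioned on the same context feature.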

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.