Patent · US Active

Method and device for visual question answering, computer apparatus and medium

US11768876B2 · kind B2 · utility

Cited by: 1
References: 0
Claims: 12
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jan 28, 2021
Grant date: Sep 26, 2023
Priority date
Expiry date: Jan 9, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V2201/07
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The present disclosure provides a method for visual question answering, which relates to the fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; constructing a Visual Graph based on the input image, wherein the Visual Graph comprises a Node Feature and an Edge Feature; updating the Node Feature by using the Node Feature and the Edge Feature to obtain an updated Visual Graph; determining a question feature based on the input question; fusing the updated Visual Graph and the question feature to obtain a fused feature; and generating a predicted answer for the input image and the input question based on the fused feature. The present disclosure further provides an apparatus for visual question answering, a computer device and a non-transitory computer-readable storage medium.
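
The sequence of steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how any step is computed, so every concrete choice below is an assumption made for illustration. The assumptions are: the image arrives as per-object feature vectors with bounding boxes, the Edge Feature is the pairwise box overlap (IoU), the node update is one round of edge-weighted neighbour averaging, the question encoder is a toy bag-of-words hash, fusion is mean pooling followed by an element-wise product, and the answer is picked by a dot-product score against hypothetical per-answer weights. All function and parameter names are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Vector = List[float]
Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes (assumed stand-in for the Edge Feature)."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def build_visual_graph(feats: Sequence[Vector], boxes: Sequence[Box]):
    """Nodes are per-object feature vectors; edges are pairwise IoU scores."""
    n = len(boxes)
    edges = [[iou(boxes[i], boxes[j]) if i != j else 0.0 for j in range(n)]
             for i in range(n)]
    return [list(f) for f in feats], edges


def update_nodes(nodes: List[Vector], edges: List[List[float]]) -> List[Vector]:
    """One message-passing round: add the edge-weighted mean of the neighbours."""
    dim = len(nodes[0])
    updated = []
    for i, h in enumerate(nodes):
        total = sum(edges[i])
        msg = [0.0] * dim
        if total > 0:
            for j, w in enumerate(edges[i]):
                for k in range(dim):
                    msg[k] += w * nodes[j][k] / total
        updated.append([h[k] + msg[k] for k in range(dim)])
    return updated


def question_feature(question: str, dim: int) -> Vector:
    """Toy bag-of-words encoding: each token increments a bucket chosen by its character sum."""
    q = [0.0] * dim
    for token in question.lower().split():
        q[sum(map(ord, token)) % dim] += 1.0
    return q


def fuse(nodes: List[Vector], q: Vector) -> Vector:
    """Mean-pool the updated nodes, then multiply element-wise with the question feature."""
    pooled = [sum(h[k] for h in nodes) / len(nodes) for k in range(len(q))]
    return [p * x for p, x in zip(pooled, q)]


def predict(fused: Vector, answer_weights: Dict[str, Vector]) -> str:
    """Score each candidate answer with a dot product and return the best one."""
    return max(answer_weights,
               key=lambda a: sum(w * f for w, f in zip(answer_weights[a], fused)))


def answer(feats, boxes, question: str, answer_weights: Dict[str, Vector]) -> str:
    """Run the steps in the order the abstract lists them."""
    nodes, edges = build_visual_graph(feats, boxes)
    nodes = update_nodes(nodes, edges)
    q = question_feature(question, len(nodes[0]))
    return predict(fuse(nodes, q), answer_weights)
```

In a real system each stage would be a learned component (for example an object detector, a graph neural network layer, and a trained text encoder); the sketch only fixes the data flow: graph construction, node update from node and edge features, question encoding, fusion, and answer prediction.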

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.