Patent · US Active

Method and apparatus for visual question answering, computer device and medium

US11775574B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 16
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 23, 2021
Grant date: Oct 3, 2023
Priority date
Expiry date: Dec 17, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/02
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for visual question answering, a computer device implementing the method and a medium for storing instructions on performing the method are provided. The method includes: acquiring an input image and an input question; constructing a visual graph based on the input image, wherein the visual graph comprises a first node feature and a first edge feature; constructing a question graph based on the input question, wherein the question graph comprises a second node feature and a second edge feature; performing a multimodal fusion on the visual graph and the question graph to obtain an updated visual graph and an updated question graph; determining a question feature based on the input question; determining a fusion feature based on the updated visual graph, the updated question graph and the question feature; and generating a predicted answer for the input image and the input question.
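The sequence of steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not say how the graphs are built or fused, so every name and operation below is a hypothetical stand-in. The sketch assumes node features arrive as plain vectors, edge features as adjacency matrices, fusion as cross-graph attention, the question feature as a mean of word vectors, and answer selection as a dot-product score against candidate answer vectors.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def graph_step(nodes, adj):
    """One round of message passing: each node adds the
    adjacency-weighted sum of the other nodes (assumed encoder)."""
    dim = len(nodes[0])
    return [
        [nodes[i][d] + sum(adj[i][j] * nodes[j][d] for j in range(len(nodes)))
         for d in range(dim)]
        for i in range(len(nodes))
    ]


def cross_attend(targets, sources):
    """Assumed multimodal fusion: each target node is updated with an
    attention-weighted mix of the other graph's nodes."""
    updated = []
    for t in targets:
        weights = softmax([dot(t, s) for s in sources])
        message = [sum(w * s[d] for w, s in zip(weights, sources))
                   for d in range(len(t))]
        updated.append([a + b for a, b in zip(t, message)])
    return updated


def mean_pool(nodes):
    return [sum(col) / len(nodes) for col in zip(*nodes)]


def predict_answer(visual_nodes, visual_adj, question_nodes, question_adj,
                   answer_vectors):
    # Visual graph and question graph: node features plus edge features.
    v = graph_step(visual_nodes, visual_adj)
    q = graph_step(question_nodes, question_adj)
    # Multimodal fusion yields an updated visual and question graph.
    v_upd = cross_attend(v, q)
    q_upd = cross_attend(q, v)
    # Question feature, taken here directly from the input question vectors.
    q_feat = mean_pool(question_nodes)
    # Fusion feature: pooled graphs summed, then gated by the question
    # feature (an assumed combination rule).
    pooled = [a + b for a, b in zip(mean_pool(v_upd), mean_pool(q_upd))]
    fused = [p * f for p, f in zip(pooled, q_feat)]
    # Predicted answer: the candidate whose vector scores highest.
    best = max(answer_vectors, key=lambda name: dot(fused, answer_vectors[name]))
    return best, fused
```

A toy call with two image regions, two question tokens and two candidate answers, all with made-up two-dimensional features:

```python
best, fused = predict_answer(
    visual_nodes=[[1.0, 0.0], [0.0, 1.0]],
    visual_adj=[[0, 1], [1, 0]],
    question_nodes=[[1.0, 0.0], [1.0, 0.0]],
    question_adj=[[0, 1], [1, 0]],
    answer_vectors={"yes": [1.0, 0.0], "no": [0.0, 1.0]},
)
```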

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.