Method and device for visual question answering, computer apparatus and medium
US11768876B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 28, 2021 |
| Grant date | Sep 26, 2023 |
| Priority date | — |
| Expiry date | Jan 9, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V2201/07
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present disclosure provides a method for visual question answering, which relates to a field of computer vision and natural language processing. The method includes: acquiring an input image and an input question; constructing a Visual Graph based on the input image, wherein the Visual Graph comprises a Node Feature and an Edge Feature; updating the Node Feature by using the Node Feature and the Edge Feature to obtain an updated Visual Graph; determining a question feature based on the input question; fusing the updated Visual Graph and the question feature to obtain a fused feature; and generating a predicted answer for the input image and the input question based on the fused feature. The present disclosure further provides an apparatus for visual question answering, a computer device and a non-transitory computer-readable storage medium.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.