Visual dialogue method and system
US12223284B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 27, 2022 |
| Grant date | Feb 11, 2025 |
| Priority date | — |
| Expiry date | Jun 27, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/82
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A visual dialogue method and system is provided. The method includes obtaining original input data, where the original input data includes current image data and a new question, and the new question is related to the current image data; preprocessing text data and image data in the original input data to obtain a text feature sequence and a visual feature sequence, respectively; using a VisDial dataset to construct a text corpus; obtaining text sequence knowledge by using a potential knowledge searcher based on the visual feature sequence and the text corpus; constructing a sparse scene graph based on the visual feature sequence; performing data fusion on the text feature sequence, the visual feature sequence, the text sequence knowledge, and the sparse scene graph to obtain a data fusion result; and obtaining dialogue content of the new question by using a decoder based on the data fusion result.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.