Patent · US Active

Visual dialogue method and system

US12223284B2 · kind B2 · utility

0Cited by
3References
4Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 27, 2022
Grant dateFeb 11, 2025
Priority date
Expiry dateJun 27, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V10/82
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A visual dialogue method and system is provided. The method includes obtaining original input data, where the original input data includes current image data and a new question, and the new question is related to the current image data; preprocessing text data and image data in the original input data to obtain a text feature sequence and a visual feature sequence, respectively; using a VisDial dataset to construct a text corpus; obtaining text sequence knowledge by using a potential knowledge searcher based on the visual feature sequence and the text corpus; constructing a sparse scene graph based on the visual feature sequence; performing data fusion on the text feature sequence, the visual feature sequence, the text sequence knowledge, and the sparse scene graph to obtain a data fusion result; and obtaining dialogue content of the new question by using a decoder based on the data fusion result.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.