Patent · US Active

Multimodal dialog state tracking and action prediction for assistant systems

US11704745B2 · kind B2 · utility

Cited by: 1
References: 80
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Aug 28, 2020
Grant date: Jul 18, 2023
Priority date
Expiry date: Jul 15, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L2015/228
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

In one embodiment, a method includes receiving, from a client system associated with a user, a user request comprising a reference to a target object; accessing visual data from the client system, wherein the visual data comprises images portraying the target object and one or more additional objects, and wherein attribute information of the target object is recorded in a multimodal dialog state; resolving the reference to the target object based on the attribute information recorded in the multimodal dialog state; determining relational information between the target object and one or more of the additional objects portrayed in the visual data; and sending, to the client system, instructions for presenting a response to the user request, wherein the response comprises the attribute information and the determined relational information.
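The steps named in the abstract (recording attributes in a dialog state, resolving a reference, deriving relational information, composing a response) can be illustrated with a toy sketch. It is not the patented implementation. Every name in it (`TrackedObject`, `MultimodalDialogState`, `resolve_reference`, `horizontal_relation`, `build_response`) is hypothetical, and it assumes that objects have already been detected in the images and reduced to attribute dictionaries and bounding boxes, that a reference is resolved by exact attribute matching, and that "relational information" means a simple left/right comparison of box centres.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class TrackedObject:
    """Hypothetical record for one object seen in the visual data."""
    object_id: str
    attributes: Dict[str, str]
    # Assumed box layout: (x_min, y_min, x_max, y_max) in image coordinates.
    bbox: Tuple[float, float, float, float]


@dataclass
class MultimodalDialogState:
    """Hypothetical dialog state holding attribute information per object."""
    objects: Dict[str, TrackedObject] = field(default_factory=dict)

    def record(self, obj: TrackedObject) -> None:
        self.objects[obj.object_id] = obj


def resolve_reference(state: MultimodalDialogState,
                      mentioned: Dict[str, str]) -> Optional[TrackedObject]:
    """Return the one object whose recorded attributes match every mentioned
    attribute, or None when zero or several objects match."""
    matches: List[TrackedObject] = [
        obj for obj in state.objects.values()
        if all(obj.attributes.get(k) == v for k, v in mentioned.items())
    ]
    return matches[0] if len(matches) == 1 else None


def horizontal_relation(target: TrackedObject, other: TrackedObject) -> str:
    """Compare horizontal box centres (an assumed, very coarse relation)."""
    target_cx = (target.bbox[0] + target.bbox[2]) / 2
    other_cx = (other.bbox[0] + other.bbox[2]) / 2
    if target_cx < other_cx:
        return "left of"
    if target_cx > other_cx:
        return "right of"
    return "aligned with"


def build_response(state: MultimodalDialogState,
                   mentioned: Dict[str, str]) -> str:
    """Compose a response containing attribute and relational information."""
    target = resolve_reference(state, mentioned)
    if target is None:
        return "Which object do you mean?"
    attrs = ", ".join(f"{k}={v}" for k, v in sorted(target.attributes.items()))
    relations = [
        f"{horizontal_relation(target, other)} the "
        f"{other.attributes.get('type', 'object')}"
        for other in state.objects.values()
        if other.object_id != target.object_id
    ]
    if not relations:
        return f"{target.object_id} ({attrs})"
    return f"{target.object_id} ({attrs}) is " + " and ".join(relations)
```

In use, one would record each detected object in the state and then call `build_response(state, {"color": "red"})` for a request such as "the red one". Exact matching is chosen here only to keep the sketch short; a real system would need fuzzier matching and richer spatial relations.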

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.