Optimizing dialogue policy decisions for digital assistants using implicit feedback
US10810274B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 15, 2017 |
| Grant date | Oct 20, 2020 |
| Priority date | — |
| Expiry date | May 25, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/223
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Systems and processes for optimizing dialogue policy decisions for digital assistants using implicit feedback are provided. In an example process, a user utterance is received. Based on a text representation of the user utterance, one or more user intents corresponding to the user utterance are determined. A policy action is selected from a plurality of candidate policy actions based on a belief state for the one or more user intents and a policy model. The policy action is performed, including outputting results of the policy action for presentation. A success score for the policy action is determined based on whether one or more predetermined types of implicit user feedback are detected after performing the policy action. A set of parameter values of the policy model is modified using the determined success score.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.