Patent · US Active

Learning of policy for selection of associative topic in dialog system

US11574550B2 · kind B2 · utility

1Cited by

1References

16Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Hiroshi Kanayama · Yokohama, JP
Akira Koseki · Sagamihara, JP
Toshiro Takase · Urayasu, JP

Key dates

Filing date	Nov 1, 2017
Grant date	Feb 7, 2023
Priority date	—
Expiry date	May 18, 2038

Classification

Technology area (CPC G)Physics
CPC primaryG09B5/06
WIPO fieldControl
WIPO sectorInstruments

Abstract

A computer-implemented method for learning a policy for selection of an associative topic, which can be used in a dialog system, is described. The method includes obtaining a policy base that indicates a topic transition from a source topic to a destination topic and a short-term reward for the topic transition, by analyzing data from a corpus. The short-term reward may be defined as probability of associating a positive response. The method also includes calculating an expected long-term reward for the topic transition using the short-term reward for the topic transition with taking into account a discounted reward for a subsequent topic transition. The method further includes generating a policy using the policy base and the expected long-term reward for the topic transition. The policy indicates selection of the destination topic for the source topic as an associative topic for a current topic.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.