Patent · US Active

Online asynchronous reinforcement learning from concurrent customer histories

US8909590B2 · kind B2 · utility

12Cited by

5References

29Claims

0Family size

Assignee

NICE SYSTEMS TECHNOLOGIES UK LIMITED · GB

Inventors

Leonard Michael Newnham · London, GB
Jason Derek McFall · London, GB
David James Barker · Faringdon, GB
David Silver · Hitchin, GB

Key dates

Filing date	Sep 28, 2012
Grant date	Dec 9, 2014
Priority date	—
Expiry date	Jun 14, 2033

Classification

Technology area (CPC G)Physics
CPC primaryG06N20/00
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

In one embodiment, an indication of a Decision Request or an Update Request may be received, where the Update Request is activated independent of user activity. A user state pertaining to at least one user may be received, obtained, accessed or constructed. For the Decision Request, one or more actions may be scored according to one or more value functions associated with a computing device, a policy associated with the computing device may be applied to identify one of the scored actions as a decision, and an indication of the decision may be provided or applied. For the Update Request, the one or more value functions and/or the policy may be updated. An indication of updates to the one or more value functions and/or an indication of updates to the policy may be provided.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.