Method and system for mapping states and actions of an intelligent agent
US9311600B1 · kind B1 · utility
Inventor
Key dates
| Filing date | Jun 2, 2013 |
| Grant date | Apr 12, 2016 |
| Priority date | — |
| Expiry date | Nov 20, 2033 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G05B2219/33002
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method and system comprise providing means and method for producing, modifying, and/or exploiting the structure of a policy manifold. Each of the policies at least comprises information for mapping state and/or sensory information as input to action preferences as output. One or more processing units assign each of the policies a policy coordinate on a policy manifold. The policy coordinate may in part be determined by a dissimilarity matrix or other means for organizing the coordinates of the policies on the policy manifold according to the properties of the policies and the topology of the policy manifold. The policy manifold comprises a dimensionality that is lower than a combined dimensionality of the input and the output, wherein the policy manifold at least in part determines a behavior of the intelligent artificial agent.
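As an illustration of the coordinate assignment described in the abstract, the following is a minimal sketch, not the patented method. It assumes each policy is a small table of action preferences (4 states by 3 actions), that dissimilarity is the Euclidean distance between flattened tables, and that coordinates are obtained by classical multidimensional scaling (MDS). The abstract names only "a dissimilarity matrix or other means", so the choice of MDS, the toy policy family, and the helper names `policy_dissimilarity` and `classical_mds` are all assumptions made for this example.

```python
import numpy as np


def policy_dissimilarity(policies):
    """Pairwise Euclidean distance between flattened policy tables (assumed metric)."""
    flat = np.asarray([np.ravel(p) for p in policies], dtype=float)
    diff = flat[:, None, :] - flat[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def classical_mds(dissimilarity, n_dims):
    """Embed items in n_dims coordinates from a symmetric dissimilarity matrix."""
    d = np.asarray(dissimilarity, dtype=float)
    n = d.shape[0]
    # Double-center the squared dissimilarities to obtain a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (d ** 2) @ centering
    # eigh returns eigenvalues in ascending order; keep the largest n_dims.
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_dims]
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


# Hypothetical policy family: blends of two fixed preference tables,
# so the policies vary along a single latent parameter t.
table_a = np.eye(4, 3)             # prefers a distinct action in states 0..2
table_b = np.full((4, 3), 1 / 3)   # indifferent between actions
policies = [(1 - t) * table_a + t * table_b for t in np.linspace(0.0, 1.0, 6)]

dissim = policy_dissimilarity(policies)
coords = classical_mds(dissim, n_dims=1)   # one policy coordinate per policy
print(coords.ravel())
```

In this toy case each policy table has 12 entries, yet a single coordinate reproduces every pairwise dissimilarity, which is the sense in which the manifold has lower dimensionality than the policy representation. Real policies would rarely lie exactly on such a line, so the embedding would only approximate the dissimilarities.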
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.