Patent · US Active

Reinforcement learning for active sequence processing

US12175737B2 · kind B2 · utility

0Cited by

4References

31Claims

0Family size

Assignee

DeepMind Technologies Limited · GB

Inventors

Viorica Patraucean · London, GB
Bilal Piot · London, GB
Joao Carreira · Childwickbury, GB
Volodymyr Mnih · Toronto, CA
Simon Osindero · London, GB

Key dates

Filing date	Nov 13, 2020
Grant date	Dec 24, 2024
Priority date	—
Expiry date	Jan 8, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06N7/01
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A system that is configured to receive a sequence of task inputs and to perform a machine learning task is described. The system includes a reinforcement learning (RL) neural network and a task neural network. The RL neural network is configured to: generate, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, and provide the respective decision of each task input to the task neural network. The task neural network is configured to: receive the sequence of task inputs, receive, from the RL neural network, for each task input of the sequence of task inputs, a respective decision that determines whether to encode the task input or to skip the task input, process each of the un-skipped task inputs in the sequence of task inputs to generate a respective accumulated feature for the un-skipped task input, wherein the respective accumulated feature characterizes features of the un-skipped task input and of previous un-skipped task inputs in the sequence, and generate a machine learning task output for the machine learning task based on the last accumulated feature generated for the last un-sk…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.