Patent · US Active

System and method for routing optimization

US10655975B2 · kind B2 · utility

2Cited by
7References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 20, 2019
Grant dateMay 19, 2020
Priority date
Expiry dateDec 20, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining routing using reinforcement learning (RL) are provided. One of the methods includes: initializing a state of an RL model based on a routing solution, wherein the RL model comprises a plurality of improvement actions for applying to the state; applying one or more of the plurality of improvement actions to the state to obtain updated routing solutions until a predetermined condition is satisfied; applying a perturbation action to obtain a perturbed routing solution and feeding the perturbed routing solution back to the RL model for the RL model to perform the applying one or more of the plurality of improvement actions according to the policy; and determining a routing solution with a minimum cost from the updated routing solutions.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.