Patent · US Active

System and method for routing optimization

US10655975B2 · kind B2 · utility

Cited by	2

References	7

Claims	20

Family size	0

Assignee

Alibaba Group Holding Limited · KY

Inventors

Xingwen Zhang · Hangzhou City, CN
Hao Lu · Anfeng Town, CN
Zhigang Hua · Hangzhou City, CN
Shuang Yang · Ellicott City, US

Key dates

Filing date	Dec 20, 2019
Grant date	May 19, 2020
Priority date	—
Expiry date	Dec 20, 2039

Classification

Technology area (CPC G)	Physics
CPC primary	G06N20/00
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining routing using reinforcement learning (RL) are provided. One of the methods includes: initializing a state of an RL model based on a routing solution, wherein the RL model comprises a plurality of improvement actions for applying to the state; applying one or more of the plurality of improvement actions to the state to obtain updated routing solutions until a predetermined condition is satisfied; applying a perturbation action to obtain a perturbed routing solution and feeding the perturbed routing solution back to the RL model for the RL model to perform the applying one or more of the plurality of improvement actions according to the policy; and determining a routing solution with a minimum cost from the updated routing solutions.
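
The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several parts are assumptions made for illustration: the routing problem is reduced to a single closed tour over 2D points, the improvement actions are a segment reversal and a stop relocation, the perturbation is a segment shuffle, the "predetermined condition" is a fixed number of consecutive non-improving steps, and a uniformly random choice stands in for the learned RL policy. All function and parameter names (`route_cost`, `reverse_segment`, `relocate`, `perturb`, `optimize`, `rounds`, `patience`) are hypothetical.

```python
import math
import random


def route_cost(route, coords):
    """Length of the closed tour that visits coords in the order given by route."""
    n = len(route)
    return sum(math.dist(coords[route[i]], coords[route[(i + 1) % n]])
               for i in range(n))


def reverse_segment(route, rng):
    """Improvement action (assumed): reverse a randomly chosen segment."""
    i, j = sorted(rng.sample(range(len(route)), 2))
    return route[:i] + route[i:j + 1][::-1] + route[j + 1:]


def relocate(route, rng):
    """Improvement action (assumed): move one stop to another position."""
    new = route[:]
    stop = new.pop(rng.randrange(len(new)))
    new.insert(rng.randrange(len(new) + 1), stop)
    return new


def perturb(route, rng):
    """Perturbation action (assumed): shuffle a random segment of the route."""
    i, j = sorted(rng.sample(range(len(route)), 2))
    segment = route[i:j + 1]
    rng.shuffle(segment)
    return route[:i] + segment + route[j + 1:]


def optimize(coords, rounds=20, patience=50, seed=0):
    """Alternate improvement phases and perturbations; return the cheapest route seen.

    Requires at least two points, because the actions sample two positions.
    """
    rng = random.Random(seed)
    actions = [reverse_segment, relocate]
    state = list(range(len(coords)))           # initial routing solution
    best, best_cost = state, route_cost(state, coords)
    for _ in range(rounds):
        cost = route_cost(state, coords)
        stale = 0
        # Assumed stopping condition: `patience` consecutive non-improving steps.
        while stale < patience:
            action = rng.choice(actions)       # placeholder for the learned policy
            candidate = action(state, rng)
            candidate_cost = route_cost(candidate, coords)
            if candidate_cost < cost:          # keep only improving updates
                state, cost, stale = candidate, candidate_cost, 0
                if cost < best_cost:
                    best, best_cost = state, cost
            else:
                stale += 1
        # Perturb the current solution and feed it back into the next phase.
        state = perturb(state, rng)
    return best, best_cost
```

Calling `optimize([(0, 0), (3, 1), (1, 4), (5, 5), (2, 2)])` returns a visiting order and its tour length. Tracking `best` separately from `state` matters because the perturbation may make the current solution worse, while the minimum-cost solution found so far must be retained.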

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.