Automatically scaling compute resources for heterogeneous workloads
US10761893B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 23, 2018 |
| Grant date | Sep 1, 2020 |
| Priority date | — |
| Expiry date | Feb 28, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F2209/508
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques are described for automatically scaling (or “auto scaling”) compute resources—for example, virtual machine (VM) instances, containers, or standalone servers—used to support execution of service-oriented software applications and other types of applications that may process heterogeneous workloads. The resource requirements for a software application can be approximated by measuring “worker pool” utilization of instances of each service, where a worker pool represents a number of requests that the service can process concurrently. A scaling service can thus be configured to scale the compute instances provisioned for a service in proportion to worker pool utilization, that is, compute instances can be added as the fleet's worker pools become more “busy,” while compute instances can be removed when worker pools become inactive.
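The proportional scaling idea in the abstract can be illustrated with a minimal Python sketch. It is not the patent's claimed method: it uses a generic target-tracking rule (desired count = current count × utilization ÷ target utilization, rounded up), and the names `InstanceStats`, `fleet_utilization`, `desired_instance_count`, and the parameters `target_utilization`, `min_instances`, and `max_instances` are all hypothetical, chosen only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class InstanceStats:
    """Hypothetical per-instance snapshot of a service's worker pool."""
    busy_workers: int  # requests currently being processed
    pool_size: int     # requests the instance can process concurrently


def fleet_utilization(fleet):
    """Fraction of all workers in the fleet that are busy (0.0 to 1.0)."""
    total_workers = sum(i.pool_size for i in fleet)
    if total_workers == 0:
        return 0.0
    return sum(i.busy_workers for i in fleet) / total_workers


def desired_instance_count(fleet, target_utilization=0.5,
                           min_instances=1, max_instances=100):
    """Scale the instance count in proportion to worker pool utilization.

    Assumed rule (not taken from the patent): keep fleet-wide utilization
    near target_utilization, clamped to [min_instances, max_instances].
    """
    if not 0 < target_utilization <= 1:
        raise ValueError("target_utilization must be in (0, 1]")
    utilization = fleet_utilization(fleet)
    desired = math.ceil(len(fleet) * utilization / target_utilization)
    return max(min_instances, min(max_instances, desired))


# Four instances with 10 workers each, 8 busy on every instance (80% busy).
busy_fleet = [InstanceStats(busy_workers=8, pool_size=10) for _ in range(4)]
# The same fleet when nearly idle (10% busy).
idle_fleet = [InstanceStats(busy_workers=1, pool_size=10) for _ in range(4)]

scale_out = desired_instance_count(busy_fleet)  # more instances than 4
scale_in = desired_instance_count(idle_fleet)   # fewer instances than 4
```

Under these assumptions a busy fleet grows and a mostly idle fleet shrinks toward the configured minimum. Measuring busy workers rather than CPU or memory is what lets a single rule cover heterogeneous requests whose per-request resource cost varies.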
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.