Auto-scaling hosted machine learning models for production inference
US11126927B2 · kind B2 · utility
Cited by: 1
References: 3
Claims: 20
Family size: 0
Assignee
Inventors
Key dates
| Filing date | Nov 24, 2017 |
| Grant date | Sep 21, 2021 |
| Priority date | — |
| Expiry date | Jun 15, 2040 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04L61/5007
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques for auto-scaling hosted machine learning models for production inference are described. A machine learning model can be deployed in a hosted environment such that the infrastructure supporting the machine learning model scales dynamically with demand so that performance is not impacted. The model can be auto-scaled using reactive techniques or predictive techniques.
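As an illustration only (the abstract does not specify any algorithm), the difference between reactive and predictive scaling might be sketched as follows. The function names, the per-replica capacity parameter, the replica bounds, and the two-point linear forecast are all assumptions made for this sketch, not details taken from the patent.

```python
import math
from typing import Sequence


def reactive_replicas(current_load: float, per_replica_capacity: float,
                      min_replicas: int = 1, max_replicas: int = 10) -> int:
    """Reactive scaling (hypothetical): size the fleet to the load observed now."""
    needed = math.ceil(current_load / per_replica_capacity)
    # Clamp to the allowed fleet size.
    return max(min_replicas, min(max_replicas, needed))


def predictive_replicas(load_history: Sequence[float], per_replica_capacity: float,
                        min_replicas: int = 1, max_replicas: int = 10) -> int:
    """Predictive scaling (hypothetical): size the fleet to a forecast of the next interval.

    The forecast here is a naive linear extrapolation from the last two samples;
    a real system would likely use a richer model.
    """
    if len(load_history) >= 2:
        forecast = load_history[-1] + (load_history[-1] - load_history[-2])
    elif load_history:
        forecast = load_history[-1]
    else:
        forecast = 0.0
    forecast = max(forecast, 0.0)  # a falling trend must not produce negative load
    return reactive_replicas(forecast, per_replica_capacity, min_replicas, max_replicas)
```

With an assumed capacity of 100 requests per second per replica, an observed load of 250 gives 3 replicas reactively, while a history of 100, 200, 300 forecasts 400 and gives 4 replicas predictively, so capacity is added before the extra demand arrives.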
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.