Efficient deployment of machine learning and deep learning model's pipeline for serving service level agreement
US11775264B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 2, 2021 |
| Grant date | Oct 3, 2023 |
| Priority date | — |
| Expiry date | Apr 8, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
This disclosure relates generally to configuring/building of applications. Typically, a deep learning (DL) application having multiple models composed and interspersed with corresponding transformation functions has no mechanism of efficient deployment on underlying system resources. The disclosed system accelerates the development of application to compose multiple models where each model could be a primitive model or a composite model itself. In an embodiment, the disclosed system optimally deploys a composable model application and transformation functions on underlying resources using performance prediction models, thereby accelerating the development and deployment of the application.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.