Patent · US Active

Analytical model to optimize deep learning models

US12293299B1 · kind B1 · utility

Cited by: 0
References: 1
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 3, 2021
Grant date: May 6, 2025
Priority date
Expiry date: Mar 5, 2044

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/063
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Techniques for optimizing and deploying deep neural network (DNN) machine learning models for inference using static analysis are described. A method includes obtaining a deep neural network (DNN) machine learning (ML) model, generating an intermediate representation for the ML model, the intermediate representation including one or more nodes corresponding to one or more operators utilized by the ML model, identifying, for at least one node of the intermediate representation, an optimized schedule for at least one operator corresponding to the at least one node using a static analysis that is based on a hardware-specific cost model, generating an optimized intermediate representation using the optimized schedule that is optimized for execution on a hardware platform, and generating code corresponding to the ML model based at least in part on the optimized intermediate representation, wherein the code is specific to the hardware platform.
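
Illustrative sketch (not part of the patent record): the steps named in the abstract can be pictured with a toy Python pipeline. The abstract does not disclose the cost model, the schedule space, or the code generator, so everything here is an assumption made for illustration: the `Hardware` and `MatMulNode` types, the roofline-style estimate in `estimate_cost`, the candidate tile sizes, and the emitted `matmul_tile` call are all hypothetical. The point is only that a schedule can be chosen analytically from hardware parameters, without running the operator on the device.

```python
from dataclasses import dataclass
from itertools import product
from math import ceil, inf
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Hardware:
    """Hypothetical hardware description consumed by the cost model."""
    peak_flops: float     # floating-point operations per second
    mem_bandwidth: float  # bytes per second between memory and cache
    cache_bytes: int      # on-chip cache capacity in bytes


@dataclass
class MatMulNode:
    """Toy IR node for one operator: C[m, n] = A[m, k] @ B[k, n]."""
    name: str
    m: int
    n: int
    k: int
    schedule: Optional[Tuple[int, int]] = None  # (tile_m, tile_n) once chosen


def estimate_cost(node: MatMulNode, tile: Tuple[int, int], hw: Hardware,
                  dtype_bytes: int = 4) -> float:
    """Assumed roofline-style estimate of run time in seconds.

    Nothing is executed: the estimate uses only shapes and hardware numbers.
    """
    tm, tn = tile
    # Bytes needed on chip for one A tile, one B tile, and one C tile.
    working_set = (tm * node.k + node.k * tn + tm * tn) * dtype_bytes
    if working_set > hw.cache_bytes:
        return inf  # treat a tile that overflows the cache as infeasible
    num_tiles = ceil(node.m / tm) * ceil(node.n / tn)
    bytes_moved = num_tiles * working_set
    flops = 2 * node.m * node.n * node.k
    # The slower of compute and memory traffic bounds the run time.
    return max(flops / hw.peak_flops, bytes_moved / hw.mem_bandwidth)


def pick_schedule(node: MatMulNode, hw: Hardware,
                  tile_sizes=(8, 16, 32, 64)) -> Tuple[int, int]:
    """Return the candidate tile with the lowest estimated cost."""
    candidates = product(tile_sizes, repeat=2)
    return min(candidates, key=lambda t: estimate_cost(node, t, hw))


def optimize_ir(ir: List[MatMulNode], hw: Hardware) -> List[MatMulNode]:
    """Build a new IR in which every node carries its chosen schedule."""
    return [MatMulNode(n.name, n.m, n.n, n.k, pick_schedule(n, hw)) for n in ir]


def generate_code(ir: List[MatMulNode]) -> str:
    """Emit a C-like tiled loop nest per node (text only, for illustration)."""
    lines = []
    for node in ir:
        tm, tn = node.schedule
        lines.append(f"// {node.name}: tile {tm}x{tn}")
        lines.append(f"for (i0 = 0; i0 < {node.m}; i0 += {tm})")
        lines.append(f"  for (j0 = 0; j0 < {node.n}; j0 += {tn})")
        lines.append(f"    matmul_tile(i0, j0, {tm}, {tn}, {node.k});")
    return "\n".join(lines)


if __name__ == "__main__":
    hw = Hardware(peak_flops=1e12, mem_bandwidth=1e11, cache_bytes=64 * 1024)
    ir = [MatMulNode("fc1", 256, 256, 256)]
    print(generate_code(optimize_ir(ir, hw)))
```

In this sketch, changing the `Hardware` numbers changes which tile wins, which is one simple way to read "hardware-specific cost model"; a real system would cover many operator types and a much larger schedule space.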

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.