Patent · US Active

Unified optimization for convolutional neural network model inference on integrated graphics processing units

US11797876B1 · kind B1 · utility

6Cited by
15References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 26, 2019
Grant dateOct 24, 2023
Priority date
Expiry dateAug 24, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/08
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for optimizing and deploying convolutional neural network (CNN) machine learning models for inference using integrated graphics processing units are described. A model compilation system optimizes CNN models using optimized vision-specific operators as well as both graph-level tuning and tensor-level tuning to explore the optimization space for achieving heightened performance. The model compilation system may also implement a heuristic-based two-stage technique for falling back certain operators of CNN models to use CPUs when needed or otherwise beneficial.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.