Patent · US Active

Method and device for reducing a size of a neural network model

US11915138B2 · kind B2 · utility

0Cited by

0References

14Claims

0Family size

Assignee

Alibaba Group Holding Limited · KY

Inventors

Weifeng Zhang · Nanhu, CN
Guoyang CHEN · San Mateo, US
Yu Pu · San Diego, US
Yongzhi Zhang · Wayland, US
Yuan Xie · San Francisco, US

Key dates

Filing date	Feb 18, 2020
Grant date	Feb 27, 2024
Priority date	—
Expiry date	Jun 22, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/045
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods and apparatus for reducing a size of a neural network model, the method including: compressing data of the neural network model; identifying structure information of a vector register, wherein the structure information includes a number of registers included in the vector register; comparing a number of elements in the compressed data with a first condition, wherein the first condition is determined based on the number of registers in the vector register; and in response to the number of elements satisfying the first condition, associating the compressed data with the vector register to enable loading the compressed data to the vector register.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.