Patent · US Active

Structured pruning for machine learning model

US11816574B2 · kind B2 · utility

0Cited by

3References

20Claims

0Family size

Assignee

Alibaba Group Holding Limited · KY

Inventors

Ao Ren · San Mateo, US
Yuhao Wang · Hangzhou City, CN
Tao Zhang · Beijing, CN
Yuan Xie · San Francisco, US

Key dates

Filing date	Oct 25, 2019
Grant date	Nov 14, 2023
Priority date	—
Expiry date	Jul 30, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG06N3/044
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

An input weight pattern of a machine learning model may be received. The input weight pattern may be pruned to produce an output weight pattern based on a predetermined pruning algorithm. The pruning algorithm may include partitioning the input weight pattern into a plurality of sub-patterns, each row of the input weight pattern including sub-rows of a first number of sub-patterns, and each column of the input weight pattern including sub-columns of a second number of sub-patterns; and pruning sub-columns and sub-rows from the plurality of sub-patterns to achieve predetermined column and row sparsities respectively, with a constraint that at least one sub-row in each row of the input weight pattern is not pruned. The output weight pattern may further be compressed to produce a compact weight pattern. The compact weight pattern has lower memory and computational overheads as compared to the input weight pattern for the machine learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.