Patent · US Active

Method and apparatus for compressing neural network model

US11861498B2 · kind B2 · utility

Cited by	0
References	0
Claims	17
Family size	0

Assignee

BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD. · CN

Inventors

Guibin Wang · Nanjing, CN
Shijun Cong · Beijing, CN
Hao Dong · Santa Clara, US
Lei Jia · Beijing, CN

Key dates

Filing date	Oct 18, 2022
Grant date	Jan 2, 2024
Priority date	—
Expiry date	Oct 18, 2042

Classification

Technology area (CPC G)	Physics
CPC primary	G06N3/0495
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method for compressing a neural network model includes acquiring a to-be-compressed neural network model. A first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model are determined. A target value is obtained according to the first bit width, the second bit width and the target thinning rate. Then the to-be-compressed neural network model is compressed using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.
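
The abstract does not state how the target value is derived or what each bit width denotes, so the following is only a minimal sketch under stated assumptions, not the patented procedure. It assumes the first bit width is the storage width of a quantized weight, the second bit width is the width of one packed word, the thinning rate is the fraction of weights set to zero, and the target value is a group size chosen so that the surviving weights of one group fill exactly one packed word. The function names and the group-size rule are hypothetical.

```python
def target_value(first_bits, second_bits, thinning_rate):
    """Hypothetical rule (not taken from the patent): pick a group size so that
    the weights kept in one group fill one word of `second_bits` bits."""
    if not 0.0 <= thinning_rate < 1.0:
        raise ValueError("thinning_rate must be in [0, 1)")
    kept = second_bits // first_bits              # weights that fit in one word
    group = round(kept / (1.0 - thinning_rate))   # group size before thinning
    return kept, group


def compress(weights, first_bits, second_bits, thinning_rate):
    """Thin each group by magnitude, then quantize survivors to signed integers
    of `first_bits` bits using one symmetric scale for the whole list."""
    kept, group = target_value(first_bits, second_bits, thinning_rate)
    qmax = 2 ** (first_bits - 1) - 1
    max_abs = max((abs(w) for w in weights), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0

    quantized = []
    for start in range(0, len(weights), group):
        block = weights[start:start + group]
        # Indices of the `kept` largest-magnitude weights in this group.
        order = sorted(range(len(block)), key=lambda i: abs(block[i]), reverse=True)
        keep = set(order[:kept])
        for i, w in enumerate(block):
            if i in keep:
                q = round(w / scale)
                quantized.append(max(-qmax, min(qmax, q)))  # clamp to range
            else:
                quantized.append(0)                          # thinned away
    return quantized, scale


# Example: 8-bit weights, 32-bit packed word, half of the weights removed.
example_weights = [0.1, -0.9, 0.3, 0.05, 1.27, -0.2, 0.6, 0.0]
example_q, example_scale = compress(example_weights, 8, 32, 0.5)
```

With these assumed inputs the group size works out to 8 and four weights survive per group, so each thinned group occupies one 32-bit word. A different reading of the two bit widths or of the thinning rate would lead to a different rule for the target value.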

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.