Patent · US Active

Method and device for optimizing neural network

US12079722B2 · kind B2 · utility

0Cited by

1References

22Claims

0Family size

Assignee

Beijing Tusen Zhitu Technology Co., Ltd. · CN

Inventors

Yuwei Hu · Beijing, CN
Jiangming JIN · Beijing, CN
Lei Su · Beijing, CN
Dinghua Li · San Diego, US

Key dates

Filing date	Feb 1, 2023
Grant date	Sep 3, 2024
Priority date	—
Expiry date	Feb 1, 2043

Classification

Technology area (CPC H)Electricity
CPC primaryH03M7/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The embodiments of this application provide a method and device for optimizing neural network. The method includes: binarizing and bit-packing input data of a convolution layer along a channel direction, and obtaining compressed input data; binarizing and bit-packing respectively each convolution kernel of the convolution layer along the channel direction, and obtaining each corresponding compressed convolution kernel; dividing the compressed input data sequentially in a convolutional computation order into blocks of the compressed input data with the same size of each compressed convolution kernel, wherein the data input to one time convolutional computation form a data block; and, taking a convolutional computation on each block of the compressed input data and each compressed convolution kernel sequentially, obtaining each convolutional result data, and obtaining multiple output data of the convolution layer according to each convolutional result data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.