Batch matrix multiplication operations in a machine learning accelerator
US12197362B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 26, 2023 |
| Grant date | Jan 14, 2025 |
| Priority date | — |
| Expiry date | Jul 7, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F17/16
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
In one embodiment, a method includes, determining that a bmm operation between a first activation tensor and a second activation tensor needs to be performed, collecting the second activation tensor in two blocks from activation buffers of N tensor processor units, splitting each of the two blocks of the second activation tensor into an MSB tile and an LSB tile, loading the second activation tensor to weight buffers of the N tensor processor units by filling a first entry of each weight buffer of each of the N tensor processor units with contents of the MSB tiles of the two blocks and filling a second entry of the weight buffer with contents of the LSB tiles of the two blocks, and generating a bmm result using the first activation tensor distributed in the activation buffers and the second activation tensor in the weight buffers.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.