Instructions and logic to perform floating-point and integer operations for machine learning
US11169799B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 5, 2019 |
| Grant date | Nov 9, 2021 |
| Priority date | — |
| Expiry date | Jun 5, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T15/005
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
One embodiment provides for a graphics processing unit to accelerate machine-learning operations, the graphics processing unit comprising a multiprocessor having a single instruction, multiple thread (SIMT) architecture, the multiprocessor to execute at least one single instruction; and a first compute unit included within the multiprocessor, the at least one single instruction to cause the first compute unit to perform a two-dimensional matrix multiply and accumulate operation, wherein to perform the two-dimensional matrix multiply and accumulate operation includes to compute a 32-bit intermediate product of 16-bit operands and to compute a 32-bit sum based on the 32-bit intermediate product.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.