Modular deep learning model
US10235994B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 30, 2016 |
| Grant date | Mar 19, 2019 |
| Priority date | — |
| Expiry date | Jun 30, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/28
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The technology described herein uses a modular model to process speech. A deep learning based acoustic model comprises a stack of different types of neural network layers. The sub-modules of a deep learning based acoustic model can be used to represent distinct non-phonetic acoustic factors, such as accent origins (e.g. native, non-native), speech channels (e.g. mobile, bluetooth, desktop etc.), speech application scenario (e.g. voice search, short message dictation etc.), and speaker variation (e.g. individual speakers or clustered speakers), etc. The technology described herein uses certain sub-modules in a first context and a second group of sub-modules in a second context.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.