Speech recognition and text-to-speech learning system
US10089974B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 31, 2016 |
| Grant date | Oct 2, 2018 |
| Priority date | — |
| Expiry date | Apr 9, 2036 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/07
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An example text-to-speech learning system performs a method for generating a pronunciation sequence conversion model. The method includes generating a first pronunciation sequence from a speech input of a training pair and generating a second pronunciation sequence from a text input of the training pair. The method also includes determining a pronunciation sequence difference between the first pronunciation sequence and the second pronunciation sequence; and generating a pronunciation sequence conversion model based on the pronunciation sequence difference. An example speech recognition learning system performs a method for generating a pronunciation sequence conversion model. The method includes extracting an audio signal vector from a speech input and applying an audio signal conversion model to the audio signal vector to generate a converted audio signal vector. The method also includes adapting an acoustic model based on the converted audio signal vector to generate an adapted acoustic model.
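The text-to-speech half of the abstract (two pronunciation sequences, their difference, a conversion model built from that difference) can be illustrated with a minimal sketch. This is not the patented implementation. It assumes both pronunciation sequences are already available as lists of phoneme symbols, uses Python's standard `difflib.SequenceMatcher` as a stand-in for whatever alignment the patent uses, and treats the "conversion model" as a simple lookup table of the most frequent substitutions. The function names `pronunciation_difference` and `build_conversion_model`, the ARPAbet-style symbols, and the training data are all hypothetical.

```python
from collections import Counter, defaultdict
from difflib import SequenceMatcher


def pronunciation_difference(from_text, from_speech):
    """Return (text_span, speech_span) pairs where the two sequences disagree.

    Assumption: both inputs are lists of phoneme symbols. SequenceMatcher is a
    stand-in alignment method, not the one described in the patent.
    """
    matcher = SequenceMatcher(a=from_text, b=from_speech, autojunk=False)
    diffs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            diffs.append((tuple(from_text[i1:i2]), tuple(from_speech[j1:j2])))
    return diffs


def build_conversion_model(training_pairs):
    """Map each text-derived span to its most frequent speech-derived span.

    Assumption: a frequency table is a toy substitute for a learned model.
    """
    counts = defaultdict(Counter)
    for from_text, from_speech in training_pairs:
        for text_span, speech_span in pronunciation_difference(from_text, from_speech):
            counts[text_span][speech_span] += 1
    return {span: c.most_common(1)[0][0] for span, c in counts.items()}


# Hypothetical training pairs: (sequence from text input, sequence from speech input).
pairs = [
    (["T", "AH", "M", "EY", "T", "OW"], ["T", "AH", "M", "AA", "T", "OW"]),
    (["D", "EY", "T", "AH"], ["D", "AA", "T", "AH"]),
    (["D", "EY", "T", "AH"], ["D", "EH", "T", "AH"]),
]
model = build_conversion_model(pairs)
print(model)
```

In this toy data the text-derived phoneme `EY` is realized as `AA` twice and as `EH` once, so the table records `EY` to `AA`. A real system would presumably condition on context and learn a statistical or neural mapping rather than a lookup table.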
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.