Patent · US Active

Speech recognition and text-to-speech learning system

US10089974B2 · kind B2 · utility

1Cited by
10References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 31, 2016
Grant dateOct 2, 2018
Priority date
Expiry dateApr 9, 2036

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/07
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An example text-to-speech learning system performs a method for generating a pronunciation sequence conversion model. The method includes generating a first pronunciation sequence from a speech input of a training pair and generating a second pronunciation sequence from a text input of the training pair. The method also includes determining a pronunciation sequence difference between the first pronunciation sequence and the second pronunciation sequence; and generating a pronunciation sequence conversion model based on the pronunciation sequence difference. An example speech recognition learning system performs a method for generating a pronunciation sequence conversion model. The method includes extracting an audio signal vector from a speech input and applying an audio signal conversion model to the audio signal vector to generate a converted audio signal vector. The method also includes adapting an acoustic model based on the converted audio signal vector to generate an adapted acoustic model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.