Accent invariant speech recognition
US10446136B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 11, 2017 |
| Grant date | Oct 15, 2019 |
| Priority date | — |
| Expiry date | Jun 24, 2037 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/0631
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system and method for accent invariant speech recognition comprising: maintaining a database storing a set of language units in a given language, and for each of the language units, storing audio samples of pronunciation variations of the language unit pronounced by a plurality of speakers; extracting and storing in the database a feature vector for locating each of the audio samples in a feature space; identifying pronunciation variation distances, which are distances between locations of audio samples of the same language unit in the feature space, and inter-unit distances, which are distances between locations of audio samples of different language units in the feature space; calculating a transformation applicable on the feature space to reduce the pronunciation variation distances relative to the inter-unit distances; and based on the calculated transformation, training a processor to classify as a same language unit pronunciation variations of the same language unit.
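The distance-reducing transformation described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the transformation is calculated or how the classifier is trained, so everything below is an assumption made for illustration: the synthetic two-dimensional "feature vectors", the whitening of the pooled within-unit scatter (a step borrowed from linear discriminant analysis), the nearest-centroid classifier, and all function and variable names are hypothetical and are not taken from the patent.

```python
import numpy as np

def mean_pairwise_distances(features, labels):
    """Return (mean same-unit distance, mean different-unit distance)."""
    diffs = features[:, None, :] - features[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    same = labels[:, None] == labels[None, :]
    upper = np.triu(np.ones(same.shape, dtype=bool), k=1)  # count each pair once
    return dists[same & upper].mean(), dists[~same & upper].mean()

def fit_within_unit_whitening(features, labels, eps=1e-6):
    """Assumed stand-in for the transformation: whiten the within-unit scatter."""
    dim = features.shape[1]
    scatter = np.zeros((dim, dim))
    for unit in np.unique(labels):
        group = features[labels == unit]
        centered = group - group.mean(axis=0)
        scatter += centered.T @ centered
    scatter /= len(features)
    eigvals, eigvecs = np.linalg.eigh(scatter + eps * np.eye(dim))
    # Symmetric inverse square root of the within-unit scatter matrix.
    return (eigvecs * eigvals ** -0.5) @ eigvecs.T

# Synthetic data (hypothetical): two language units separated along axis 1,
# with large accent-like variation along axis 0.
rng = np.random.default_rng(0)
n_per_unit = 200
unit_centers = np.array([[0.0, 0.0], [0.0, 3.0]])
labels = np.repeat(np.arange(2), n_per_unit)
features = unit_centers[labels] + rng.normal(size=(2 * n_per_unit, 2)) * [5.0, 0.5]

intra_before, inter_before = mean_pairwise_distances(features, labels)
ratio_before = intra_before / inter_before

transform = fit_within_unit_whitening(features, labels)
transformed = features @ transform
intra_after, inter_after = mean_pairwise_distances(transformed, labels)
ratio_after = intra_after / inter_after

# Assumed classifier: assign each sample to the nearest unit centroid
# in the transformed feature space.
units = np.unique(labels)
centroids = np.stack([transformed[labels == u].mean(axis=0) for u in units])
to_centroids = np.linalg.norm(transformed[:, None, :] - centroids[None, :, :], axis=-1)
predicted = units[np.argmin(to_centroids, axis=1)]
accuracy = (predicted == labels).mean()

print(f"intra/inter distance ratio before: {ratio_before:.2f}")
print(f"intra/inter distance ratio after:  {ratio_after:.2f}")
print(f"nearest-centroid accuracy after transformation: {accuracy:.3f}")
```

In this toy setup the transformation shrinks the axis along which pronunciations of the same unit vary and stretches the axis that separates the units, so the ratio of same-unit to different-unit distances drops. A real system would work on acoustic feature vectors extracted from audio and might use a learned, possibly nonlinear, transformation instead.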
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.