Singular value decomposition for improved voice recognition in presence of multi-talker background noise
US9177557B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 7, 2009 |
| Grant date | Nov 3, 2015 |
| Priority date | — |
| Expiry date | Apr 18, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02087
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method for providing speech recognition functionality offers improved accuracy and robustness in noisy environments having multiple speakers. The described technique includes receiving speech energy and converting the received speech energy to a digitized form. The digitized speech energy is decomposed into features that are then projected into a feature space having multiple speaker subspaces. The projected features fall either into one of the multiple speaker subspaces or outside of all speaker subspaces. A speech recognition operation is performed on a selected one of the multiple speaker subspaces to resolve the utterance to a command or data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.