Disambiguation in speech recognition
US9558740B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 30, 2015 |
| Grant date | Jan 31, 2017 |
| Priority date | — |
| Expiry date | Mar 30, 2035 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/30
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Automatic speech recognition (ASR) processing including a feedback configuration to allow for improved disambiguation between ASR hypotheses. After ASR processing of an incoming utterance where the ASR outputs an N-best list including multiple hypotheses, the multiple hypotheses are passed downstream for further processing. The downstream further processing may include natural language understanding (NLU) or other processing to determine a command result for each hypothesis. The command results are compared to determine if any hypotheses of the N-best list would yield similar command results. If so, the hypothesis(es) with similar results are removed from the N-best list so that only one hypothesis of the similar results remains in the N-best list. The remaining non-similar hypotheses are sent for disambiguation, or, if only one hypothesis remains, it is sent for execution.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.