Enhanced endpoint detection for speech recognition
US9437186B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 19, 2013 |
| Grant date | Sep 6, 2016 |
| Priority date | — |
| Expiry date | Feb 15, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/223
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Determining the end of an utterance for purposes of automatic speech recognition (ASR) may be improved with a system that provides early results and/or incorporates semantic tagging. Early ASR results of an incoming utterance may be prepared based at least in part on an estimated endpoint and processed by a natural language understanding (NLU) process while final results, based at least in part on a final endpoint, are determined. If the early results match the final results, the early NLU results are already prepared for early execution. The endpoint may also be determined based at least in part on the content of the utterance, as represented by semantic tagging output from ASR processing. If the tagging indicate completion of a logical statement, an endpoint may be declared, or a threshold for silent frames prior to declaring an endpoint may be adjusted.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.