Patent · US Active

Enhanced endpoint detection for speech recognition

US9437186B1 · kind B1 · utility

318Cited by
9References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 19, 2013
Grant dateSep 6, 2016
Priority date
Expiry dateFeb 15, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/223
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Determining the end of an utterance for purposes of automatic speech recognition (ASR) may be improved with a system that provides early results and/or incorporates semantic tagging. Early ASR results of an incoming utterance may be prepared based at least in part on an estimated endpoint and processed by a natural language understanding (NLU) process while final results, based at least in part on a final endpoint, are determined. If the early results match the final results, the early NLU results are already prepared for early execution. The endpoint may also be determined based at least in part on the content of the utterance, as represented by semantic tagging output from ASR processing. If the tagging indicate completion of a logical statement, an endpoint may be declared, or a threshold for silent frames prior to declaring an endpoint may be adjusted.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.