Distributed endpointing for speech recognition
US9818407B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 7, 2013 |
| Grant date | Nov 14, 2017 |
| Priority date | — |
| Expiry date | Jul 14, 2033 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/87
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An efficient audio streaming method and apparatus includes a client process implemented on a client or local device and a server process implemented on one or more remote servers. The client process and server process each have speech recognition components and communicate over a network, and together efficiently manage the detection of speech in an audio signal streamed by the local device to the server for speech recognition and potentially further processing at the server. The client process monitors audio input and, in a first detection stage, implements endpointing on the local device to determine when speech is detected. The client process may further determine if a “wakeword” is detected, and then the client process opens a connection and begins streaming audio to the server process via the network. The server process receives the speech audio stream and monitors the audio, implementing endpointing in the server process, to determine when to tell the client process to close the connection and stop streaming audio. The client process continues streaming audio to the server until the server process determines disconnect criteria have been met and tells the client process to s…
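The two-stage flow described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the patented implementation: the energy threshold, the "N consecutive silent frames" disconnect criterion, and the names `Client`, `Server`, `is_speech`, and `run` are all hypothetical, and the optional wakeword check is omitted. Audio frames are stood in for by single energy values, and the network connection by a direct method call.

```python
from dataclasses import dataclass, field
from typing import Iterable, List, Tuple

SPEECH_THRESHOLD = 0.5      # hypothetical per-frame energy threshold
SERVER_SILENCE_FRAMES = 3   # hypothetical server-side disconnect criterion


def is_speech(frame_energy: float) -> bool:
    """Toy speech detector: a frame counts as speech if its energy is high."""
    return frame_energy >= SPEECH_THRESHOLD


@dataclass
class Server:
    """Second stage: decides when the client should stop streaming."""
    received: List[float] = field(default_factory=list)
    trailing_silence: int = 0

    def receive(self, frame_energy: float) -> bool:
        """Accept one streamed frame; return True to tell the client to stop."""
        self.received.append(frame_energy)
        if is_speech(frame_energy):
            self.trailing_silence = 0
        else:
            self.trailing_silence += 1
        if self.trailing_silence >= SERVER_SILENCE_FRAMES:
            self.trailing_silence = 0  # reset for the next utterance
            return True
        return False


@dataclass
class Client:
    """First stage: starts streaming only once local speech is detected."""
    server: Server
    streaming: bool = False
    sent: int = 0

    def on_frame(self, frame_energy: float) -> None:
        if not self.streaming:
            if not is_speech(frame_energy):
                return              # nothing detected locally, send nothing
            self.streaming = True   # "open the connection"
        self.sent += 1
        if self.server.receive(frame_energy):
            self.streaming = False  # server asked us to disconnect


def run(frames: Iterable[float]) -> Tuple[Client, Server]:
    """Feed a sequence of frame energies through the client."""
    server = Server()
    client = Client(server)
    for energy in frames:
        client.on_frame(energy)
    return client, server


if __name__ == "__main__":
    demo = [0.1, 0.1, 0.9, 0.8, 0.2, 0.7, 0.1, 0.1, 0.1, 0.1, 0.1]
    client, server = run(demo)
    print(client.sent, client.streaming)
```

In this sketch the client keeps streaming through the short pause at `0.2` because only the server decides when to disconnect, which mirrors the division of labour the abstract describes.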
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.