Patent · US Active

Distributed endpointing for speech recognition

US9818407B1 · kind B1 · utility

162Cited by

14References

10Claims

0Family size

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventors

Hugh Evan Secker-Walker · Newburyport, US
Kenneth John Basye · Sutton, US
Nikko Strom · Kirkland, US
Ryan Paul Thomas · Redmond, US

Key dates

Filing date	Feb 7, 2013
Grant date	Nov 14, 2017
Priority date	—
Expiry date	Jul 14, 2033

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/87
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

An efficient audio streaming method and apparatus includes a client process implemented on a client or local device and a server process implemented on a remote server or server(s). The client process and server process each have speech recognition components and communicate over a network, and together efficiently manage the detection of speech in an audio signal streamed by the local device to the server for speech recognition and potentially further processing at the server. The client process monitors audio input and in a first detection stage, implements endpointing on the local device to determine when speech is detected. The client process may further determine if a “wakeword” is detected, and then the client process opens a connection and begins streaming audio to the server process via the network. The server process receives the speech audio stream and monitors the audio, implementing endpointing in the server process, to determine when to tell the client process to close the connection and stop streaming audio. The client process continues streaming audio to the server until the server process determines disconnect criteria have been met and tells the client process to s…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.