Patent · US Active

Intermediate data for inter-device speech processing

US11721347B1 · kind B1 · utility

3Cited by

2References

20Claims

0Family size

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventors

Stanislaw Ignacy Pasko · Luboń, PL
Pawel Zelazko · Gdańsk, PL
Cagdas Bak · Gdańsk, PL
Eli Joshua Fidler · Toronto, CA
Michal Kowalczuk · Gdańsk, PL
Andrew Oberlin · Lynnwood, US
Ariya Rastrow · Seattle, US

Key dates

Filing date	Jun 29, 2021
Grant date	Aug 8, 2023
Priority date	—
Expiry date	Jan 20, 2042

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/088
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.