Low latency audio processing techniques
US12412567B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 5, 2021 |
| Grant date | Sep 9, 2025 |
| Priority date | — |
| Expiry date | Jan 26, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/063
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques for reducing latency in processing of audio data, where the latency may be caused in detecting audio of interest in the audio data, are described. A device that captures audio data may include a detection component to determine when the audio data includes audio of interest (e.g., device-directed speech), and an audio embedding generator to generate embedding vectors for the captured audio data while the detection component processes the audio data. The device may generate an embedding vector for audio data captured at the device for a duration of time; determine, at the end of the duration of time, that the audio data represents audio of interest; and send the embedding vector to an audio processing component (e.g., an automatic speech recognition component) for processing.
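The flow in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class names (`RunningEmbedding`, `EnergyDetector`, `FrontEnd`), the energy-based detector, and the two-value "embedding" are all hypothetical stand-ins chosen to keep the example self-contained. The point it shows is only the ordering: the embedding is updated frame by frame while the detector is still deciding, so once the detector reports audio of interest at the end of the window, the vector is already available to forward.

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]  # one short chunk of audio samples


def frame_energy(frame: Frame) -> float:
    """Mean squared amplitude of a frame."""
    return sum(x * x for x in frame) / len(frame)


class RunningEmbedding:
    """Toy embedding generator (assumption): [mean energy, peak energy]."""

    def __init__(self) -> None:
        self.reset()

    def reset(self) -> None:
        self.count = 0
        self.total = 0.0
        self.peak = 0.0

    def update(self, frame: Frame) -> None:
        energy = frame_energy(frame)
        self.count += 1
        self.total += energy
        self.peak = max(self.peak, energy)

    def vector(self) -> List[float]:
        return [self.total / self.count, self.peak]


class EnergyDetector:
    """Toy detection component (assumption): mean energy above a threshold."""

    def __init__(self, threshold: float) -> None:
        self.threshold = threshold
        self.reset()

    def reset(self) -> None:
        self.count = 0
        self.total = 0.0

    def update(self, frame: Frame) -> None:
        self.count += 1
        self.total += frame_energy(frame)

    def is_of_interest(self) -> bool:
        return self.total / self.count > self.threshold


class FrontEnd:
    """Feeds each frame to the detector and the embedder side by side."""

    def __init__(self, window_frames: int, detector: EnergyDetector,
                 embedder: RunningEmbedding,
                 send: Callable[[List[float]], None]) -> None:
        self.window_frames = window_frames
        self.detector = detector
        self.embedder = embedder
        self.send = send  # stands in for the downstream audio processing component
        self.seen = 0

    def push(self, frame: Frame) -> bool:
        """Process one frame; return True if an embedding was forwarded."""
        self.detector.update(frame)
        self.embedder.update(frame)  # embedding work overlaps detection
        self.seen += 1
        if self.seen < self.window_frames:
            return False
        # End of the duration: the detector decides, the vector is already built.
        forwarded = self.detector.is_of_interest()
        if forwarded:
            self.send(self.embedder.vector())
        self.detector.reset()
        self.embedder.reset()
        self.seen = 0
        return forwarded
```

As a usage example, constructing `FrontEnd(3, EnergyDetector(0.5), RunningEmbedding(), sent.append)` with `sent = []` and pushing three loud frames followed by three quiet ones forwards one vector for the loud window and nothing for the quiet one. A real device would presumably run a trained detection model and a neural embedding generator concurrently; the sequential per-frame updates here only approximate that overlap.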
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.