Multi-channel voice activity detection
US12154547B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 21, 2023 |
| Grant date | Nov 26, 2024 |
| Priority date | — |
| Expiry date | Sep 21, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02166
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for multi-channel voice activity detection includes receiving a sequence of input frames characterizing streaming multi-channel audio captured by an array of microphones. Each channel of the streaming multi-channel audio includes respective audio features captured by a separate dedicated microphone. The method also includes determining, using a location fingerprint model, a location fingerprint indicating a location of a source of the multi-channel audio relative to the user device based on the respective audio features of each channel of the multi-channel audio. The method also includes generating an output from an application-specific classifier. The first score indicates a likelihood that the multi-channel audio corresponds to a particular audio type that the particular application is configured to process. The method also includes determining whether to accept or reject the multi-channel audio for processing by the particular application based on the first score generated as output from the application-specific classifier.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.