Techniques for improved audio processing using acoustic and language identification models
US12236940B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Sep 20, 2022 |
| Grant date | Feb 25, 2025 |
| Priority date | — |
| Expiry date | Aug 10, 2043 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/0635
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system and method for audio processing. A method includes tuning hyperparameters of an acoustic model based on outputs of a language identification (LID) model for a training audio data set and outputs of the acoustic model for the training audio data set; applying the LID model to a first set of features extracted from a processing audio data set in order to produce outputs of the LID model for the processing audio data set; and applying the acoustic model to a second set of features extracted from the processing audio data set and the outputs of the LID model in order to produce outputs of the acoustic model for the processing audio data set.
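The three steps in the abstract (tuning, applying the LID model, then applying the acoustic model to features plus LID outputs) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy `lid_model` and `acoustic_model` functions, the single hyperparameter `lid_weight`, the grid search in `tune_lid_weight`, and the hand-written feature values. The abstract does not say which hyperparameters are tuned or how.

```python
from typing import List, Sequence, Tuple

Features = List[float]
LidOutput = List[float]  # one probability per candidate language (two here)
Example = Tuple[Features, Features, int]  # (LID features, acoustic features, label)


def lid_model(features: Features) -> LidOutput:
    """Hypothetical LID stand-in: clamps the mean feature to a 2-language distribution."""
    mean = sum(features) / len(features)
    p_first = min(max(mean, 0.0), 1.0)
    return [p_first, 1.0 - p_first]


def acoustic_model(features: Features, lid_out: LidOutput, lid_weight: float) -> int:
    """Hypothetical acoustic stand-in: returns label 0 or 1 from a score that
    mixes its own evidence with the LID output, weighted by `lid_weight`."""
    own_score = max(features) - 0.5        # acoustic evidence for label 0
    lid_score = lid_out[0] - lid_out[1]    # LID evidence for label 0
    combined = (1.0 - lid_weight) * own_score + lid_weight * lid_score
    return 0 if combined >= 0 else 1


def tune_lid_weight(training_set: Sequence[Example], candidates: Sequence[float]) -> float:
    """Assumed tuning method (grid search): keep the first candidate whose acoustic
    outputs, computed with the LID outputs, match the most training labels."""
    best_weight, best_correct = candidates[0], -1
    for weight in candidates:
        correct = 0
        for lid_feats, ac_feats, label in training_set:
            lid_out = lid_model(lid_feats)
            if acoustic_model(ac_feats, lid_out, weight) == label:
                correct += 1
        if correct > best_correct:
            best_weight, best_correct = weight, correct
    return best_weight


def process(lid_feats: Features, ac_feats: Features, lid_weight: float) -> Tuple[LidOutput, int]:
    """Apply the LID model to the first feature set, then the acoustic model
    to the second feature set together with the LID outputs."""
    lid_out = lid_model(lid_feats)
    return lid_out, acoustic_model(ac_feats, lid_out, lid_weight)


# Made-up training data: (first feature set, second feature set, reference label).
training_set: List[Example] = [
    ([0.9, 0.8], [0.2, 0.4], 0),
    ([0.1, 0.2], [0.3, 0.6], 1),
    ([0.7, 0.9], [0.6, 0.7], 0),
]

tuned_weight = tune_lid_weight(training_set, candidates=[0.0, 0.5, 1.0])
lid_out, label = process([0.2, 0.1], [0.55, 0.3], tuned_weight)
print(tuned_weight, lid_out, label)
```

In this toy setup the acoustic evidence alone mislabels two of the three training examples, so the search settles on a weight that lets the LID outputs influence the decision. A real system would use trained models and a tuning objective suited to the task.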
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.