Patent · US Active

Spoken language recognition

US12406656B2 · kind B2 · utility

Cited by	0

References	0

Claims	20

Family size	0

Assignee

Adobe Inc. · US

Inventors

Oriol NIETO-CABALLERO · Oakland, US
Zeyu Jin · San Francisco, US
Justin Salamon · San Francisco, US
Franck Dernoncourt · Sunnyvale, US

Key dates

Filing date	Feb 1, 2023
Grant date	Sep 2, 2025
Priority date	—
Expiry date	Mar 10, 2044

Classification

Technology area (CPC G)	Physics
CPC primary	G10L25/30
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Some aspects of the technology described herein employ a neural network with an efficient and lightweight architecture to perform spoken language recognition. Given an audio signal comprising speech, features are generated from the audio signal, for instance, by converting the audio signal to a normalized spectrogram. The features are input to the neural network, which has one or more convolutional layers and an output activation layer. Each neuron of the output activation layer corresponds to a language from a set of languages and generates an activation value. Based on the activation values, an indication of zero or more languages from the set of languages is provided for the audio signal.
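
The pipeline in the abstract (audio signal, normalized spectrogram, convolutional layers, one output neuron per language, zero or more languages reported) can be illustrated with a minimal NumPy sketch. It is not the patented implementation. Everything specific here is an assumption made for illustration: the three-language set, the frame and hop sizes, the single 3x3 convolutional layer, global average pooling, sigmoid output activations, the 0.5 threshold, and the random (untrained) weights. The names `normalized_spectrogram`, `conv2d_relu`, and `recognize` are hypothetical.

```python
import numpy as np

LANGUAGES = ["en", "es", "fr"]  # hypothetical language set, not from the patent


def normalized_spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude spectrogram scaled to zero mean and unit variance."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    spec = np.log1p(np.abs(np.fft.rfft(frames, axis=1))).T  # (freq, time)
    return (spec - spec.mean()) / (spec.std() + 1e-8)


def conv2d_relu(x, kernels):
    """Valid 2-D cross-correlation with each kernel, followed by ReLU."""
    k_h, k_w = kernels.shape[1:]
    # patches has shape (H - k_h + 1, W - k_w + 1, k_h, k_w)
    patches = np.lib.stride_tricks.sliding_window_view(x, (k_h, k_w))
    out = np.einsum("hwij,cij->chw", patches, kernels)
    return np.maximum(out, 0.0)


def recognize(signal, params, threshold=0.5):
    """Return (detected languages, per-language activation values)."""
    feats = normalized_spectrogram(signal)
    maps = conv2d_relu(feats, params["kernels"])
    pooled = maps.mean(axis=(1, 2))  # global average pool: one value per channel
    logits = params["w_out"] @ pooled + params["b_out"]
    # Assumed sigmoid output: each neuron scores its language independently,
    # so none, one, or several languages can exceed the threshold.
    activations = 1.0 / (1.0 + np.exp(-logits))
    detected = [lang for lang, a in zip(LANGUAGES, activations) if a >= threshold]
    return detected, activations


# Untrained random weights and a noise signal, only to show the data flow.
rng = np.random.default_rng(0)
params = {
    "kernels": rng.normal(size=(4, 3, 3)) * 0.1,
    "w_out": rng.normal(size=(len(LANGUAGES), 4)),
    "b_out": np.zeros(len(LANGUAGES)),
}
signal = rng.normal(size=16000)
detected, activations = recognize(signal, params)
```

Independent per-language activations with a threshold are one way to obtain "zero or more languages": an utterance in none of the supported languages can yield an empty list, and mixed-language speech can yield several entries. A softmax output, by contrast, would always favor exactly one language.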

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.