Systems and methods for audio transcription switching based on real-time identification of languages in an audio stream
US12361924B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Dec 28, 2022 |
| Grant date | Jul 15, 2025 |
| Priority date | — |
| Expiry date | Nov 30, 2043 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/063
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed is a multi-language translation system and associated methods that adapt to users speaking different languages, and that convert each spoken language to a target language. The system trains a neural network using audio of different speakers speaking different languages, and generates vectors with different sets of audio features that identify each of the different languages. The system receives an audio stream, transcribes a first snippet from a first language to the target language based on a first vector classifying the first audio snippet features to the first language, transcribes a second audio snippet from a new language to the target language based on the first vector being unable to classify the second audio snippet features to the first language, and transcribes a third audio snippet from a second language to the target language based on a second vector classifying the third audio snippet to the second language.
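The switching behavior described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the feature vectors, the cosine-similarity test, the `THRESHOLD` value, and the names `LANGUAGE_VECTORS` and `route_snippets` are all assumptions chosen for illustration. In the patent, the per-language vectors are produced by a trained neural network; here they are hard-coded toy values, and the transcription step itself is omitted.

```python
import math

# Hypothetical per-language feature vectors (toy values). In the patent these
# are generated by a neural network trained on multi-speaker audio.
LANGUAGE_VECTORS = {
    "en": [0.9, 0.1, 0.0],
    "es": [0.1, 0.9, 0.1],
    "de": [0.0, 0.2, 0.9],
}

THRESHOLD = 0.8  # assumed similarity cutoff for "classifies to this language"


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def route_snippets(snippets, initial_language,
                   vectors=LANGUAGE_VECTORS, threshold=THRESHOLD):
    """Return the language chosen for each snippet's feature vector.

    The active language is kept while its vector still classifies the snippet.
    When it cannot, the remaining vectors are searched and the best match
    becomes the new active language, provided it clears the threshold.
    """
    active = initial_language
    decisions = []
    for features in snippets:
        if cosine(features, vectors[active]) < threshold:
            # The active vector cannot classify this snippet: look for a
            # different language whose vector does.
            best = max(
                (lang for lang in vectors if lang != active),
                key=lambda lang: cosine(features, vectors[lang]),
            )
            if cosine(features, vectors[best]) >= threshold:
                active = best
        decisions.append(active)
    return decisions


# Toy stream: one English-like snippet followed by two Spanish-like snippets.
stream = [
    [0.8, 0.2, 0.1],
    [0.2, 0.8, 0.0],
    [0.1, 0.9, 0.2],
]
print(route_snippets(stream, "en"))  # ['en', 'es', 'es']
```

In this sketch the second snippet triggers the switch because the English vector fails to classify it, and the third snippet is then matched directly against the Spanish vector, mirroring the three-snippet sequence in the abstract. A snippet that matches no vector leaves the active language unchanged, which is one possible fallback among several.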
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.