Systems and methods for aligning lyrics using a neural network
US11308943B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 12, 2019 |
| Grant date | Apr 19, 2022 |
| Priority date | — |
| Expiry date | Jan 13, 2040 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/226
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An electronic device receives audio data for a media item. The electronic device generates, from the audio data, a plurality of samples, each sample having a predefined maximum length. The electronic device, using a neural network trained to predict character probabilities, generates a probability matrix of characters for a first portion of a first sample of the plurality of samples. The probability matrix includes character information, timing information, and respective probabilities of respective characters at respective times. The electronic device identifies, for the first portion of the first sample, a first sequence of characters based on the generated probability matrix.
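As an illustration only, the pipeline in the abstract can be sketched in a few lines of Python. The abstract does not say how the character sequence is derived from the probability matrix, so this sketch substitutes a common greedy, CTC-style decoding (take the most likely character per time frame, collapse repeats, drop a blank symbol). The function names, the `blank` symbol, the `frame_seconds` parameter, and the toy matrix are all hypothetical and are not taken from the patent.

```python
from typing import List, Sequence, Tuple


def split_into_samples(audio: Sequence[float], max_len: int) -> List[Sequence[float]]:
    """Cut audio into consecutive samples of at most max_len values each."""
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    return [audio[i:i + max_len] for i in range(0, len(audio), max_len)]


def greedy_decode(
    prob_matrix: Sequence[Sequence[float]],
    alphabet: Sequence[str],
    frame_seconds: float,
    blank: str = "_",
) -> List[Tuple[str, float]]:
    """Pick the most probable character per frame and attach its start time.

    prob_matrix[t][c] is the probability of alphabet[c] at time frame t.
    Assumption: repeated characters are collapsed and the blank symbol is
    dropped, as in greedy CTC decoding. The patent abstract does not
    specify this step.
    """
    decoded: List[Tuple[str, float]] = []
    previous = None
    for t, row in enumerate(prob_matrix):
        best_index = max(range(len(row)), key=row.__getitem__)
        char = alphabet[best_index]
        if char != blank and char != previous:
            decoded.append((char, t * frame_seconds))
        previous = char
    return decoded


# Toy example: 4 time frames over a 3-symbol alphabet (hypothetical values).
alphabet = ["_", "h", "i"]
matrix = [
    [0.10, 0.80, 0.10],  # frame 0: "h"
    [0.20, 0.70, 0.10],  # frame 1: "h" again, collapsed
    [0.90, 0.05, 0.05],  # frame 2: blank
    [0.10, 0.10, 0.80],  # frame 3: "i"
]
aligned = greedy_decode(matrix, alphabet, frame_seconds=0.5)
```

Here `aligned` pairs each decoded character with the start time of the frame where it first appears, which corresponds to the character and timing information the abstract attributes to the probability matrix. A real system would obtain the matrix from the trained neural network rather than from hand-written values.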
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.