Systems and methods for aligning lyrics using a neural network
US11475887B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 21, 2019 |
| Grant date | Oct 18, 2022 |
| Priority date | — |
| Expiry date | May 31, 2040 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/183
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An electronic device receives audio data for a media item. The electronic device generates, from the audio data, a plurality of samples, each sample having a predefined maximum length. The electronic device, using a neural network trained to predict textual unit probabilities, generates a probability matrix of textual units for a first portion of a first sample of the plurality of samples. The probability matrix includes information about textual units, timing information, and respective probabilities of respective textual units at respective times. The electronic device identifies, for the first portion of the first sample, a first sequence of textual units based on the generated probability matrix.
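The two data-handling steps in the abstract (cutting the audio into samples of bounded length, then reading a sequence of textual units off a probability matrix) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the function names `split_into_samples` and `decode_greedy`, the blank symbol `"_"`, the fixed `frame_seconds` step, and the greedy per-frame decoding rule are not taken from the patent, which does not state how the sequence is identified. The neural network itself is not modeled; the probability matrix is supplied by hand.

```python
from typing import List, Sequence, Tuple


def split_into_samples(audio: Sequence[float], max_len: int) -> List[Sequence[float]]:
    """Cut audio into consecutive samples of at most max_len values each."""
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    return [audio[i:i + max_len] for i in range(0, len(audio), max_len)]


def decode_greedy(
    prob_matrix: Sequence[Sequence[float]],
    units: Sequence[str],
    frame_seconds: float,
    blank: str = "_",
) -> List[Tuple[str, float, float]]:
    """Read a unit sequence off a (frames x units) probability matrix.

    Assumed rule (not from the patent): take the most probable unit in
    each frame, merge consecutive repeats, and drop the blank symbol.
    Returns (unit, start_time_in_seconds, probability) triples.
    """
    result: List[Tuple[str, float, float]] = []
    previous = None
    for frame_index, row in enumerate(prob_matrix):
        best = max(range(len(row)), key=row.__getitem__)
        unit = units[best]
        if unit != previous and unit != blank:
            result.append((unit, frame_index * frame_seconds, row[best]))
        previous = unit
    return result


if __name__ == "__main__":
    # Hypothetical toy input: 10 audio values, samples of at most 4 values.
    print(split_into_samples(list(range(10)), 4))

    # Hypothetical matrix: one row per time frame, one column per unit.
    units = ["_", "h", "i"]
    matrix = [
        [0.1, 0.8, 0.1],
        [0.2, 0.7, 0.1],
        [0.9, 0.05, 0.05],
        [0.1, 0.1, 0.8],
    ]
    print(decode_greedy(matrix, units, frame_seconds=0.5))
```

Each row of the matrix stands for one time step, which is how the sketch carries the "timing information" mentioned in the abstract: the row index multiplied by the frame duration gives the time at which a unit starts. A real system would likely use a more careful decoding or alignment step than this per-frame maximum.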
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.