Neural pitch-shifting and time-stretching
US11915714B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Dec 21, 2021 |
| Grant date | Feb 27, 2024 |
| Priority date | — |
| Expiry date | Jan 11, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/0135
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods for modifying audio data include operations for accessing audio data having a first prosody, receiving a target prosody differing from the first prosody, and computing acoustic features representing samples. Computing respective acoustic features for a sample includes computing a pitch feature as a quantized pitch value of the sample by assigning a pitch value, of the target prosody or the audio data, to at least one of a set of pitch bins having equal widths in cents. Computing the respective acoustic features further includes computing a periodicity feature from the audio data. The respective acoustic features for the sample include the pitch feature, the periodicity feature, and other acoustic features. A neural vocoder is applied to the acoustic features to pitch-shift and time-stretch the audio data from the first prosody toward the target prosody.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.