Patent · US Active

Neural pitch-shifting and time-stretching

US11915714B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignees

Inventors

Key dates

Filing dateDec 21, 2021
Grant dateFeb 27, 2024
Priority date
Expiry dateJan 11, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/0135
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods for modifying audio data include operations for accessing audio data having a first prosody, receiving a target prosody differing from the first prosody, and computing acoustic features representing samples. Computing respective acoustic features for a sample includes computing a pitch feature as a quantized pitch value of the sample by assigning a pitch value, of the target prosody or the audio data, to at least one of a set of pitch bins having equal widths in cents. Computing the respective acoustic features further includes computing a periodicity feature from the audio data. The respective acoustic features for the sample include the pitch feature, the periodicity feature, and other acoustic features. A neural vocoder is applied to the acoustic features to pitch-shift and time-stretch the audio data from the first prosody toward the target prosody.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.