Patent · US Active

Augmentation of audiographic images for improved machine learning

US11138471B2 · kind B2 · utility

1Cited by
1References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 20, 2019
Grant dateOct 5, 2021
Priority date
Expiry dateDec 12, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/0135
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Generally, the present disclosure is directed to systems and methods that generate augmented training data for machine-learned models via application of one or more augmentation techniques to audiographic images that visually represent audio signals. In particular, the present disclosure provides a number of novel augmentation operations which can be performed directly upon the audiographic image (e.g., as opposed to the raw audio data) to generate augmented training data that results in improved model performance. As an example, the audiographic images can be or include one or more spectrograms or filter bank sequences.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.