Patent · US Active

Augmentation of audiographic images for improved machine learning

US11816577B2 · kind B2 · utility

Cited by: 0
References: 3
Claims: 22
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 28, 2021
Grant date: Nov 14, 2023
Priority date
Expiry date: Sep 28, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L2021/0135
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Generally, the present disclosure is directed to systems and methods that generate augmented training data for machine-learned models via application of one or more augmentation techniques to audiographic images that visually represent audio signals. In particular, the present disclosure provides a number of novel augmentation operations which can be performed directly upon the audiographic image (e.g., as opposed to the raw audio data) to generate augmented training data that results in improved model performance. As an example, the audiographic images can be or include one or more spectrograms or filter bank sequences.
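
As an illustration of what operating directly on the audiographic image could look like, the following is a minimal sketch and not the patented method. The abstract does not name specific operations, so the choice of masking (zeroing one band of frequency bins and one span of time frames) is an assumption made for this example. The function name `mask_spectrogram` and its parameters are hypothetical. The spectrogram is assumed to be a list of time frames, each a list of frequency-bin values.

```python
import random


def mask_spectrogram(spec, max_freq_width=2, max_time_width=3, seed=None):
    """Return a masked copy of `spec` (hypothetical helper, illustrative only).

    `spec` is assumed to be a non-empty list of time frames, each frame a
    list of frequency-bin values of equal length. One random band of
    frequency bins and one random span of time frames are set to 0.0.
    """
    rng = random.Random(seed)
    num_frames = len(spec)
    num_bins = len(spec[0])

    # Copy every frame so the original image is left untouched.
    out = [list(frame) for frame in spec]

    # Frequency mask: zero the same band of bins in every frame.
    f_width = rng.randint(0, min(max_freq_width, num_bins))
    f_start = rng.randint(0, num_bins - f_width)
    for frame in out:
        for f in range(f_start, f_start + f_width):
            frame[f] = 0.0

    # Time mask: zero every bin in a span of consecutive frames.
    t_width = rng.randint(0, min(max_time_width, num_frames))
    t_start = rng.randint(0, num_frames - t_width)
    for t in range(t_start, t_start + t_width):
        out[t] = [0.0] * num_bins

    return out


# Toy "spectrogram": 6 time frames x 4 frequency bins, all ones.
original = [[1.0] * 4 for _ in range(6)]
augmented = mask_spectrogram(original, seed=0)
```

In a training pipeline, a function like this would typically be applied on the fly to each example, so the model sees a differently masked image on every pass while the stored audio and labels stay unchanged.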

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.