Augmentation of audiographic images for improved machine learning
US11816577B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 28, 2021 |
| Grant date | Nov 14, 2023 |
| Priority date | — |
| Expiry date | Sep 28, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/0135
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Generally, the present disclosure is directed to systems and methods that generate augmented training data for machine-learned models via application of one or more augmentation techniques to audiographic images that visually represent audio signals. In particular, the present disclosure provides a number of novel augmentation operations which can be performed directly upon the audiographic image (e.g., as opposed to the raw audio data) to generate augmented training data that results in improved model performance. As an example, the audiographic images can be or include one or more spectrograms or filter bank sequences.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.