Patent · US Active

System and method for synthetic audio generation

US11893305B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 6
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 3, 2022
Grant date: Feb 6, 2024
Priority date
Expiry date: Sep 29, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10H2250/395
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Embodiments provide a method and system for audio generation from contextual text input. The disclosure gives due importance to the granularity of the content. The system allows the user to specify the properties of the audio to be generated. Context is used to identify the importance of a particular sound relative to the others, and the audio output is adjusted automatically to give a more realistic feel. The system also generates datasets for training audio models: the user can submit an input query in natural language, and the requested audio is generated for training and developing classification or other audio models. The system provides automated fine-tuning of the model parameters to suit the new, automatically collected training data. Furthermore, the system provides a pre-trained, built-in model repository with audio models for the main categories of noise.
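
The abstract's idea of using context to rank one sound above the others and adjusting the output accordingly can be illustrated with a small sketch. The abstract does not disclose the patented method, so this is not it. The sketch assumes each sound already has a numeric importance score (how the patent derives such scores from text is not stated here) and swaps in a plain weighted sum as the adjustment. The names mix_by_importance and tone, the example labels, and the scores are hypothetical.

```python
import math


def mix_by_importance(sources, importance):
    """Mix equal-length mono signals with normalized importance weights.

    sources:    dict mapping a sound label to a list of float samples
    importance: dict mapping the same labels to non-negative scores
                (hypothetical; assumed to come from the text context)
    Returns (mixed_samples, weights).
    """
    if not sources:
        raise ValueError("at least one source is required")
    lengths = {len(samples) for samples in sources.values()}
    if len(lengths) != 1:
        raise ValueError("all sources must have the same length")

    total = sum(importance[name] for name in sources)
    if total <= 0:
        raise ValueError("importance scores must sum to a positive value")

    # Normalize so the weights sum to 1; the mix then stays within the
    # amplitude range of the inputs.
    weights = {name: importance[name] / total for name in sources}

    mixed = [0.0] * lengths.pop()
    for name, samples in sources.items():
        w = weights[name]
        for i, x in enumerate(samples):
            mixed[i] += w * x
    return mixed, weights


def tone(freq_hz, n_samples, rate=8000):
    """Generate a sine tone as a stand-in for a real sound source."""
    return [math.sin(2 * math.pi * freq_hz * i / rate) for i in range(n_samples)]


# Assumed scenario: the context makes the siren three times as
# important as the background traffic.
sources = {"siren": tone(880.0, 80), "traffic": tone(110.0, 80)}
mixed, weights = mix_by_importance(sources, {"siren": 3.0, "traffic": 1.0})
```

Normalizing the scores keeps the mixed signal within the amplitude range of its inputs. A real system would more likely adjust gain over time or per frequency band rather than use a single static weight per source.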

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.