Deep learning-based automatic detection and labeling of dynamic advertisements in long-form audio content
US12190871B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Sep 7, 2021 |
| Grant date | Jan 7, 2025 |
| Priority date | — |
| Expiry date | Jun 21, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/26
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Techniques and methods are disclosed for detecting long-form audio content in one or more audio files. A computing system receives first audio data corresponding to a first version of an audio file and second audio data corresponding to a second version of the audio file. The computing system generates a first transcript of the first audio data and a second transcript of the second audio data. The computing system compares the first audio data and the second audio data and the first transcript and the second transcript to identify advertisement portions and content portions of the audio data. Using a semantic model based on a machine learning (ML) transformer, the computing system can determine advertisement segments within the advertisement portions, the advertisement segments corresponding to separate advertisements. Information corresponding to the duration and location of the advertisement segments is stored in a data store of the computing system.
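The transcript-comparison step described in the abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes a hypothetical transcript format of `(word, start_seconds, end_seconds)` tuples, uses Python's standard `difflib.SequenceMatcher` as a stand-in for the comparison, and leaves out both the audio-level comparison and the transformer-based semantic model that splits an advertisement portion into separate advertisements. The function name `candidate_ad_portions` and the sample sentences are invented for illustration.

```python
from difflib import SequenceMatcher


def candidate_ad_portions(transcript_a, transcript_b):
    """Return spans of transcript_a whose words do not align with transcript_b.

    Each transcript is a list of (word, start_seconds, end_seconds) tuples
    (an assumed format, not one taken from the patent).
    """
    words_a = [word for word, _, _ in transcript_a]
    words_b = [word for word, _, _ in transcript_b]
    matcher = SequenceMatcher(a=words_a, b=words_b, autojunk=False)

    portions = []
    for tag, a0, a1, _b0, _b1 in matcher.get_opcodes():
        # "replace" and "delete" mark words of version A with no match in B.
        if tag in ("replace", "delete"):
            start = transcript_a[a0][1]
            end = transcript_a[a1 - 1][2]
            portions.append({"start": start, "duration": end - start})
    return portions


def toy_transcript(text):
    # Pretend every word lasts exactly one second.
    return [(word, i, i + 1) for i, word in enumerate(text.split())]


first = toy_transcript(
    "welcome to the show try acme coffee today now back to the story"
)
second = toy_transcript(
    "welcome to the show visit zen bank for loans now back to the story"
)

portions = candidate_ad_portions(first, second)
print(portions)  # -> [{'start': 4, 'duration': 4}]
```

Text shared by both versions aligns as content, while the span that differs is flagged as a candidate advertisement portion with its location and duration, which is the kind of information the abstract says is written to the data store.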
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.