Audio file annotation
US11893990B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Sep 27, 2021 |
| Grant date | Feb 6, 2024 |
| Priority date | — |
| Expiry date | Jul 1, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/223
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Text-to-speech translation is used to generate a transcript for an audio file. Text segments are associated with time segments in the transcript. A trained machine learning model determines, based on the text in the transcript, one or more topics for the audio file. The transcript is modified to include the determined one or more topics. A user interface may be presented that allows a user to search for portions of an audio file that relate to a particular topic. In response to the selected or entered topic, the user interface presents segments having a matching topic. The user may use voice or other user interface commands to modify the annotation of the audio file. User commands may also be used to extract data from the transcript and copy the data to a clipboard or to another application.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.