Speech-to-text processing based on a time-ordered classification of audio file segments
US8423361B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 14, 2012 |
| Grant date | Apr 16, 2013 |
| Priority date | — |
| Expiry date | Mar 14, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/81
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
This specification describes technologies relating to multi core processing for parallel speech-to-text processing. In some implementations, a computer-implemented method is provided that includes the actions of receiving an audio file; analyzing the audio file to identify portions of the audio file as corresponding to one or more audio types; generating a time-ordered classification of the identified portions, the time-ordered classification indicating the one or more audio types and position within the audio file of each portion; generating a queue using the time-ordered classification, the queue including a plurality of jobs where each job includes one or more identifiers of a portion of the audio file classified as belonging to the one or more speech types; distributing the jobs in the queue to a plurality of processors; performing speech-to-text processing on each portion to generate a corresponding text file; and merging the corresponding text files to generate a transcription file.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.