Patent · US Active

Speech-to-text processing based on a time-ordered classification of audio file segments

US8423361B1 · kind B1 · utility

13Cited by

6References

27Claims

0Family size

Assignee

Adobe Systems Incorporated · US

Inventors

Walter Chang · San Jose, US
Michael J. Welch · Los Angeles, US

Key dates

Filing date	Mar 14, 2012
Grant date	Apr 16, 2013
Priority date	—
Expiry date	Mar 14, 2032

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/81
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

This specification describes technologies relating to multi core processing for parallel speech-to-text processing. In some implementations, a computer-implemented method is provided that includes the actions of receiving an audio file; analyzing the audio file to identify portions of the audio file as corresponding to one or more audio types; generating a time-ordered classification of the identified portions, the time-ordered classification indicating the one or more audio types and position within the audio file of each portion; generating a queue using the time-ordered classification, the queue including a plurality of jobs where each job includes one or more identifiers of a portion of the audio file classified as belonging to the one or more speech types; distributing the jobs in the queue to a plurality of processors; performing speech-to-text processing on each portion to generate a corresponding text file; and merging the corresponding text files to generate a transcription file.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.