Patent · US Active

Recognition of speech in editable audio streams

US7869996B2 · kind B2 · utility

17Cited by

5References

24Claims

0Family size

Assignee

Multimodal Technologies, LLC · US

Inventors

Eric Carraux · Pittsburgh, US
Detlef Koll · Pittsburgh, US

Key dates

Filing date	Nov 23, 2007
Grant date	Jan 11, 2011
Priority date	—
Expiry date	Aug 12, 2029

Classification

Technology area (CPC G)Physics
CPC primaryG11B27/105
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

A speech processing system divides a spoken audio stream into partial audio streams, referred to as “snippets.” The system may divide a portion of the audio stream into two snippets at a position at which the speaker performed an editing operation, such as pausing and then resuming recording, or rewinding and then resuming recording. The snippets may be transmitted sequentially to a consumer, such as an automatic speech recognizer or a playback device, as the snippets are generated. The consumer may process (e.g., recognize or play back) the snippets as they are received. The consumer may modify its output in response to editing operations reflected in the snippets. The consumer may process the audio stream while it is being created and transmitted even if the audio stream includes editing operations that invalidate previously-transmitted partial audio streams, thereby enabling shorter turnaround time between dictation and consumption of the complete audio stream.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.