Patent · US Active

Crowd sourcing audio transcription via re-speaking

US9418660B2 · kind B2 · utility

Cited by	0

References	1

Claims	11

Family size	0

Assignee

Cisco Technology, Inc. · US

Inventors

Matthias Paulik · San Jose, US
Vivek Halder · Cupertino, US
Ananth Sankar · Palo Alto, US

Key dates

Filing date	Jan 15, 2014
Grant date	Aug 16, 2016
Priority date	—
Expiry date	Jun 6, 2034

Classification

Technology area (CPC G)	Physics
CPC primary	G10L25/87
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Speech audio intended for transcription into text is received and divided into first speech segments. A plurality of speakers is identified, each configured to repeat in spoken form a first speech segment that the speaker has listened to. For each first speech segment, a subset of the speakers is determined, and the segment is sent to that subset. Second speech segments are received from the speakers; each second speech segment is a re-spoken version of a first speech segment, generated by a speaker repeating that first speech segment in spoken form. The second speech segments are processed to generate partial transcripts, which are combined into a complete transcript that is a textual representation of the received speech audio.
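
The flow described in the abstract can be sketched roughly as follows. This is an illustrative outline only, not the patented implementation: the function names, the fixed-length segmentation, the round-robin speaker assignment, and the majority vote used to merge re-spoken versions are all assumptions introduced here. The `respeak` and `recognize` callables are hypothetical stand-ins for a human re-speaker and a speech recognizer.

```python
from collections import Counter
from typing import Any, Callable, List, Sequence


def split_into_segments(audio: Sequence[Any], segment_len: int) -> List[Sequence[Any]]:
    """Divide the received audio into fixed-length first speech segments (assumed scheme)."""
    return [audio[i:i + segment_len] for i in range(0, len(audio), segment_len)]


def assign_speakers(num_segments: int, speakers: List[str], per_segment: int) -> List[List[str]]:
    """Pick a subset of speakers for each segment, round-robin (assumed policy)."""
    n = len(speakers)
    k = min(per_segment, n)  # never ask for more distinct speakers than exist
    return [[speakers[(i * k + j) % n] for j in range(k)] for i in range(num_segments)]


def vote(hypotheses: List[str]) -> str:
    """Merge several recognized versions of one segment by majority vote (assumed)."""
    return Counter(hypotheses).most_common(1)[0][0]


def transcribe(
    audio: Sequence[Any],
    segment_len: int,
    speakers: List[str],
    per_segment: int,
    respeak: Callable[[str, Sequence[Any]], Any],  # hypothetical: speaker repeats a segment
    recognize: Callable[[Any], str],               # hypothetical: recognizer on re-spoken audio
) -> str:
    """Run the re-speaking pipeline end to end and return the complete transcript."""
    segments = split_into_segments(audio, segment_len)
    plan = assign_speakers(len(segments), speakers, per_segment)
    partials = []
    for segment, subset in zip(segments, plan):
        # Each chosen speaker listens to the first segment and re-speaks it.
        second_segments = [respeak(speaker, segment) for speaker in subset]
        # Each re-spoken (second) segment is processed into a text hypothesis.
        hypotheses = [recognize(s) for s in second_segments]
        partials.append(vote(hypotheses))
    # Combine the partial transcripts in segment order.
    return " ".join(partials)
```

To try the sketch without real audio, a list of words can stand in for the audio samples, with `respeak` returning the segment (or a corrupted copy for a simulated unreliable speaker) and `recognize` joining the words into a string.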

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.