Patent · US Active

Crowd sourcing audio transcription via re-speaking

US9418660B2 · kind B2 · utility

0Cited by
1References
11Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 15, 2014
Grant dateAug 16, 2016
Priority date
Expiry dateJun 6, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/87
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.