Crowd sourcing audio transcription via re-speaking
US9418660B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Jan 15, 2014 |
| Grant date | Aug 16, 2016 |
| Priority date | — |
| Expiry date | Jun 6, 2034 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/87
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Speech audio that is intended for transcription into textual form is received. The received speech audio is divided into first speech segments. A plurality of speakers is identified. A speaker is configured for repeating in spoken form a first speech segment that the speaker has listened to. A subset of speakers is determined for sending each first speech segment. Each first speech segment is sent to the subset of speakers determined for the particular first speech segment. The second speech segments are received from the speakers. The second speech segment is a re-spoken version of a first speech segment that has been generated by a speaker by repeating in spoken form the first speech segment. The second speech segments are processed to generate partial transcripts. The partial transcripts are combined to generate a complete transcript that is a textual representation corresponding to the received speech audio.
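The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name in it (`split_into_segments`, `assign_speakers`, `combine`, `transcribe`, and the `respeak` and `recognize` callables) is hypothetical. The fixed-length segmentation, round-robin speaker assignment, and majority vote over partial transcripts are assumptions chosen for illustration; the abstract does not specify how any of these steps is performed.

```python
from collections import Counter
from typing import Any, Callable, Dict, List, Sequence


def split_into_segments(audio: Sequence[Any], segment_len: int) -> List[Sequence[Any]]:
    """Divide the received audio into first speech segments (assumed fixed length)."""
    return [audio[i:i + segment_len] for i in range(0, len(audio), segment_len)]


def assign_speakers(num_segments: int, speakers: List[str], per_segment: int) -> Dict[int, List[str]]:
    """Pick a subset of speakers for each segment (assumed round-robin policy)."""
    if per_segment > len(speakers):
        raise ValueError("per_segment cannot exceed the number of speakers")
    return {
        seg: [speakers[(seg * per_segment + j) % len(speakers)] for j in range(per_segment)]
        for seg in range(num_segments)
    }


def combine(partials: Dict[int, List[str]]) -> str:
    """Merge partial transcripts: majority vote per segment (an assumption), in segment order."""
    pieces = []
    for idx in sorted(partials):
        winner, _count = Counter(partials[idx]).most_common(1)[0]
        pieces.append(winner)
    return " ".join(pieces)


def transcribe(
    audio: Sequence[Any],
    speakers: List[str],
    respeak: Callable[[str, Sequence[Any]], Any],  # hypothetical: speaker repeats a segment aloud
    recognize: Callable[[Any], str],               # hypothetical: speech recognizer on re-spoken audio
    segment_len: int = 4,
    per_segment: int = 3,
) -> str:
    segments = split_into_segments(audio, segment_len)
    assignment = assign_speakers(len(segments), speakers, per_segment)
    partials: Dict[int, List[str]] = {}
    for idx, segment in enumerate(segments):
        # Each assigned speaker returns a second (re-spoken) segment, which is then recognized.
        partials[idx] = [recognize(respeak(name, segment)) for name in assignment[idx]]
    return combine(partials)
```

In this sketch, sending the same segment to several speakers lets the combination step outvote a single poor re-speaking, which is one plausible reason for assigning a subset rather than a single speaker.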
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.