Methods and apparatus relating to searching of spoken audio data
US8694317B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 6, 2006 |
| Grant date | Apr 8, 2014 |
| Priority date | — |
| Expiry date | Oct 21, 2030 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/025
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods for processing audio data containing speech to produce a searchable index file and for subsequently searching such an index file are provided. The processing method uses a phonetic approach and models each frame of the audio data with a set of reference phones. A score for each of the reference phones, representing the difference of the audio from the phone model, is stored in the searchable data file for each of the phones in the reference set. A consequence of storing information regarding each of the reference phones is that the accuracy of searches carried out on the index file is not compromised by the rejection of information about particular phones. A subsequent search method is also provided which uses a simple and efficient dynamic programming search to locate instances of a search term in the audio. The methods of the present invention have particular application to the field of audio data mining.
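The two steps described in the abstract, indexing every frame against every reference phone and then searching the index with dynamic programming, can be illustrated with a minimal sketch. This is not the patented implementation. The function names `build_index` and `search`, the use of a single mean vector per phone, and the squared Euclidean distance as the "difference" score are all assumptions made for illustration. The search shown is a plain unnormalised alignment with no duration modelling.

```python
import math


def build_index(frames, phone_models):
    """Score every frame against every reference phone.

    frames: list of feature vectors (lists of floats), one per audio frame.
    phone_models: dict mapping phone label -> mean feature vector.
        (Assumption: a real system would use a richer acoustic model.)
    Returns one dict per frame holding a score for ALL phones, so no phone
    is discarded at indexing time. Lower score = closer match.
    """
    index = []
    for frame in frames:
        scores = {
            phone: sum((x - m) ** 2 for x, m in zip(frame, mean))
            for phone, mean in phone_models.items()
        }
        index.append(scores)
    return index


def search(index, phones):
    """Find the lowest-cost occurrence of a phone sequence in the index.

    Each phone in the search term must cover one or more consecutive
    frames, and a match may begin at any frame.
    Returns (cost, start_frame, end_frame); cost is math.inf if the index
    has fewer frames than the term has phones.
    """
    n = len(phones)
    # prev[j] = (cost, start) of the best partial match that ended on the
    # previous frame with phone j as its most recent phone.
    prev = [(math.inf, -1)] * n
    best = (math.inf, -1, -1)
    for t, scores in enumerate(index):
        cur = []
        for j, phone in enumerate(phones):
            stay = prev[j]  # remain in the same phone
            # Enter phone j: a fresh match for the first phone, otherwise
            # advance from the previous phone in the term.
            enter = (0.0, t) if j == 0 else prev[j - 1]
            cost, start = min(stay, enter)
            cur.append((cost + scores[phone], start))
        prev = cur
        # A complete match ends whenever the last phone has been reached.
        if prev[-1][0] < best[0]:
            best = (prev[-1][0], prev[-1][1], t)
    return best
```

Because the index keeps a score for every phone at every frame, `search` can evaluate any phone sequence after the fact without returning to the audio, which is the property the abstract highlights. A practical system would likely normalise the cost by match length and return all matches under a threshold rather than a single best one.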
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.