Text-independent speaker recognition system and method based on acoustic segment matching
US4773093A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Dec 31, 1984 |
| Grant date | Sep 20, 1988 |
| Priority date | — |
| Expiry date | Dec 31, 2004 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L17/02
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The invention provides a method and system for speaker enrollment, as well as for speaker recognition. Speaker enrollment creates for each candidate speaker a set of short acoustic segments, or templates, of phonemic duration. An equal number of templates is derived from every candidate speaker's training utterance. A speaker's template set serves as a model for that speaker. Recognition is accomplished by employing a continuous speech recognition (CSR) system to match the recognition utterance with each speaker's template set in turn. The system selects the speaker whose templates match the recognition utterance most closely, that is, the speaker whose CSR match score is lowest. The method of the invention incorporates the entire training utterance in each speaker model, and explains the entire test utterance. The method of the invention models individual short segments of the speech utterances as well as their long-term statistics. Both static and dynamic speaker characteristics are captured in the speaker models.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.