Adaptation of a speech recognition system across multiple remote sessions with a speaker
US6766295B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 10, 1999 |
| Grant date | Jul 20, 2004 |
| Priority date | — |
| Expiry date | May 10, 2019 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/0638
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A technique for adaptation of a speech recognizing system across multiple remote communication sessions with a speaker. The speaker can be a telephone caller. An acoustic model is utilized for recognizing the speaker's speech. Upon initiation of a first remote session with the speaker, the acoustic model is speaker-independent. During the first session, the speaker is uniquely identified and speech samples are obtained from the speaker. In the preferred embodiment, the samples are obtained without requiring the speaker to engage in a training session. The acoustic model is then modified based upon the samples thereby forming a modified model. The model can be modified during the session or after the session is terminated. Upon termination of the session, the modified model is then stored in association with an identification of the speaker. During a subsequent remote session, the speaker is identified and, then, the modified acoustic model is utilized to recognize the speaker's speech. Additional speech samples are obtained during the subsequent session and, then, utilized to further modify the acoustic model. In this manner, an acoustic model utilized for recognizing the speech of…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.