Speaker recognition method through emotional model synthesis based on neighbors preserving principle
US9355642B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 4, 2012 |
| Grant date | May 31, 2016 |
| Priority date | — |
| Expiry date | Mar 20, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/063
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speaker recognition method through emotional model synthesis based on Neighbors Preserving Principle is enclosed. The methods includes the following steps: (1) training the reference speaker's and user's speech models; (2) extracting the neutral-to-emotion transformation/mapping sets of GMM reference models; (3) extracting the emotion reference Gaussian components mapped by or corresponding to several Gaussian neutral reference Gaussian components close to the user's neutral training Gaussian component; (4) synthesizing the user's emotion training Gaussian component and then synthesizing the user's emotion training model; (5) synthesizing all user's GMM training models; (6) inputting test speech and conducting the identification. This invention extracts several reference speeches similar to the neutral training speech of a user from a speech library by employing neighbor preserving principles based on KL divergence and combines an emotion training speech of the user using the emotion reference speech in the reference speech, improving the performance of the speaker recognition system in the situation where the training speech and the test speech are mismatched, and the robustness…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.