Patent · US Active

Speaker recognition method through emotional model synthesis based on neighbors preserving principle

US9355642B2 · kind B2 · utility

0Cited by

0References

6Claims

0Family size

Assignee

ZHEJIANG UNIVERSITY · CN

Inventors

Zhaohui Wu · Longbeilingcun, CN
Yingchun Yang · Hexi District, CN
Li Chen · Village of La Jolla, US

Key dates

Filing date	Sep 4, 2012
Grant date	May 31, 2016
Priority date	—
Expiry date	Mar 20, 2033

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/063
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speaker recognition method through emotional model synthesis based on Neighbors Preserving Principle is enclosed. The methods includes the following steps: (1) training the reference speaker's and user's speech models; (2) extracting the neutral-to-emotion transformation/mapping sets of GMM reference models; (3) extracting the emotion reference Gaussian components mapped by or corresponding to several Gaussian neutral reference Gaussian components close to the user's neutral training Gaussian component; (4) synthesizing the user's emotion training Gaussian component and then synthesizing the user's emotion training model; (5) synthesizing all user's GMM training models; (6) inputting test speech and conducting the identification. This invention extracts several reference speeches similar to the neutral training speech of a user from a speech library by employing neighbor preserving principles based on KL divergence and combines an emotion training speech of the user using the emotion reference speech in the reference speech, improving the performance of the speaker recognition system in the situation where the training speech and the test speech are mismatched, and the robustness…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.