Patent · US Active

Speaker identity and content de-identification

US11217223B2 · kind B2 · utility

6 Cited by

9 References

20 Claims

0 Family size

Assignee

International Business Machines Corporation · US

Inventors

Aris Gkoulalas-Divanis · Waltham, US
Xu Wang · Beijing, CN
Paul R. Bastide · Boxford, US
Rohit Ranchal · Austin, US

Key dates

Filing date	Apr 28, 2020
Grant date	Jan 4, 2022
Priority date	—
Expiry date	Aug 12, 2040

Classification

Technology area (CPC H)	Electricity
CPC primary	H04K1/00
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

One embodiment of the invention provides a method for speaker identity and content de-identification under privacy guarantees. The method comprises receiving input indicative of privacy protection levels to enforce, extracting features from a speech recorded in a voice recording, recognizing and extracting textual content from the speech, parsing the textual content to recognize privacy-sensitive personal information about an individual, generating de-identified textual content by anonymizing the personal information to an extent that satisfies the privacy protection levels and conceals the individual's identity, and mapping the de-identified textual content to a speaker who delivered the speech. The method further comprises generating a synthetic speaker identity based on other features that are dissimilar from the features to an extent that satisfies the privacy protection levels, and synthesizing a new speech waveform based on the synthetic speaker identity to deliver the de-identified textual content. The new speech waveform conceals the speaker's identity.
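
The text-anonymization and synthetic-identity steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The regex patterns, the `known_names` list, the use of cosine distance as the dissimilarity measure, and the mapping of a privacy protection level to a single `min_distance` threshold are all assumptions made for illustration. Feature extraction from audio, speech recognition, and waveform synthesis are outside the sketch, so the speaker features are passed in as a plain list of floats.

```python
import math
import random
import re

# Hypothetical patterns for privacy-sensitive content; a real system would
# likely rely on a trained recognizer rather than two regular expressions.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def deidentify_text(text, known_names=()):
    """Replace recognized personal information with category placeholders."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    for name in known_names:  # assumed to be supplied by the caller
        text = re.sub(rf"\b{re.escape(name)}\b", "[NAME]", text)
    return text


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / norm


def synthetic_identity(features, min_distance, seed=0, max_tries=1000):
    """Sample a feature vector at least `min_distance` away from `features`.

    `min_distance` stands in for the privacy protection level (assumption):
    a larger value demands a synthetic voice less similar to the original.
    """
    rng = random.Random(seed)
    for _ in range(max_tries):
        candidate = [rng.gauss(0.0, 1.0) for _ in features]
        if cosine_distance(features, candidate) >= min_distance:
            return candidate
    raise ValueError("no sufficiently dissimilar identity found")
```

In this sketch, a caller would pass the recognized transcript through `deidentify_text`, derive a replacement voice with `synthetic_identity`, and hand both to a speech synthesizer of its choice. Rejection sampling is used only because it is easy to verify; the abstract does not say how dissimilar features are actually generated.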

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.