Patent · US Active

Classification-based redaction in natural language text

US8938386B2 · kind B2 · utility

8Cited by

10References

23Claims

0Family size

Assignee

ACCENTURE GLOBAL SERVICES LIMITED · IE

Inventors

Chad Cumby · Chicago, US
Rayid Ghani · Chicago, US

Key dates

Filing date	Mar 15, 2011
Grant date	Jan 20, 2015
Priority date	—
Expiry date	Nov 20, 2033

Classification

Technology area (CPC G)Physics
CPC primaryG06F40/279
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

When redacting natural language text, a classifier is used to provide a sensitive concept model according to features in natural language text and in which the various classes employed are sensitive concepts reflected in the natural language text. Similarly, the classifier is used to provide an utility concepts model based on utility concepts. Based on these models, and for one or more identified sensitive concept and identified utility concept, at least one feature in the natural language text is identified that implicates the at least one identified sensitive topic more than the at least one identified utility concept. At least some of the features thus identified may be perturbed such that the modified natural language text may be provided as at least one redacted document. In this manner, features are perturbed to maximize classification error for sensitive concepts while simultaneously minimizing classification error in the utility concepts.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.