Patent · US Active

Identifying confidential data in a data item by comparing the data item to similar data items from alternative sources

US9489376B2 · kind B2 · utility

12Cited by
9References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 2, 2013
Grant dateNov 8, 2016
Priority date
Expiry dateDec 15, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F21/6245
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method, apparatus and computer program product to identify confidential information in a document. To examine a document for inclusion of confidential information, the document is compared against documents having similar structure and content from one or more other sources. When comparing documents (of similar structure and content) from different sources, confidential information is then made to stand out by searching for terms (from the sources) that are not shared between or among them. In contrast, common words or terms that are shared across the sources are ignored as likely being non-confidential information; what remains as not shared may then be classified as confidential information and protected accordingly (e.g., by omission, redaction, substitution or the like). Using this technique, non-confidential information may be safely segmented from confidential information in a dynamic, automated manner.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.