Patent · US Expired

Identifying history of modification within large collections of unstructured data

US7490116B2 · kind B2 · utility

7Cited by
7References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 17, 2003
Grant dateFeb 10, 2009
Priority date
Expiry dateDec 17, 2023

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99956
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A technique for efficient representation of dependencies between electronically-stored documents, such as in an enterprise data processing system. A document distribution path is developed as a directional graph that is a representation of the historic dependencies between documents, which is constructed in real time as documents are created. The system preferably maintains a lossy hierarchical representation of the documents indexed in such a way that allows for fast queries for similar but not necessarily equivalent documents. A distribution path, coupled with a document similarity service, can be used to provide a number of applications, such as a security solution that is capable of finding and restricting access to documents that contain information that is similar to other existing files that are known to contain sensitive information.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.