Efficient and phased method of processing large collections of electronic data known as “best match first”™ for electronic discovery and other related applications
US8819021B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 28, 2008 |
| Grant date | Aug 26, 2014 |
| Priority date | — |
| Expiry date | May 31, 2029 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/3331
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for more efficient, phased, iterative processing of very large collections of electronic data for electronic discovery and related applications is disclosed. At a minimum, the processing includes text extraction and the creation of a keyword search index, but it may include many additional stages as well. The method further includes defining an initial set of characteristics that correspond to “interesting” data, then iteratively completing the processing of this data based on a combination of user feedback on the overall relevance of the documents being processed and the system's assessment of whether the data it has recently promoted in the processing completion queue has the desired quality and quantity of relevant data. The process continues until all identified data has either been fully processed or been discarded at some intermediate stage of processing as likely irrelevant. As a result, the processing is effectively finished much earlier, because the documents remaining later in the processing queue are increasingly unlikely to be relevant.
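The loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Item`, `score`, and `best_match_first`, the additive feature weights, the `step` size, and the `discard_below` threshold are all assumptions made for illustration. Appending to `processed` stands in for the real processing stages (text extraction, keyword indexing), and the `review` callback stands in for user relevance feedback.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    """A hypothetical unit of data with descriptive characteristics."""
    name: str
    features: frozenset  # e.g. {"custodian:alice", "type:email"}


def score(item, weights):
    """Sum the current weights of the item's characteristics."""
    return sum(weights.get(f, 0.0) for f in item.features)


def best_match_first(items, weights, review, discard_below=0.0,
                     batch_size=1, step=0.5):
    """Process the most promising items first, re-ranking after feedback.

    weights: initial characteristics of "interesting" data (mutated in place).
    review:  callable returning True if a processed item was relevant.
    Returns (processed_names, discarded_names) in the order handled.
    """
    pending = list(items)
    processed, discarded = [], []
    while pending:
        # Re-rank the remaining queue with the latest weights.
        pending.sort(key=lambda it: score(it, weights), reverse=True)
        batch, pending = pending[:batch_size], pending[batch_size:]
        for it in batch:
            if score(it, weights) < discard_below:
                # Judged likely irrelevant: drop before full processing.
                discarded.append(it.name)
                continue
            processed.append(it.name)  # placeholder for extraction + indexing
            # Feedback shifts the weight of every characteristic on this item.
            delta = step if review(it) else -step
            for f in it.features:
                weights[f] = weights.get(f, 0.0) + delta
    return processed, discarded
```

In this sketch, positive feedback on one item raises the rank of pending items that share its characteristics, while negative feedback can push similar items below the threshold so they are discarded without being fully processed. A production system would likely use a richer relevance model and a persistent queue; the simple additive weights here are only a stand-in.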
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.