Fingerprinting based entity extraction
US7950062B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 3, 2007 |
| Grant date | May 24, 2011 |
| Priority date | — |
| Expiry date | Mar 21, 2030 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F21/55
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system (and a method) is disclosed for fingerprinting based entity extraction using a rolling hash technique. The system is configured to receive an input stream of a predetermined length comprising characters, and a hash table having indexed entries. The system isolates, through a defined fixed window length, a set of characters of the input stream. A hash key is generated and used to index into the hash table. The system compares the isolated set of characters of the input stream with the entry corresponding to the index into the hash table to determine whether there is an exact match with the entry. The system slides the fixed window length one character to isolate another set of characters of the input stream in response to no exact match from the comparison. Alternatively, the system stores the input stream in response to an exact match from the comparison.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.