Fingerprinting based entity extraction
US8490203B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 11, 2011 |
| Grant date | Jul 16, 2013 |
| Priority date | — |
| Expiry date | Sep 8, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F21/55
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system (and a method) is disclosed for fingerprinting based entity extraction using a rolling hash technique. The system is configured to receive an input stream comprising characters, the input stream of a predetermined length, and a hash table having plurality of indexed entries. The system defines a fixed window length. The system isolates, through the fixed window length, a set of a plurality of characters of the input stream. The system generates a hash key. The hash key is used to index into the hash table. The system compares the isolated set of plurality of characters of the input stream with the entry corresponding to the index into the hash table to determine whether there is an exact match with the entry. The system slides the fixed window length one character to isolate another set of a plurality of characters of the input stream in response to no exact match from the comparison. Alternatively, the system stores the input stream in response to an exact match from the comparison.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.