Method for finding large numbers of keywords in continuous text streams
US6311183A · kind A · utility
Assignee
Inventor
Key dates
| Filing date | Jul 14, 1999 |
| Grant date | Oct 30, 2001 |
| Priority date | — |
| Expiry date | Jul 14, 2019 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99936
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method of full-text scanning for matches in a large dictionary of keywords is described, suitable for SDI (selective dissemination of information). The method is applicable to large dictionaries (hundreds of thousands of entries) and to arbitrary byte sequences for both patterns and sample streams. The approach employs Boyer-Moore-Horspool skipping, extended to pattern collections and digrams, followed by an n-gram hash test, which also identifies a subset of feasible keywords for conventional pattern matching at each location of a putative match.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.