Patent · US Expired

Method for finding large numbers of keywords in continuous text streams

US6311183A · kind A · utility

Cited by: 33
References: 0
Claims: 3
Family size: 0

Assignee

Inventor

Key dates

Filing date: Jul 14, 1999
Grant date: Oct 30, 2001
Priority date
Expiry date: Jul 14, 2019

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99936
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method of full-text scanning for matches in a large dictionary of keywords is described, suitable for SDI (selective dissemination of information). The method is applicable to large dictionaries (hundreds of thousands of entries) and to arbitrary byte sequences for both patterns and sample streams. The approach employs Boyer-Moore-Horspool skipping, extended to pattern collections and digrams, followed by an n-gram hash test, which also identifies a subset of feasible keywords for conventional pattern matching at each location of a putative match.
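
The pipeline the abstract outlines (skip, then hash test, then conventional match) can be illustrated with a minimal Python sketch. It is not the patented method itself: the function names `build_tables` and `scan` are hypothetical, the n-gram is assumed to be the first m bytes of each keyword (m being the shortest keyword length), and a Python dict stands in for the hash table. The skip table is keyed on digrams in the Horspool style, taking the smallest safe shift over all keywords.

```python
from collections import defaultdict


def build_tables(keywords):
    """Build a digram skip table and prefix buckets (illustrative names)."""
    m = min(len(k) for k in keywords)  # window length = shortest keyword
    if m < 2:
        raise ValueError("keywords must be at least 2 bytes long")
    shift = {}                   # digram -> smallest safe shift
    buckets = defaultdict(list)  # m-byte prefix -> keywords sharing it
    for kw in keywords:
        prefix = kw[:m]
        buckets[prefix].append(kw)
        for i in range(m - 1):
            digram = prefix[i:i + 2]
            dist = m - 2 - i     # distance from this digram to the window end
            if dist < shift.get(digram, m - 1):
                shift[digram] = dist
    return m, shift, buckets


def scan(text, keywords):
    """Return (offset, keyword) pairs for every match in a bytes stream."""
    m, shift, buckets = build_tables(keywords)
    hits = []
    pos = m - 1                  # index of the last byte of the window
    while pos < len(text):
        # A digram absent from every prefix allows the maximum skip, m - 1.
        step = shift.get(text[pos - 1:pos + 1], m - 1)
        if step:
            pos += step
            continue
        # Shift 0: the window may hold a keyword prefix, so run the hash test.
        start = pos - m + 1
        for kw in buckets.get(text[start:pos + 1], ()):
            # Conventional comparison, restricted to the feasible subset.
            if text.startswith(kw, start):
                hits.append((start, kw))
        pos += 1
    return hits


if __name__ == "__main__":
    sample = b"the quick brown fox jumps"
    print(scan(sample, [b"quick", b"fox", b"brown", b"quiet"]))
    # [(4, b'quick'), (10, b'brown'), (16, b'fox')]
```

In this sketch the keywords `quick` and `quiet` share the prefix `qui`, so both land in the same bucket and only that pair is compared at offset 4. With a dictionary of hundreds of thousands of entries the digram table fills up and skips shorten, which is presumably why the abstract adds the n-gram hash stage to filter candidates.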

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.