Patent · US Active

Indexing for regular expressions in text-centric applications

US8266135B2 · kind B2 · utility

Cited by: 2
References: 0
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jan 5, 2009
Grant date: Sep 11, 2012
Priority date
Expiry date: Jan 27, 2031

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/31
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method, system, and article are provided for evaluating regular expressions over large data collections. A general-purpose index is built to handle complex regular expressions at the character level. Characters, character classes, and associated metadata are identified and stored in an index of a collection of documents. Given a regular expression, a query is generated based on the contents of the index. This query is executed over the index to identify the set of documents in the collection over which the regular expression can be evaluated. The identified set of documents is then returned for evaluation by the regular expression.
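
The flow described in the abstract (index characters and character classes, derive a query from the regular expression, run the query to narrow the document set, then evaluate the expression on the survivors) can be illustrated with a minimal Python sketch. This is not the patented method: it swaps in a simple character trigram index plus one character-class flag, and it assumes the caller supplies the literal substrings and classes that every match must contain instead of deriving them from the expression automatically. The names CharIndex, candidates, and search are hypothetical.

```python
import re
from collections import defaultdict


def ngrams(text, n=3):
    """Return the set of character n-grams in text (empty if text is shorter than n)."""
    return {text[i:i + n] for i in range(len(text) - n + 1)}


class CharIndex:
    """Toy character-level index: trigram postings plus per-class postings (assumed design)."""

    # Character classes tracked by the index; each maps to a detector regex.
    CLASSES = {"digit": re.compile(r"\d")}

    def __init__(self, n=3):
        self.n = n
        self.docs = {}                          # doc_id -> text
        self.gram_postings = defaultdict(set)   # n-gram -> doc ids
        self.class_postings = defaultdict(set)  # class name -> doc ids

    def add(self, doc_id, text):
        self.docs[doc_id] = text
        for gram in ngrams(text, self.n):
            self.gram_postings[gram].add(doc_id)
        for name, detector in self.CLASSES.items():
            if detector.search(text):
                self.class_postings[name].add(doc_id)

    def candidates(self, literals=(), classes=()):
        """Run the index query: intersect postings for every required n-gram and class."""
        result = set(self.docs)
        for literal in literals:
            # Literals shorter than n yield no n-grams and therefore do not filter.
            for gram in ngrams(literal, self.n):
                result &= self.gram_postings.get(gram, set())
        for name in classes:
            result &= self.class_postings.get(name, set())
        return result

    def search(self, pattern, literals=(), classes=()):
        """Evaluate the regular expression only over the documents the index returns."""
        rx = re.compile(pattern)
        hits = self.candidates(literals, classes)
        return sorted(d for d in hits if rx.search(self.docs[d]))


index = CharIndex()
index.add(1, "order id: 4821 shipped")
index.add(2, "order pending")
index.add(3, "invoice 77 paid")

# The caller states what every match must contain (an assumption of this sketch).
narrowed = index.candidates(literals=["order id: "], classes=["digit"])
matches = index.search(r"order id: \d+", literals=["order id: "], classes=["digit"])
```

The index query only narrows the collection. The regular expression itself still decides the final result, so a filter that is too loose costs time but never drops a true match, provided the supplied literals and classes really are required by the expression.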

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.