Patent · US Active

Determining an extraction rule from positive and negative examples

US11042697B2 · kind B2 · utility

2Cited by
118References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 1, 2019
Grant dateJun 22, 2021
Priority date
Expiry dateOct 1, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/279
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The technology disclosed relates to formulating and refining field extraction rules that are used at query time on raw data with a late-binding schema. The field extraction rules identify portions of the raw data, as well as their data types and hierarchical relationships. These extraction rules are executed against very large data sets not organized into relational structures that have not been processed by standard extraction or transformation methods. By using sample events, a focus on primary and secondary example events help formulate either a single extraction rule spanning multiple data formats, or multiple rules directed to distinct formats. Selection tools mark up the example events to indicate positive examples for the extraction rules, and to identify negative examples to avoid mistaken value selection. The extraction rules can be saved for query-time use, and can be incorporated into a data model for sets and subsets of event data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.