Patent · US Active

Advanced field extractor with multiple positive examples

US9753909B2 · kind B2 · utility

Cited by: 22
References: 5
Claims: 30
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jan 30, 2015
Grant date: Sep 5, 2017
Priority date
Expiry date: Sep 27, 2035

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/2477
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The technology disclosed relates to formulating and refining field extraction rules that are used at query time on raw data with a late-binding schema. The field extraction rules identify portions of the raw data, as well as their data types and hierarchical relationships. These extraction rules are executed against very large data sets that are not organized into relational structures and have not been processed by standard extraction or transformation methods. Working from sample events, a focus on primary and secondary example events helps formulate either a single extraction rule spanning multiple data formats or multiple rules directed to distinct formats. Selection tools mark up the example events to indicate positive examples for the extraction rules and to identify negative examples, which helps avoid mistaken value selection. The extraction rules can be saved for query-time use and can be incorporated into a data model for sets and subsets of event data.
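
As an illustration only, the idea of query-time extraction checked against positive and negative examples might be sketched as follows. This is not the patented method: the sample events, the regex-based rules, and the names RAW_EVENTS, RULES, extract_fields, and validate_rules are all hypothetical, and regular expressions are assumed here simply as one plausible form an extraction rule could take.

```python
import re

# Hypothetical raw events in two distinct formats; nothing is parsed at ingest time.
RAW_EVENTS = [
    "2015-01-30 12:00:01 status=200 user=alice",
    "2015-01-30 12:00:02 status=404 user=bob",
    "2015-01-30 12:00:03 ERROR code 500 for carol",
]

# Assumed rule form: one regex per format, with named groups as the fields.
RULES = [
    re.compile(r"status=(?P<status>\d{3}) user=(?P<user>\w+)"),
    re.compile(r"code (?P<status>\d{3}) for (?P<user>\w+)"),
]


def extract_fields(event, rules):
    """Apply the rules to one raw event at query time; the first match wins."""
    for rule in rules:
        match = rule.search(event)
        if match:
            return match.groupdict()
    return {}


def validate_rules(rules, positives, negatives):
    """Check rules against marked-up examples.

    positives: (event, field, value) triples that must be extracted.
    negatives: (event, field, value) triples that must NOT be extracted.
    """
    for event, field, value in positives:
        if extract_fields(event, rules).get(field) != value:
            return False
    for event, field, value in negatives:
        if extract_fields(event, rules).get(field) == value:
            return False
    return True


# Positive examples span both formats; the negative example guards against
# mistaking the leading digits of the year for a status code.
POSITIVES = [
    (RAW_EVENTS[0], "status", "200"),
    (RAW_EVENTS[2], "status", "500"),
]
NEGATIVES = [
    (RAW_EVENTS[0], "status", "201"),
]

if __name__ == "__main__":
    print(validate_rules(RULES, POSITIVES, NEGATIVES))
    for event in RAW_EVENTS:
        print(extract_fields(event, RULES))
```

In this sketch, an overly broad rule such as a bare three-digit pattern would pick up "201" from the timestamp and fail validation, which is the kind of mistaken value selection that negative examples are meant to catch.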

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.