Field extraction rules from clustered data samples
US11216491B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 30, 2016 |
| Grant date | Jan 4, 2022 |
| Priority date | — |
| Expiry date | Sep 15, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The operation of an automatic data input and query system is controlled by well-defined control data. Certain control data may relate to data schemas and direct operations performed by the system to extract fields from machine data. Automatic methods may determine proper field extraction control information by analyzing a sample of data from a source, breaking the sample data into event segments, classifying the segments into groups based on a measure of similarity, determining an operable extraction rule for each group, and storing the resulting extraction model. Data patterns known by the system can be leveraged to perform the event breaking and field identification for the classifying. Embodiments may provide a user interface to view, interact with, and approve the computer-generated extraction model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.