Inferring a dataset schema from input files
US10204119B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 20, 2017 |
| Grant date | Feb 12, 2019 |
| Priority date | — |
| Expiry date | Sep 18, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F3/0638
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques for generating a schema for a data input file are described herein. In an embodiment, a server computer receives a data input file. The server computer system selects a sample excerpt from the data input which comprises a subset of the data input file. The server computer system analyzes the sample excerpt to determine a row delimiter for the data input file, a column delimiter for the data input file, and a plurality of data format types. Using the column delimiter, row delimiter, and plurality of data format types, the server computer system generates a candidate schema for the data input file.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.