Inferring a dataset schema from input files
US10540333B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 5, 2018 |
| Grant date | Jan 21, 2020 |
| Priority date | — |
| Expiry date | Dec 5, 2038 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/205
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques for generating a schema for a data input file are described herein. In an embodiment, a server computer receives a data input file. The server computer system selects a sample excerpt from the data input which comprises a subset of the data input file. The server computer system analyzes the sample excerpt to determine a row delimiter for the data input file, a column delimiter for the data input file, and a plurality of data format types. Using the column delimiter, row delimiter, and plurality of data format types, the server computer system generates a candidate schema for the data input file.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.