Patent · US Active

Inferring a dataset schema from input files

US10204119B1 · kind B1 · utility

3Cited by
58References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 20, 2017
Grant dateFeb 12, 2019
Priority date
Expiry dateSep 18, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F3/0638
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for generating a schema for a data input file are described herein. In an embodiment, a server computer receives a data input file. The server computer system selects a sample excerpt from the data input which comprises a subset of the data input file. The server computer system analyzes the sample excerpt to determine a row delimiter for the data input file, a column delimiter for the data input file, and a plurality of data format types. Using the column delimiter, row delimiter, and plurality of data format types, the server computer system generates a candidate schema for the data input file.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.