Data analysis and processing engine
US11182354B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 22, 2019 |
| Grant date | Nov 23, 2021 |
| Priority date | — |
| Expiry date | Jun 19, 2040 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/901
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system formats and normalizes data stored in received files. A data processing engine identifies a filetype and format of the file based on samples of the file contents. The system detects a schema of the file to determine the datatypes and locations of data within the file. The schema detection process may depend on the identified filetype of the file. Once a filetype and schema have been determined, the system can reformat data stored within different sections of the file in view of the datatypes. The formatted file is stored in a data lake with other files received by the system. The formatting process can involve normalization of certain datatypes, which facilitates access of the data later by a user querying the data stored in files at the data lake.
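The flow described in the abstract (identify the filetype from a sample, detect a schema, then normalize values by datatype) can be illustrated with a minimal Python sketch. This is not the patented implementation. Every function name here (`detect_filetype`, `detect_schema`, `normalize`, `process`), the two supported filetypes, the date formats, and the type-inference rules are assumptions chosen for illustration. The sketch also assumes rectangular data with no missing fields, and it leaves out storage in the data lake.

```python
import csv
import io
import json
from datetime import datetime

# Assumed date layouts; a real system would likely support many more.
DATE_FORMATS = ("%Y-%m-%d", "%m/%d/%Y")


def detect_filetype(sample: str) -> str:
    """Guess the filetype from a leading sample of the contents (hypothetical heuristic)."""
    return "json" if sample.lstrip().startswith(("{", "[")) else "csv"


def load_rows(text: str, filetype: str) -> list:
    """Parse the file into a list of dicts whose values are raw strings."""
    if filetype == "json":
        data = json.loads(text)
        records = data if isinstance(data, list) else [data]
        return [{key: str(value) for key, value in rec.items()} for rec in records]
    return list(csv.DictReader(io.StringIO(text)))


def parse_date(value: str):
    """Return a datetime if the value matches a known format, else None."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(value.strip(), fmt)
        except ValueError:
            continue
    return None


def infer_type(values: list) -> str:
    """Pick the narrowest datatype that every value in the column satisfies."""
    def all_parse(convert) -> bool:
        try:
            for value in values:
                convert(value)
            return True
        except ValueError:
            return False

    if not values:
        return "string"
    if all_parse(int):
        return "int"
    if all_parse(float):
        return "float"
    if all(parse_date(value) is not None for value in values):
        return "date"
    return "string"


def detect_schema(rows: list) -> dict:
    """Map each column name to an inferred datatype."""
    columns = rows[0].keys() if rows else []
    return {col: infer_type([row[col] for row in rows]) for col in columns}


def normalize(rows: list, schema: dict) -> list:
    """Reformat each value according to its column's datatype."""
    converters = {
        "int": int,
        "float": float,
        "date": lambda raw: parse_date(raw).date().isoformat(),  # dates become ISO 8601
        "string": str.strip,
    }
    return [{col: converters[dtype](row[col]) for col, dtype in schema.items()}
            for row in rows]


def process(text: str):
    """Run the three steps in order and return (filetype, schema, normalized rows)."""
    filetype = detect_filetype(text[:64])  # only a sample is inspected
    rows = load_rows(text, filetype)
    schema = detect_schema(rows)
    return filetype, schema, normalize(rows, schema)
```

For example, calling `process` on a small CSV whose date column mixes `11/22/2019` and `2021-11-23` would classify that column as `date` and rewrite both values in ISO 8601 form, which is the kind of normalization that makes later queries across files consistent.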
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.