Patent · US Active

Data normalization using data edge platform

US11762876B2 · kind B2 · utility

1Cited by
6References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 4, 2021
Grant dateSep 19, 2023
Priority date
Expiry dateOct 4, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/221
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed are system and methods for processing and storing data files, using a data edge file format. The data edge file format separates information about what symbols are in a data file and information about the corresponding location of those symbols in the data file. Examples convert a source file comprising symbols into a data edge index having a manifest portion, a symbol portion, and a locality portion. The symbol portion contains a sorted unique set of symbols from the source file, and the locality portion contains a plurality of location values referencing the symbol portion. Examples include normalizing structured data from the source file by modifying the locality manifest portion of the data edge file to include a description of at least one nonexistent column empty locality value at a respective position within the locality file representing an omission of data at an associated position in the source file.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.