Processing row data for deduplication based on corresponding row numbers
US11734239B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 15, 2022 |
| Grant date | Aug 22, 2023 |
| Priority date | — |
| Expiry date | Mar 15, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/2379
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A record processing and storage system is operable to receive a plurality of labeled row data from a data source. Each labeled row data of the plurality of labeled row data includes at least one record and a corresponding row number of a plurality of row numbers. A plurality of pages are generated from records included in the labeled row data. The plurality of pages are stored via a page storage system. A plurality of page metadata corresponding to the plurality of pages is generated, where each of the plurality of page metadata is generated based on at least corresponding one row number of at least one labeled row data with records included in a corresponding one of the plurality of pages. Deduplication of duplicated records included the plurality of pages is facilitated based on the plurality of page metadata.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.