Patent · US Active

Business data lake search engine

US10795895B1 · kind B1 · utility

9Cited by
2References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 26, 2017
Grant dateOct 6, 2020
Priority date
Expiry dateSep 10, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/9024
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Business Data Lake searching techniques are provided. A method comprises obtaining a graph representing tables of the Business Data Lake, where each node represents one table and edges between nodes represent foreign key connections; applying a node rank algorithm to determine a relevancy score of the tables based on a number of links to/from other tables; and, in response to a query: ranking a relevancy of query items based on a term frequency-based score to generate candidate results; extracting a candidate sub-graph based on the following: a top-L tables based on the term frequency-based score, and/or a top-M tables based on a topic model distance score for the given query and candidate items; enriching the extracted candidate sub-graph by adding new tables using an item-to-item collaborative filter where a similarity between two tables is measured based on a number of interactions; and ordering the tables in the enriched sub-graph based on the relevancy score and/or a user-to-item collaborative filter that evaluates past user interactions with prior results.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.