Patent · US Active

Enhancement of massive data ingestion by similarity linkage of documents

US11049024B2 · kind B2 · utility

0Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 21, 2017
Grant dateJun 29, 2021
Priority date
Expiry dateMar 20, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/106
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for ingesting a plurality of content according to a statistical similarity of at least one portion of the ingested plurality of content into an information handling system capable of answering questions, whereby the ingested plurality of content is based on a received topic and ingesting the plurality of content comprises ingesting a plurality of documents associated with the received topic is provided. The method may include determining at least one similarity between each document based on a similarity criteria. The method may also include applying a statistical model to characterize the determined at least one similarity between each document. The method may further include creating at least one pair-wise link for each document. The method may additionally include mapping the created at least one pair-wise link. The method may include generating a plurality of rules for ingesting a plurality of additional content.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.