Patent · US Active

Enhancement of massive data ingestion by similarity linkage of documents

US9916533B2 · kind B2 · utility

0Cited by
5References
9Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 10, 2015
Grant dateMar 13, 2018
Priority date
Expiry dateJan 2, 2036

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/106
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for ingesting a plurality of content according to a statistical similarity of at least one portion of the ingested plurality of content into an information handling system capable of answering questions, whereby the ingested plurality of content is based on a received topic and ingesting the plurality of content comprises ingesting a plurality of documents associated with the received topic is provided. The method may include determining at least one similarity between each document based on a similarity criteria. The method may also include applying a statistical model to characterize the determined at least one similarity between each document. The method may further include creating at least one pair-wise link for each document. The method may additionally include mapping the created at least one pair-wise link. The method may include generating a plurality of rules for ingesting a plurality of additional content.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.