Enhancement of massive data ingestion by similarity linkage of documents
US9916534B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 9, 2015 |
| Grant date | Mar 13, 2018 |
| Priority date | — |
| Expiry date | Jan 2, 2036 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/106
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for ingesting a plurality of content according to a statistical similarity of at least one portion of the ingested plurality of content into an information handling system capable of answering questions, whereby the ingested plurality of content is based on a received topic and ingesting the plurality of content comprises ingesting a plurality of documents associated with the received topic is provided. The method may include determining at least one similarity between each document based on a similarity criteria. The method may also include applying a statistical model to characterize the determined at least one similarity between each document. The method may further include creating at least one pair-wise link for each document. The method may additionally include mapping the created at least one pair-wise link. The method may include generating a plurality of rules for ingesting a plurality of additional content.
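The steps named in the abstract (score similarity, characterize the scores statistically, create pair-wise links, map them, derive an ingestion rule) can be illustrated with a minimal sketch. The abstract does not name a similarity criterion or a statistical model, so everything here is an assumption chosen for illustration: bag-of-words cosine similarity stands in for the similarity criterion, the mean and standard deviation of the pair scores stand in for the statistical model, and the function names `cosine_similarity`, `link_documents`, and `should_ingest` and the parameter `k` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations
from statistics import mean, pstdev


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity of two term-count vectors (assumed similarity criterion)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def link_documents(docs: dict[str, str], k: float = 0.0):
    """Score every document pair, summarize the scores, and map the links.

    Returns (scores, threshold, link_map). A pair is linked when its score
    exceeds mean + k * standard deviation of all pair scores (an assumed
    stand-in for the statistical model in the abstract).
    """
    if len(docs) < 2:
        raise ValueError("need at least two documents to form a pair")
    vectors = {name: Counter(text.lower().split()) for name, text in docs.items()}
    scores = {
        (x, y): cosine_similarity(vectors[x], vectors[y])
        for x, y in combinations(sorted(vectors), 2)
    }
    threshold = mean(scores.values()) + k * pstdev(scores.values())
    # The link map is an adjacency mapping: document name -> linked documents.
    link_map = {name: set() for name in docs}
    for (x, y), score in scores.items():
        if score > threshold:
            link_map[x].add(y)
            link_map[y].add(x)
    return scores, threshold, link_map


def should_ingest(candidate: str, docs: dict[str, str], threshold: float) -> bool:
    """Hypothetical ingestion rule: accept additional content that is more
    similar than the threshold to at least one already ingested document."""
    cand = Counter(candidate.lower().split())
    return any(
        cosine_similarity(cand, Counter(text.lower().split())) > threshold
        for text in docs.values()
    )


docs = {
    "a": "heart disease treatment options",
    "b": "heart disease treatment guidelines",
    "c": "stock market index funds",
}
scores, threshold, link_map = link_documents(docs)
```

With these three sample documents, only "a" and "b" share terms, so they are the only linked pair, and `should_ingest` accepts a new text only if it overlaps enough with something already ingested. A real system would likely use a richer similarity measure and a fitted model rather than a fixed mean-based cutoff.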
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.