Patent · US Active

Corpus expansion using lexical signatures

US11416562B1 · kind B1 · utility

6Cited by
9References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 23, 2021
Grant dateAug 16, 2022
Priority date
Expiry dateApr 27, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/284
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In an approach to corpus expansion using lexical signatures, one or more computer processors retrieve a donor corpus of text, wherein the donor corpus includes a plurality of documents. One or more computer processors generate a document signature for each of the plurality of documents in the donor corpus. One or more computer processors retrieve a target corpus of text for expansion. One or more computer processors generate a corpus signature for the target corpus. One or more computer processors compare each document signature to the corpus signature. Based on the comparison, one or more computer processors determine a similarity score for each document signature. One or more computer processors rank the plurality of documents by the similarity score. One or more computer processors add one or more top-ranked documents of the plurality of documents to the target corpus.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.