Patent · US Active

Method and system for document similarity analysis

US12189693B2 · kind B2 · utility

Cited by: 0
References: 1
Claims: 22
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 30, 2023
Grant date: Jan 7, 2025
Priority date
Expiry date: Jun 30, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/383
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for document similarity analysis. The method includes generating a reference document content identifier for a reference document, including identifying frequently occurring terms in reference document content, encoding each frequently occurring term in a term identifier and combining the term identifiers to form the reference document content identifier associated with the reference document. The method also includes obtaining at least one document similarity value by comparing the reference document content identifier to a set of archived document content identifiers stored in a document repository.
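
The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how terms are tokenized, how a term is encoded, how term identifiers are combined, or how identifiers are compared. The sketch assumes lowercase alphabetic tokenization, a truncated SHA-1 digest as the term identifier, a sorted tuple as the combined content identifier, and Jaccard overlap as the similarity value. All function names and the `top_k` parameter are hypothetical.

```python
from __future__ import annotations

import hashlib
import re
from collections import Counter


def term_identifier(term: str) -> str:
    # Assumption: a term is encoded as the first 8 hex digits of its SHA-1 digest.
    return hashlib.sha1(term.encode("utf-8")).hexdigest()[:8]


def content_identifier(text: str, top_k: int = 5) -> tuple[str, ...]:
    # Assumption: terms are runs of ASCII letters, compared case-insensitively.
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    # Take the top_k most frequent terms; ties are broken alphabetically
    # so the result is deterministic.
    frequent = sorted(counts, key=lambda t: (-counts[t], t))[:top_k]
    # Assumption: "combining" means collecting the term identifiers
    # into a sorted tuple.
    return tuple(sorted(term_identifier(t) for t in frequent))


def similarity(id_a: tuple[str, ...], id_b: tuple[str, ...]) -> float:
    # Assumption: similarity is the Jaccard overlap of the two identifier sets.
    set_a, set_b = set(id_a), set(id_b)
    if not set_a and not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def compare_to_repository(
    reference_text: str,
    repository: dict[str, tuple[str, ...]],
    top_k: int = 5,
) -> dict[str, float]:
    # Build the reference identifier once, then score it against every
    # archived identifier in the (hypothetical) in-memory repository.
    reference_id = content_identifier(reference_text, top_k)
    return {
        name: similarity(reference_id, archived_id)
        for name, archived_id in repository.items()
    }
```

A caller would first store `content_identifier(text)` for each archived document, then pass a new reference text to `compare_to_repository` to obtain one similarity value per archived document. Comparing compact identifiers rather than full texts keeps the repository small, at the cost of ignoring everything except the most frequent terms.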

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.