Patent · US Expired

Method and apparatus for clustering a collection of linked documents using co-citation analysis

US6038574A · kind A · utility

Cited by: 146
References: 8
Claims: 15
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 18, 1998
Grant date: Mar 14, 2000
Priority date
Expiry date: Mar 18, 2018

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99943
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The method and apparatus of the present invention generate clusters of documents in a collection of linked documents based on co-citation analysis. The frequency linkage is determined for each document in the collection; that is, the number of times each document is linked to by another document in the collection is counted. A minimum frequency linkage (link frequency threshold) is then specified based on a predetermined minimum frequency of document linkage. Additionally, a list of pairs of documents that are linked to by the same document is created, with each pair carrying a count of the number of times both documents are linked to by another document (the co-citation frequency). Pairs of linked documents are clustered using a suitable co-citation technique.
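
The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract leaves the clustering step open ("a suitable co-citation technique"), so the sketch assumes one simple choice: documents are grouped when a chain of sufficiently co-cited pairs connects them (single-link grouping via union-find). The function name `cocitation_clusters`, the parameters `min_link_freq` and `min_cocitation`, and the sample data in `example_links` are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def cocitation_clusters(links, min_link_freq=1, min_cocitation=1):
    """links maps each document to the set of documents it links to."""
    # Step 1: link frequency = number of distinct other documents linking to each target.
    link_freq = Counter()
    for source, targets in links.items():
        for target in set(targets):
            if target != source:
                link_freq[target] += 1

    # Step 2: keep only documents that meet the link frequency threshold.
    kept = {doc for doc, n in link_freq.items() if n >= min_link_freq}

    # Step 3: for each pair of kept documents, count the sources that link to both.
    cocite = Counter()
    for source, targets in links.items():
        cited = sorted(t for t in set(targets) if t in kept and t != source)
        for pair in combinations(cited, 2):
            cocite[pair] += 1

    # Step 4 (assumed technique): merge pairs whose co-citation frequency
    # reaches min_cocitation, using a small union-find structure.
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for (a, b), n in cocite.items():
        if n >= min_cocitation:
            parent[find(a)] = find(b)

    clusters = {}
    for doc in list(parent):
        clusters.setdefault(find(doc), set()).add(doc)
    return link_freq, cocite, sorted(sorted(c) for c in clusters.values())


# Hypothetical collection: five pages (p1..p5) linking to documents a..d.
example_links = {
    "p1": {"a", "b"},
    "p2": {"a", "b", "c"},
    "p3": {"c", "d"},
    "p4": {"c", "d"},
    "p5": {"a"},
}
link_freq, cocite, clusters = cocitation_clusters(
    example_links, min_link_freq=2, min_cocitation=2
)
print(clusters)
```

In this sketch, raising `min_cocitation` splits clusters apart, while raising `min_link_freq` removes rarely linked documents before any pairs are counted. Documents that never reach the co-citation threshold with any partner do not appear in a cluster.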

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.