Patent · US Active

System and method for automatically summarizing documents pertaining to a predefined domain

US11074303B2 · kind B2 · utility

2Cited by
3References
11Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 21, 2018
Grant dateJul 27, 2021
Priority date
Expiry dateMar 9, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N5/022
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed is a system for automatically summarizing documents pertaining to a predefined domain. A document finder module enables a web crawler to crawl web resources in order to find a plurality of documents. A keyword determination module determines a set of keywords from the plurality of documents and a rank associated to each keyword of the set of keywords. A clustering module clusters the plurality of documents into one or more clusters. A score computation module identifies a subset of the set of keywords for each cluster upon computing a similarity score, corresponding to each keyword, for each cluster. A summary generation module generates a summary for each cluster based on presence of one or more keywords, of the subset, in each document classified in the cluster.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.