System and method for automatically summarizing documents pertaining to a predefined domain
US11074303B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 21, 2018 |
| Grant date | Jul 27, 2021 |
| Priority date | — |
| Expiry date | Mar 9, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N5/022
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed is a system for automatically summarizing documents pertaining to a predefined domain. A document finder module enables a web crawler to crawl web resources in order to find a plurality of documents. A keyword determination module determines a set of keywords from the plurality of documents and a rank associated with each keyword of the set. A clustering module clusters the plurality of documents into one or more clusters. For each cluster, a score computation module computes a similarity score for each keyword and, from those scores, identifies a subset of the set of keywords. A summary generation module generates a summary for each cluster based on the presence of one or more keywords of the subset in each document classified in the cluster.
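The module sequence in the abstract can be illustrated with a minimal Python sketch. The abstract does not disclose the ranking, clustering, or similarity formulas, so everything concrete here is an assumption made for illustration: keywords are ranked by corpus frequency, clustering is a greedy single pass on Jaccard overlap of keywords, the similarity score is the share of a cluster's documents that contain the keyword, and the summary is extractive. All function names, thresholds, and the stopword list are hypothetical, and the crawler step is replaced by an in-memory list of documents.

```python
import re
from collections import Counter

# Hypothetical stopword list; a real system would use a fuller one.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for", "on", "with"}

def tokens(text):
    """Lowercase word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

def ranked_keywords(docs, top_k=6):
    """Keyword -> rank, where rank 1 is the most frequent word (assumed ranking)."""
    counts = Counter(w for d in docs for w in tokens(d))
    ordered = sorted(counts, key=lambda w: (-counts[w], w))[:top_k]
    return {w: rank for rank, w in enumerate(ordered, start=1)}

def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def cluster(docs, keywords, threshold=0.3):
    """Greedy single-pass clustering on keyword overlap (a stand-in technique)."""
    kw_set = set(keywords)
    clusters = []  # each cluster is a list of document indices
    for i, d in enumerate(docs):
        kw = set(tokens(d)) & kw_set
        for c in clusters:
            # Compare against the first document that opened the cluster.
            seed = set(tokens(docs[c[0]])) & kw_set
            if jaccard(kw, seed) >= threshold:
                c.append(i)
                break
        else:
            clusters.append([i])
    return clusters

def cluster_keywords(docs, members, keywords, min_score=0.5):
    """Score each keyword as the share of member documents containing it,
    then keep the keywords whose score reaches min_score (assumed scoring)."""
    doc_tokens = [set(tokens(docs[i])) for i in members]
    scores = {w: sum(w in t for t in doc_tokens) / len(members) for w in keywords}
    return {w: s for w, s in scores.items() if s >= min_score}

def summarize(docs, members, subset, max_sentences=2):
    """Extractive summary: sentences mentioning the most subset keywords."""
    sentences = [s.strip() for i in members
                 for s in re.split(r"(?<=[.!?])\s+", docs[i]) if s.strip()]
    scored = [(sum(w in subset for w in set(tokens(s))), s) for s in sentences]
    # sorted() is stable, so ties keep their original document order.
    best = sorted((x for x in scored if x[0] > 0), key=lambda x: -x[0])
    return " ".join(s for _, s in best[:max_sentences])

# Toy corpus standing in for the crawler output.
docs = [
    "Solar panels convert sunlight into power. Solar power is clean.",
    "Solar panels lower power bills.",
    "Football teams train daily. Football fans cheer loudly.",
    "Football teams need fans.",
]
keywords = ranked_keywords(docs)
clusters = cluster(docs, keywords)
subsets = [cluster_keywords(docs, c, keywords) for c in clusters]
summaries = [summarize(docs, c, s) for c, s in zip(clusters, subsets)]
for members, summary in zip(clusters, summaries):
    print(members, summary)
```

On this toy corpus the two solar documents and the two football documents fall into separate clusters, and each cluster's summary is drawn only from sentences that mention that cluster's keyword subset. Swapping in TF-IDF weights or k-means would change the individual steps without changing the overall flow of keywords, clusters, per-cluster keyword subset, and summary.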
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.