Patent · US Active

Systems and methods for generating summaries of documents

US9317498B2 · kind B2 · utility

16Cited by
3References
12Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 8, 2015
Grant dateApr 19, 2016
Priority date
Expiry dateApr 8, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/295
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods for summarizing online articles for consumption on a user device are disclosed herein. The system extracts the main body of an article's text from the HTML code of an online article. The system may then classify the extracted article into one of several different categories and removes duplicate articles. The system breaks down the article into its component sentences, and each sentence is classified into one of three categories: (1) potential candidate sentences that may be included in the generated summary; (2) weakly rejected sentences that will not be included in the summary but may be used to generate the summary; and (3) strongly rejected sentences that are not included in the summary. Finally, the system applies a document summarizer to generate quickly readable article summaries, for viewing on the user device, using relevant sentences from the article while maintaining the coherence of the article.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.