Patent · US · Expired

Method and system for classifying display pages using summaries

US7392474B2 · kind B2 · utility

Cited by: 10
References: 9
Claims: 42
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 30, 2004
Grant date: Jun 24, 2008
Priority date
Expiry date: Mar 10, 2026

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/951
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and system for classifying display pages based on automatically generated summaries of display pages. A web page classification system uses a web page summarization system to generate summaries of web pages. The summary of a web page may include the sentences of the web page that are most closely related to the primary topic of the web page. The summarization system may combine the benefits of multiple summarization techniques to identify the sentences of a web page that represent the primary topic of the web page. Once the summary is generated, the classification system may apply conventional classification techniques to the summary to classify the web page. The classification system may use conventional classification techniques such as a Naïve Bayesian classifier or a support vector machine to identify the classifications of a web page based on the summary generated by the summarization system.
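
The two-stage flow described in the abstract (summarize the page, then classify the summary) can be illustrated with a minimal sketch. It is not the patented implementation. The abstract says the summarization system may combine multiple techniques; as a stand-in, this sketch assumes a single word-frequency sentence score. The classifier is a small multinomial Naïve Bayes with add-one smoothing, and a support vector machine could be substituted at that step. All names (`summarize`, `NaiveBayes`, `classify_page`, `STOPWORDS`) and the sample pages are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Assumed, deliberately tiny stopword list for illustration only.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on", "with"}


def tokenize(text):
    """Return lowercase alphabetic tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def summarize(page_text, max_sentences=2):
    """Keep the sentences whose words are, on average, most frequent on the page.

    Stand-in for the summarization system: one frequency-based score,
    not the combination of techniques the abstract mentions.
    """
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", page_text) if s.strip()]
    freq = Counter(tokenize(page_text))

    def score(sentence):
        words = tokenize(sentence)
        return sum(freq[w] for w in words) / len(words) if words else 0.0

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore original sentence order
    return " ".join(sentences[i] for i in keep)


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of training texts
        self.vocab = set()

    def train(self, text, label):
        words = tokenize(text)
        self.word_counts[label].update(words)
        self.doc_counts[label] += 1
        self.vocab.update(words)

    def classify(self, text):
        words = tokenize(text)
        total_docs = sum(self.doc_counts.values())
        best_label, best_logp = None, -math.inf
        for label in self.doc_counts:
            logp = math.log(self.doc_counts[label] / total_docs)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for w in words:
                logp += math.log((self.word_counts[label][w] + 1) / denom)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


def classify_page(classifier, page_text):
    """Classify a page from its summary rather than from its full text."""
    return classifier.classify(summarize(page_text))


# Hypothetical training pages, one per category.
clf = NaiveBayes()
clf.train(summarize("The team won the football match. "
                    "The striker scored two goals in the match."), "sports")
clf.train(summarize("The bank raised interest rates. "
                    "Investors sold stocks and bonds as rates rose."), "finance")

page = ("The football team scored late goals. "
        "Fans cheered the team after the match. "
        "Click here to subscribe to the newsletter.")
print(summarize(page))
print(classify_page(clf, page))
```

In this sketch the off-topic "subscribe" sentence scores lowest and is dropped before classification, which is the motivation the abstract gives for classifying the summary instead of the whole page.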

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.