Patent · US Expired

Text summarization using part-of-speech

US6289304A · kind A · utility

Cited by: 140
References: 13
Claims: 20
Family size: 0

Assignee

Inventor

Key dates

Filing date: Mar 17, 1999
Grant date: Sep 11, 2001
Priority date
Expiry date: Mar 17, 2019

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/345
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Text is summarized using part-of-speech (POS) data indicating parts of speech for tokens in the text. The POS data can be obtained using input text data defining the text, such as by POS tagging. The POS data can be used to obtain group data indicating groups of tokens of the text, such as verb groups and noun groups. The group data can also indicate, within each group, any tokens that meet a POS-based removal criterion. The group data can be used to obtain summarized text data by removing tokens that meet the removal criterion. The original text may be obtained via scanner or video camera from a user's document, and may be recognized to obtain input text data. The summarized text may be output as text or as audio pronunciation using a speech synthesizer.
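
The pipeline in the abstract (tag tokens, form noun and verb groups, drop tokens that meet a POS-based removal criterion) can be illustrated with a minimal Python sketch. Everything specific here is an assumption for illustration and is not taken from the patent: the toy `LEXICON` stands in for a real POS tagger, the tag names and the grouping rule (adjacent tokens of the same group kind are merged) are simplified, and the removal criterion (drop determiners, adjectives, and adverbs inside a group) is only one plausible choice.

```python
from typing import List, Tuple

# Hypothetical toy lexicon standing in for a real POS tagger (assumption).
LEXICON = {
    "the": "DET", "a": "DET",
    "old": "ADJ", "large": "ADJ",
    "scanner": "NOUN", "document": "NOUN",
    "quickly": "ADV",
    "reads": "VERB",
}

NOUN_GROUP_TAGS = {"DET", "ADJ", "NOUN"}  # tags allowed inside a noun group
VERB_GROUP_TAGS = {"ADV", "VERB"}         # tags allowed inside a verb group
REMOVABLE = {"DET", "ADJ", "ADV"}         # assumed POS-based removal criterion

Tagged = Tuple[str, str]


def pos_tag(tokens: List[str]) -> List[Tagged]:
    """Look up each token in the toy lexicon; unknown words get 'OTHER'."""
    return [(tok, LEXICON.get(tok.lower(), "OTHER")) for tok in tokens]


def group_kind(tag: str) -> str:
    """Map a POS tag to a group kind: noun group, verb group, or outside."""
    if tag in NOUN_GROUP_TAGS:
        return "NG"
    if tag in VERB_GROUP_TAGS:
        return "VG"
    return "O"


def group_tokens(tagged: List[Tagged]) -> List[Tuple[str, List[Tagged]]]:
    """Merge adjacent tokens of the same group kind into one group."""
    groups: List[Tuple[str, List[Tagged]]] = []
    for token, tag in tagged:
        kind = group_kind(tag)
        if groups and kind != "O" and groups[-1][0] == kind:
            groups[-1][1].append((token, tag))
        else:
            groups.append((kind, [(token, tag)]))
    return groups


def summarize(text: str) -> str:
    """Drop tokens inside a group whose tag meets the removal criterion."""
    kept: List[str] = []
    for kind, members in group_tokens(pos_tag(text.split())):
        for token, tag in members:
            if kind != "O" and tag in REMOVABLE:
                continue
            kept.append(token)
    return " ".join(kept)


if __name__ == "__main__":
    print(summarize("The old scanner quickly reads a large document"))
```

With the toy lexicon above, the input sentence is split into a noun group, a verb group, and a second noun group, and only the head noun or verb of each survives. A practical system would replace `pos_tag` with a trained tagger and would likely use a richer grouping grammar and removal criterion.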

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.