Patent · US Expired

Weighting method for use in information extraction and abstracting, based on the frequency of occurrence of keywords and similarity calculations

US6240378A · kind A · utility

37Cited by
17References
10Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 14, 1998
Grant dateMay 29, 2001
Priority date
Expiry dateDec 14, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/53
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An information abstracting method and apparatus for extracting and displaying keywords as an information abstract. Given a large number of character string data sets divided into prescribed units, the extracted keywords are significant and effective in describing a topic common to the plurality of units. The information abstracting apparatus comprises an input section for accepting an input of character string data divided into prescribed units, with each individual character represented by a character code, and an output section for displaying the result of information abstracting. Keywords contained in each of the prescribed units are extracted by a keyword extracting section from the character string input data from the input section. A score is calculated for each keyword by a score calculating section, so that a higher score is given to a keyword extracted from a larger number of units. On the basis of the calculated scores, keywords are selected by an abstracting section and are outputted as an information abstract by the output section.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.