Weighting method for use in information extraction and abstracting, based on the frequency of occurrence of keywords and similarity calculations
US6240378A · kind A · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Dec 14, 1998 |
| Grant date | May 29, 2001 |
| Priority date | — |
| Expiry date | Dec 14, 2018 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/53
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
An information abstracting method and apparatus for extracting and displaying keywords as an information abstract. Given a large number of character string data sets divided into prescribed units, the extracted keywords are significant and effective in describing a topic common to the plurality of units. The information abstracting apparatus comprises an input section for accepting an input of character string data divided into prescribed units, with each individual character represented by a character code, and an output section for displaying the result of information abstracting. Keywords contained in each of the prescribed units are extracted by a keyword extracting section from the character string input data from the input section. A score is calculated for each keyword by a score calculating section, so that a higher score is given to a keyword extracted from a larger number of units. On the basis of the calculated scores, keywords are selected by an abstracting section and are outputted as an information abstract by the output section.
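The scoring idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the tokenizer in `extract_keywords`, the function names, and the `top_n` parameter are all assumptions made for illustration, and the similarity calculations mentioned in the title are not modeled. It only shows the one property the abstract states, namely that a keyword found in more units receives a higher score.

```python
import re
from collections import Counter


def extract_keywords(unit: str) -> set[str]:
    """Hypothetical extractor: lowercase alphabetic tokens of 3+ letters.

    The patent does not specify this rule; it stands in for the
    keyword extracting section.
    """
    return set(re.findall(r"[a-z]{3,}", unit.lower()))


def score_keywords(units: list[str]) -> Counter:
    """Score each keyword by the number of units it was extracted from."""
    scores: Counter = Counter()
    for unit in units:
        # A set is used, so each unit adds at most 1 to a keyword's score.
        scores.update(extract_keywords(unit))
    return scores


def make_abstract(units: list[str], top_n: int = 3) -> list[str]:
    """Select the top_n keywords by score (ties broken alphabetically)."""
    scores = score_keywords(units)
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [keyword for keyword, _ in ranked[:top_n]]


# Assumed sample input: each string is one "prescribed unit".
units = [
    "Storm hits coast",
    "Coast storm warning issued",
    "Markets rally as storm passes",
]
print(make_abstract(units, top_n=2))
```

Counting units rather than raw occurrences means a word repeated many times inside a single unit does not outrank a word that appears once in every unit, which matches the stated goal of describing a topic common to the units.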
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.