Patent · US Active

Computer system, method, and computer program for extracting terms from document data including text segment

US8463794B2 · kind B2 · utility

2Cited by
12References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 30, 2009
Grant dateJun 11, 2013
Priority date
Expiry dateMar 5, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/284
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer system, method, and article of manufacture for extracting a term from electronic document data that includes a text segment. The system includes: a first extraction unit that uses a first text processing information to extract a noun word from the document data; a second extraction unit that uses a second text processing information to extract a term candidate in relation to the noun word or a corpus that includes text data described in the same language used in the document data; a weight assignment unit that uses a third text processing information to select which type to assign a weight from the plurality of types and assigns the weight to the selected type for each noun word and term candidate; a determination unit that determines the type to which the noun word and term candidate belong; and an output unit to output the noun word and term candidate.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.