Patent · US Active

System and method for identifying phrases in text

US8949111B2 · kind B2 · utility

1Cited by
9References
20Claims
0Family size

Assignee

Inventor

Key dates

Filing dateDec 14, 2011
Grant dateFeb 3, 2015
Priority date
Expiry dateApr 5, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/295
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method includes accessing text that includes a plurality of words, tagging each of the plurality of words with one of a plurality of parts of speech (POS) tags, and creating a plurality of tokens, each token comprising one of the plurality of words and its associated POS tag. The method further includes clustering one or more of the created tokens into a chunk of tokens, the one or more tokens clustered into the chunk of tokens based on the POS tags of the one or more tokens, and forming a phrase based on the chunk of tokens, the phrase comprising the words of the one or more tokens clustered into the chunk of tokens.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.