Patent · US Active

Systems and methods for identifying collocation errors in text

US8473278B2 · kind B2 · utility

49Cited by
6References
34Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 24, 2009
Grant dateJun 25, 2013
Priority date
Expiry dateOct 26, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/284
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods for detecting collocation errors in a text sample using a reference database from a corpus are provided. Collocation candidates are identified within the text sample based upon syntactic patterns in the text sample. Whether a given collocation candidate contains a collocation error is detected, the detecting including: determining a first association measure using the reference database for the given collocation candidate; determining whether the first association measure satisfies a predetermined condition and identifying the given collocation candidate as proper if the first association measure satisfies the predetermined condition; determining an additional association measure for a variation of the given collocation candidate using the reference database; and determining whether or not the collocation candidate contains an error based upon the additional association measure of the variation.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.