Patent · US Active

Extracting lexical features for classifying native and non-native language usage style

US8170868B2 · kind B2 · utility

Cited by: 33
References: 7
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 14, 2006
Grant date: May 1, 2012
Priority date
Expiry date: Jul 4, 2030

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/20
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A corpus is provided of language usage by non-native users of the language. Characteristics of the corpus are measured and used to create a language usage classifier for indicating non-native usage of the language. Once the language usage classifier is created, a natural language input may be entered, and the characteristics thereof measured. These characteristics are then compared with the indicators of non-native usage, thereby detecting non-native usage. The evaluation of non-native usage may be used as a versatile foundation to enhance a wide variety of tools and applications dealing with user interaction in languages other than their native language.
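
The flow described in the abstract (measure characteristics of a corpus, derive indicators of non-native usage, then compare a new input against them) can be illustrated with a minimal sketch. This is not the patented method: the abstract does not say which characteristics are measured or how the classifier is built. The word-bigram features, the native reference corpus, the add-one smoothed log-ratio weights, the zero decision threshold, and all function names below are assumptions made for illustration only.

```python
import math
import re
from collections import Counter


def measure(text):
    """Measure lexical characteristics of a text.

    Assumption: the characteristics are counts of adjacent lowercase
    word pairs (bigrams). The abstract does not name the features.
    """
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(zip(words, words[1:]))


def build_indicators(non_native_corpus, native_corpus):
    """Create per-feature indicators of non-native usage.

    Assumption: a native reference corpus is available for contrast.
    Each weight is a log-ratio of add-one smoothed relative frequencies;
    positive weights mark features more typical of the non-native corpus.
    """
    non_native, native = Counter(), Counter()
    for text in non_native_corpus:
        non_native.update(measure(text))
    for text in native_corpus:
        native.update(measure(text))
    vocab = set(non_native) | set(native)
    nn_total = sum(non_native.values()) + len(vocab)
    na_total = sum(native.values()) + len(vocab)
    return {
        feat: math.log((non_native[feat] + 1) / nn_total)
        - math.log((native[feat] + 1) / na_total)
        for feat in vocab
    }


def non_native_score(indicators, text):
    """Compare an input's characteristics with the indicators.

    Features never seen in either corpus contribute nothing.
    """
    return sum(indicators.get(feat, 0.0) * count
               for feat, count in measure(text).items())


def looks_non_native(indicators, text, threshold=0.0):
    """Flag the input when its score exceeds an assumed threshold."""
    return non_native_score(indicators, text) > threshold


# Toy corpora, invented for this example only.
non_native_corpus = [
    "I am agree with you",
    "he explained me the problem",
    "I am agree with the plan",
]
native_corpus = [
    "I agree with you",
    "he explained the problem to me",
    "I agree with the plan",
]
indicators = build_indicators(non_native_corpus, native_corpus)
print(looks_non_native(indicators, "I am agree with this"))
print(looks_non_native(indicators, "I agree with you"))
```

A real system would need far larger corpora and a validated threshold; the toy data here only shows how measured characteristics of an input are compared against indicators learned from a corpus.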

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.