Patent · US Active

Device for automatically detecting morpheme part of speech tagging corpus error by using rough sets, and method therefor

US11074406B2 · kind B2 · utility

0Cited by
0References
10Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 29, 2017
Grant dateJul 27, 2021
Priority date
Expiry dateDec 14, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/268
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A device for detecting a morpheme tagging corpus error, of the present invention, includes: an attribute generating unit for generating attributes for word phrases included in an input corpus, by using a kernel to which a rough set theory is applied; and an attribute statistics processing unit for generating part-of-speech tagging corpus error data through the calculation of attributes and frequency count for the same word phrases by counting attributes for the same word phrase among the word phrases, and thus the present invention can detect, quantify, and modify errors included in a corpus (learning data) required in learning for classifier generation and recognition for natural language processing.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.