Patent · US Active

Method and system for generating natural language training data

US10217059B2 · kind B2 · utility

11Cited by
1References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 4, 2014
Grant dateFeb 26, 2019
Priority date
Expiry dateFeb 4, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/186
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Provided is a system, method and computer-readable medium for generating data that may be used to train models for a natural language processing application. A system architect creates a plurality of sentence patterns that include entity variables and initiates sentence generation. Each entity is associated with one or more entity data sources. A language generator accepts the sentence patterns as inputs, and references the various entity sources to create a plurality of generated sentences. The generated sentences may be associated with a particular class and therefore used to train one or more statistical classification models and entity extraction models for associated models. The sentence generated process may be initiated and controlled using a user interface displayable on a computing device, the user interface in communication with the language generator module.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.