Generating additional training data for a natural language understanding engine
US10140977B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 31, 2018 |
| Grant date | Nov 27, 2018 |
| Priority date | — |
| Expiry date | Jul 31, 2038 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/225
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating additional training data for a natural language understanding engine. One of the methods includes: obtaining data identifying (i) a first input conversational turn and (ii) a first annotation; determining that the first annotation accurately characterized the first input conversational turn; determining that the natural language understanding engine is likely to generate inaccurate annotations of other conversational turns that are similar to the first input conversational turn; in response to the determining, obtaining one or more first paraphrases of the first input conversational turn; generating, for each of the one or more first paraphrases, a respective first training example that identifies the first annotation as the correct annotation for the first paraphrase; and training the natural language understanding engine on at least the first training examples.
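The control flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `TrainingExample`, `build_paraphrase_examples`, `is_accurate`, `is_fragile`, and `paraphrase` are hypothetical, and the abstract does not say how accuracy is judged, how likely inaccuracy on similar turns is estimated, or how paraphrases are obtained, so all three are passed in as caller-supplied functions. The final training step is left to the caller.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class TrainingExample:
    text: str        # a paraphrase of the original conversational turn
    annotation: str  # the annotation treated as correct for that paraphrase


def build_paraphrase_examples(
    turn: str,
    annotation: str,
    is_accurate: Callable[[str, str], bool],      # hypothetical accuracy check
    is_fragile: Callable[[str], bool],            # hypothetical "likely wrong on similar turns" check
    paraphrase: Callable[[str], Iterable[str]],   # hypothetical paraphrase source
) -> List[TrainingExample]:
    """Return one training example per paraphrase, or [] if either check fails."""
    # Only annotations judged accurate are reused as labels.
    if not is_accurate(turn, annotation):
        return []
    # Only augment where the engine is expected to struggle on similar turns.
    if not is_fragile(turn):
        return []
    # Each paraphrase inherits the original annotation as its correct label.
    return [TrainingExample(text=p, annotation=annotation) for p in paraphrase(turn)]


# Toy stand-ins for the three checks, chosen for illustration only.
examples = build_paraphrase_examples(
    turn="book me a table for two",
    annotation="intent=reserve_table",
    is_accurate=lambda t, a: True,
    is_fragile=lambda t: True,
    paraphrase=lambda t: ["reserve a table for two", "I need a table for two people"],
)
```

The resulting `examples` list would then be added to whatever data the engine is trained on. Keeping the two checks as separate early returns mirrors the order of the steps in the abstract.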
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.