Patent · US Active

Intelligent training set augmentation for natural language processing tasks

US11599721B2 · kind B2 · utility

Cited by	0

References	0

Claims	20

Family size	0

Assignee

Salesforce, Inc. · US

Inventors

Shiva Kumar Pentyala · Mountain View, US
Mridul Gupta · San Mateo, US
Ankit Chadha · San Jose, US
Indira Iyer · Cupertino, US
Richard Socher · Menlo Park, US

Key dates

Filing date	Aug 25, 2020
Grant date	Mar 7, 2023
Priority date	—
Expiry date	May 26, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G06F40/30
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A natural language processing system that trains task models for particular natural language tasks programmatically generates additional utterances for inclusion in the training set, based on the existing utterances in the training set and the existing state of a task model as generated from the original (non-augmented) training set. More specifically, the training augmentation module 220 identifies specific textual units of utterances and generates variants of the utterances based on those identified units. The identification is based on determined importances of the textual units to the output of the task model, as well as on task rules that correspond to the natural language task for which the task model is being generated. The generation of the additional utterances improves the quality of the task model without the expense of manual labeling of utterances for training set inclusion.
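The flow the abstract describes (score textual units by their importance to the task model's output, filter them by task rules, then generate variants) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the abstract does not say how importance is computed, so leave-one-out scoring is swapped in here, and the keyword-weight "model", the stop-word rule, the synonym table, and every function name are hypothetical.

```python
from typing import Callable, Dict, List

# Hypothetical stand-in for a task model trained on the original training set:
# it scores a "refund" intent using fixed keyword weights.
KEYWORD_WEIGHTS: Dict[str, float] = {"refund": 0.6, "money": 0.3, "back": 0.1}

# Hypothetical replacement table used to generate variants.
SYNONYMS: Dict[str, List[str]] = {
    "refund": ["reimbursement", "repayment"],
    "money": ["cash"],
}

# Hypothetical task rule: function words are never selected for variation.
STOP_WORDS = {"i", "a", "my", "the", "want"}


def toy_model_score(tokens: List[str]) -> float:
    """Sum the keyword weights of the tokens (assumed toy model)."""
    return sum(KEYWORD_WEIGHTS.get(t, 0.0) for t in tokens)


def token_importances(
    tokens: List[str], score: Callable[[List[str]], float]
) -> List[float]:
    """Leave-one-out importance: the score drop when each token is removed."""
    base = score(tokens)
    return [base - score(tokens[:i] + tokens[i + 1:]) for i in range(len(tokens))]


def task_rule_allows(token: str) -> bool:
    """Assumed task rule: only non-stop-words may be varied."""
    return token not in STOP_WORDS


def augment(
    utterance: str,
    score: Callable[[List[str]], float] = toy_model_score,
    top_k: int = 1,
) -> List[str]:
    """Generate variants by replacing the top_k most important eligible tokens."""
    tokens = utterance.lower().split()
    importances = token_importances(tokens, score)
    # Keep tokens that pass the task rule and have a known replacement.
    eligible = [
        i for i, t in enumerate(tokens) if task_rule_allows(t) and t in SYNONYMS
    ]
    ranked = sorted(eligible, key=lambda i: importances[i], reverse=True)[:top_k]
    variants = []
    for i in ranked:
        for replacement in SYNONYMS[tokens[i]]:
            variants.append(" ".join(tokens[:i] + [replacement] + tokens[i + 1:]))
    return variants


if __name__ == "__main__":
    print(augment("I want a refund"))
    print(augment("I want my money back"))
```

In this sketch each generated variant would inherit the label of the utterance it came from, which is how augmentation of this kind avoids additional manual labeling. Whether the patented system varies the most important units or the least important ones is not stated in the abstract, so the choice made here is an assumption.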

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.