Patent · US Active

Stop word data augmentation for natural language processing

US11651768B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 9, 2020
Grant dateMay 16, 2023
Priority date
Expiry dateFeb 24, 2041

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L51/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for stop word data augmentation for training chatbot systems in natural language processing. In one particular aspect, a computer-implemented method includes receiving a training set of utterances for training an intent classifier to identify one or more intents for one or more utterances; augmenting the training set of utterances with stop words to generate an augmented training set of out-of-domain utterances for an unresolved intent category corresponding to an unresolved intent; and training the intent classifier using the training set of utterances and the augmented training set of out-of-domain utterances. The augmenting includes: selecting one or more utterances from the training set of utterances, and for each selected utterance, preserving existing stop words within the utterance and replacing at least one non-stop word within the utterance with a stop word or stop word phrase selected from a list of stop words to generate an out-of-domain utterance.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.