Patent · US Active

System and method for reinforcement learning based controlled natural language generation

US11586830B2 · kind B2 · utility

3Cited by
0References
13Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 3, 2020
Grant dateFeb 21, 2023
Priority date
Expiry dateMay 26, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system for reinforcement learning based controlled natural language generation is disclosed. The system includes a token generator subsystem to generate an initial output phrase including a sequence of output tokens. The system includes trained models associated with corresponding predefined tasks. Each trained model includes an attention layer to compute attention-based weights for each output token. The trained models include a scoring layer to generate a phrase sequence level score for the output phrase. The trained models include a reward generation layer to generate dense rewards for each output token based on the attention-based weights and the phrase sequence level score. The trained models include a feedback score generation layer to generate a feedback score based on the dense rewards and reward weights assigned to the dense rewards of the corresponding trained models. The feedback score generation layer provides the feedback score iteratively to the token generator subsystem.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.