Patent · US Active

System and method for reinforcement learning based controlled natural language generation

US11586830B2 · kind B2 · utility

3Cited by

0References

13Claims

0Family size

Assignee

PM Labs, Inc. · US

Inventors

Arjun Maheswaran · Mountain View, US
Akhilesh Sudhakar · Chennai, IN
Bhargav Upadhyay · Mangrol, IN

Key dates

Filing date	Jun 3, 2020
Grant date	Feb 21, 2023
Priority date	—
Expiry date	May 26, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG06N20/00
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A system for reinforcement learning based controlled natural language generation is disclosed. The system includes a token generator subsystem to generate an initial output phrase including a sequence of output tokens. The system includes trained models associated with corresponding predefined tasks. Each trained model includes an attention layer to compute attention-based weights for each output token. The trained models include a scoring layer to generate a phrase sequence level score for the output phrase. The trained models include a reward generation layer to generate dense rewards for each output token based on the attention-based weights and the phrase sequence level score. The trained models include a feedback score generation layer to generate a feedback score based on the dense rewards and reward weights assigned to the dense rewards of the corresponding trained models. The feedback score generation layer provides the feedback score iteratively to the token generator subsystem.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.