Patent · US Active

Reinforcement learning for delivery of content

US12096082B2 · kind B2 · utility

0Cited by

1References

20Claims

0Family size

Assignee

HULU, LLC · US

Inventors

Pengfei Gao · Shanghai, CN
Dingming Wu · Beijing, CN
Chunyang Wei · Beijing, CN
Xiaohui Xie · Beijing, CN
Shulei Ma · Beijing, CN

Key dates

Filing date	Dec 6, 2022
Grant date	Sep 17, 2024
Priority date	—
Expiry date	Dec 6, 2042

Classification

Technology area (CPC H)Electricity
CPC primaryH04N21/812
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

In some embodiments, a method determines a reward metric based on feedback for an instance of content. A delivery status for a delivery constraint of the instance of content is applied to the reward metric to generate a constrained reward metric. The method uses the constrained reward metric to train a model. The model is used to select from a plurality of instances of content. One of the plurality of instances of content is selected for delivery using the model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.