Reinforcement learning for guaranteed delivery of supplemental content
US11546665B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 7, 2021 |
| Grant date | Jan 3, 2023 |
| Priority date | — |
| Expiry date | May 7, 2041 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04N21/812
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
In some embodiments, a method receives a request for supplemental content to be provided in association with main content. The method selects an instance of supplemental content based on a long-term reward metric and a short-term reward metric. The long-term reward metric is based on feedback from delivery of a plurality of instances of supplemental content and a delivery status for a delivery constraint of one instance of supplemental content. The short-term reward metric is based on feedback from delivery of the one instance of supplemental content. The long-term reward metric is based on feedback from delivery of a plurality of instances of supplemental content and the short-term reward metric is based on feedback from delivery of one instance of supplemental content. The instance of supplemental content is provided to a client device.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.