Systems and methods for determining actions depicted in media contents based on attention weights of media content frames
US11055537B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 5, 2016 |
| Grant date | Jul 6, 2021 |
| Priority date | — |
| Expiry date | Sep 5, 2038 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V2201/07
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
There is provided a system comprising a label database including a plurality of labels, a non-transitory memory storing an executable code, and a hardware processor executing the executable code to receive a media content including a plurality of segments, each segment including a plurality of frames, extract a first plurality of features from a segment, extract a second plurality of features from each frame of the segment, determine an attention weight for each frame of the segment based on the first plurality of features extracted from the segment and the second plurality of features extracted from the segment, and determine that the segment depicts one of the plurality of labels in the label database based on the first plurality of features, the second plurality of features, and the attention weight of each frame of the plurality of frames of the segment.
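The flow in the abstract (segment-level features, per-frame features, one attention weight per frame, then a label decision) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the patent: the dot-product scoring, the softmax normalization, the linear label scorer, and the names `attention_weights`, `classify_segment`, and `label_weights` are all hypothetical. The abstract does not say how the features are extracted or how the weights are computed.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attention_weights(segment_feats, frame_feats):
    """One weight per frame, from segment-level and frame-level features.

    Assumption: a frame is scored by its dot product with the segment
    features, and the scores are normalized with a softmax.
    """
    return softmax([dot(segment_feats, f) for f in frame_feats])


def classify_segment(segment_feats, frame_feats, label_weights):
    """Return (best_label, per_frame_weights) for one segment.

    label_weights (hypothetical) maps each label to a weight vector over
    the concatenation [segment features, attention-pooled frame features].
    """
    weights = attention_weights(segment_feats, frame_feats)
    dim = len(frame_feats[0])
    # Attention-weighted sum of the frame features.
    pooled = [sum(w * f[i] for w, f in zip(weights, frame_feats))
              for i in range(dim)]
    combined = list(segment_feats) + pooled
    scores = {label: dot(vec, combined)
              for label, vec in label_weights.items()}
    return max(scores, key=scores.get), weights


# Toy inputs: 2-dimensional features, three frames, two labels.
segment = [1.0, 0.0]
frames = [[0.0, 1.0], [2.0, 0.0], [0.0, 1.0]]
labels = {"jump": [0.0, 0.0, 1.0, 0.0], "walk": [0.0, 0.0, 0.0, 1.0]}

label, weights = classify_segment(segment, frames, labels)
print(label)  # jump
```

In this toy input the middle frame aligns most closely with the segment features, so it receives the largest attention weight and dominates the pooled representation that drives the label decision.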
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.