Fusing multilayer and multimodal deep neural networks for video classification
US10402697B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 26, 2017 |
| Grant date | Sep 3, 2019 |
| Priority date | — |
| Expiry date | Oct 9, 2037 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V20/46
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method, computer-readable medium, and system are disclosed for classifying video image data. The method includes processing training video image data by at least a first layer of a convolutional neural network (CNN) to extract a first set of feature maps and generate classification output data for the training video image data. Spatial classification accuracy data is computed based on the classification output data and target classification output data, and spatial discrimination factors for the first layer are computed based on the spatial classification accuracy data and the first set of feature maps.
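The abstract does not give formulas, so the following is only an illustrative sketch of the data flow it describes, not the patented method. It assumes per-location class scores are available for each training sample, takes spatial accuracy to be the fraction of samples classified correctly at each location, and uses a hypothetical weighting (accuracy times mean absolute activation, normalized to sum to 1) for the discrimination factors. The function names, array layouts, and the weighting rule are all assumptions made for illustration.

```python
import numpy as np


def spatial_accuracy(logits, targets):
    """Per-location classification accuracy (assumed definition).

    logits:  (N, H, W, C) class scores at each spatial location
    targets: (N,) integer target class per training sample
    Returns an (H, W) array of fractions in [0, 1].
    """
    preds = logits.argmax(axis=-1)               # (N, H, W)
    correct = preds == targets[:, None, None]    # broadcast targets over H, W
    return correct.mean(axis=0)


def spatial_discrimination_factors(feature_maps, accuracy, eps=1e-8):
    """Hypothetical discrimination factors for one layer.

    feature_maps: (N, H, W, K) activations from the layer
    accuracy:     (H, W) output of spatial_accuracy
    Weights each location by accuracy times mean absolute activation,
    then normalizes so the factors sum to (approximately) 1.
    """
    energy = np.abs(feature_maps).mean(axis=(0, 3))  # (H, W)
    raw = accuracy * energy
    return raw / (raw.sum() + eps)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feats = rng.standard_normal((8, 4, 4, 16))   # toy feature maps
    scores = rng.standard_normal((8, 4, 4, 5))   # toy per-location scores
    labels = rng.integers(0, 5, size=8)          # toy target classes
    acc = spatial_accuracy(scores, labels)
    factors = spatial_discrimination_factors(feats, acc)
    print(acc.shape, factors.shape)
```

Under these assumptions, locations where the per-location classifier is more often correct and where activations are stronger receive larger factors; any other monotone combination of accuracy and feature-map statistics would fit the abstract's wording equally well.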
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.