Patent · US Active

Fusing multilayer and multimodal deep neural networks for video classification

US10402697B2 · kind B2 · utility

9Cited by

0References

20Claims

0Family size

Assignee

NVIDIA Corporation · US

Inventors

Xiaodong Yang · New York, US
Pavlo Molchanov · Santa Clara, US
Jan Kautz · Lexington, US

Key dates

Filing date	Jul 26, 2017
Grant date	Sep 3, 2019
Priority date	—
Expiry date	Oct 9, 2037

Classification

Technology area (CPC G)Physics
CPC primaryG06V20/46
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method, computer readable medium, and system are disclosed for classifying video image data. The method includes the steps of processing training video image data by at least a first layer of a convolutional neural network (CNN) to extract a first set of feature maps and generate classification output data for the training video image data. Spatial classification accuracy data is computed based on the classification output data and target classification output data and spatial discrimination factors for the first layer are computed based on the spatial classification accuracies and the first set of feature maps.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.