Patent · US Active

Receptive-field-conforming convolutional models for video coding

US10869036B2 · kind B2 · utility

5Cited by

5References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Claudionor Coelho · Felixlândia, BR
Aki Kuusela · Palo Alto, US
Shan Li · Edison, US
Dake He · Waterloo, CA

Key dates

Filing date	Sep 18, 2018
Grant date	Dec 15, 2020
Priority date	—
Expiry date	Oct 31, 2038

Classification

Technology area (CPC H)Electricity
CPC primaryH04N19/19
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

A convolutional neural network (CNN) for determining a partitioning of a block is disclosed. The block is of size N×N and a smallest partition is of size S×S. The CNN includes feature extraction layers; a concatenation layer that receives, from the feature extraction layers, first feature maps of the block, where each first feature map is of size S×S; and classifiers. Each classifier includes classification layers, each classification layer receives second feature maps having a respective feature dimension. Each classifier is configured to infer partition decisions for sub-blocks of size (αS)×(αS) of the block, wherein α is a power of 2 and α=2, . . . , N/S, by: applying, at some of successive classification layers of the classification layers, a kernel of size 1×1 to reduce the respective feature dimension in half; and outputting by a last layer of the classification layers an output corresponding to a N/(αS)×N/(αS)×1 output map.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.