Patent · US Active

Neural face video compression using multiple views

US11856203B1 · kind B1 · utility

1Cited by

0References

20Claims

0Family size

Assignee

Apple Inc. · US

Inventors

Michael Tobias Tschannen · Zürich, CH
Ali Benlalah · Dietikon, CH
Anna Volokitin · Zürich, CH
Brian Amberg · Zollikon, CH
Sebastian Martin · Zürich, CH
Stefan Brugger · Basel, CH

Key dates

Filing date	Mar 22, 2022
Grant date	Dec 26, 2023
Priority date	—
Expiry date	Mar 22, 2042

Classification

Technology area (CPC H)Electricity
CPC primaryH04N19/42
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

Advances in deep generative models (DGM) have led to the development of neural face video compression codecs that are capable of using an order of magnitude less data than “traditional” engineered codecs. These “neural” codecs can reconstruct a target image by warping a source image to approximate the content of the target image and using a DGM to compensate for imperfections in the warped source image. The determined warping operation may be encoded and transmitted using less data (e.g., transmitting a small number of keypoints, rather than a dense flow field), leading to the bandwidth savings compared to traditional codecs. However, by relying on a single source image only, these methods can lead to inaccurate reconstructions. The techniques presented herein improve image reconstruction quality while maintaining bandwidth savings, via a combination of using multiple source images (i.e., containing multiple views of the first human subject) and novel feature aggregation techniques.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.