Seamless scalable decoding of channels, objects, and HOA audio content
US12380904B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 10, 2021 |
| Grant date | Aug 5, 2025 |
| Priority date | — |
| Expiry date | Jul 26, 2042 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04S2420/11
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed are methods and systems for decoding immersive audio content encoded by an adaptive number of scene elements for channels, audio objects, higher-order ambisonics (HOA), and/or other sound field representations. The decoded audio is rendered to the speaker configuration of a playback device. For bit streams that represent audio scenes with a different mixture of channels, objects, and/or HOA in consecutive frames, fade-in of the new frame and fade-out of the old frame may be performed. Crossfading between consecutive frames happen in the speaker layout after rendering, in the spatially decoded content type before rendering, or between the transport channels as the output of the baseline decoder but before spatial decoding and rendering. Crossfading may use an immediate fade-in and fade-out frame (IFFF) for the transition frame or may use an overlap-add synthesis technique such as time-domain aliasing cancellation (TDAC) of MDCT.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.