Patent · US Active

Microphone array based deep learning for time-domain speech signal extraction

US11508388B1 · kind B1 · utility

3Cited by

0References

20Claims

0Family size

Assignee

Apple Inc. · US

Inventors

Mehrez Souden · Los Angeles, US
Symeon Delikaris Manias · Culver City, US
Joshua D. Atkins · Los Angeles, US
Ante Jukic · Los Angeles, US
Ramin Pishehvar · Los Angeles, US

Key dates

Filing date	Nov 20, 2020
Grant date	Nov 22, 2022
Priority date	—
Expiry date	Feb 2, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG10L2021/02166
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A device for processing audio signals in a time-domain includes a processor configured to receive multiple audio signals corresponding to respective microphones of at least two or more microphones of the device, at least one of the multiple audio signals comprising speech of a user of the device. The processor is configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on an expected position of the user of the device and expected positions of the respective microphones on the device. The processor is configured to provide an audio signal that is enhanced with respect to the speech of the user relative to the multiple audio signals, wherein the audio signal is a waveform output from the machine learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.