Patent · US Active

Microphone array based deep learning for time-domain speech signal extraction

US11508388B1 · kind B1 · utility

3Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 20, 2020
Grant dateNov 22, 2022
Priority date
Expiry dateFeb 2, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/02166
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A device for processing audio signals in a time-domain includes a processor configured to receive multiple audio signals corresponding to respective microphones of at least two or more microphones of the device, at least one of the multiple audio signals comprising speech of a user of the device. The processor is configured to provide the multiple audio signals to a machine learning model, the machine learning model having been trained based at least in part on an expected position of the user of the device and expected positions of the respective microphones on the device. The processor is configured to provide an audio signal that is enhanced with respect to the speech of the user relative to the multiple audio signals, wherein the audio signal is a waveform output from the machine learning model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.