Patent · US Active

Knowledge transfer in permutation invariant training for single-channel multi-talker speech recognition

US10699697B2 · kind B2 · utility

52Cited by
10References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 29, 2018
Grant dateJun 30, 2020
Priority date
Expiry dateAug 17, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/063
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Provided are a speech recognition training processing method and an apparatus including the same. The speech recognition training processing method includes acquiring a multi-talker mixed speech signal from a plurality of speakers, performing permutation invariant training (PIT) model training on the multi-talker mixed speech signal based on knowledge from a single-talker speech recognition model and updating a multi-talker speech recognition model based on a result of the PIT model training.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.