Patent · US Active

System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification

US9208778B2 · kind B2 · utility

Cited by: 7
References: 5
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Nov 10, 2014
Grant date: Dec 8, 2015
Priority date
Expiry date: Nov 10, 2034

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/16
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A frame processor unit receives an input and extracts the time-dependent features from the input. A plurality of pooling interface units generates a plurality of feature vectors based on pooling the time-dependent features and selecting a plurality of time-dependent features according to a plurality of selection strategies. Next, a plurality of segmental classification units generates scores for the feature vectors. Each segmental classification unit (SCU) can be dedicated to a specific pooling interface unit (PIU) to form a PIU-SCU combination. Multiple PIU-SCU combinations can be further combined to form an ensemble of combinations, and the ensemble can be diversified by varying the pooling operations used by the PIU-SCU combinations. Based on the scores, the plurality of segmental classification units selects a class label and returns a result.
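
The data flow described in the abstract can be illustrated with a small sketch. This is not the patented implementation: the frame features (mean and energy), the three pooling strategies, the linear scorers, their weights, and the labels "aa" and "s" are all hypothetical choices made only to show how frame-level features might be pooled into segment-level vectors, scored by a dedicated classifier per pooling unit, and combined as an ensemble.

```python
# Illustrative sketch only; every name, weight, and label below is hypothetical.

def frame_features(signal, frame_len=4):
    """Frame processor: one [mean, energy] feature vector per non-overlapping frame."""
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len + 1, frame_len)]
    return [[sum(f) / len(f), sum(x * x for x in f) / len(f)] for f in frames]

# Pooling interface units (PIUs): each maps a variable number of frames
# to one fixed-length vector, using a different selection strategy.
def pool_mean(frames):
    return [sum(col) / len(frames) for col in zip(*frames)]

def pool_max(frames):
    return [max(col) for col in zip(*frames)]

def pool_center(frames):
    """Average only the middle third of the segment."""
    third = max(1, len(frames) // 3)
    start = (len(frames) - third) // 2
    return pool_mean(frames[start:start + third])

def make_scu(weights):
    """Segmental classification unit (SCU): a toy linear scorer per class label."""
    def score(vec):
        return {label: sum(w * x for w, x in zip(ws, vec))
                for label, ws in weights.items()}
    return score

# Each SCU is dedicated to one PIU, forming a PIU-SCU combination.
COMBOS = [
    (pool_mean,   make_scu({"aa": [1.0, 0.0], "s": [0.0, 1.0]})),
    (pool_max,    make_scu({"aa": [0.5, 0.5], "s": [1.0, -1.0]})),
    (pool_center, make_scu({"aa": [1.0, 1.0], "s": [0.5, 0.5]})),
]

def classify(signal, combos=COMBOS):
    """Sum the scores of all PIU-SCU combinations and return the best label."""
    frames = frame_features(signal)
    totals = {}
    for pool, scu in combos:
        for label, s in scu(pool(frames)).items():
            totals[label] = totals.get(label, 0.0) + s
    return max(totals, key=totals.get)

SIGNAL = [0.5] * 4 + [1.0] * 4 + [0.5] * 4
LABEL = classify(SIGNAL)
```

Summing scores is only one way to combine the ensemble; the abstract does not specify the combination rule, so a weighted sum or a vote would fit the same structure.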

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.