Device, method, and program for analyzing speech signal
US11798579B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 19, 2019 |
| Grant date | Oct 24, 2023 |
| Priority date | — |
| Expiry date | Jun 21, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/75
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A parameter included in the fundamental frequency pattern of a voice can be estimated from that pattern with high accuracy, and the fundamental frequency pattern can be reconstructed from the parameter. A learning unit 30 learns a deep generation model on the basis of parallel data of the fundamental frequency pattern in a voice signal and the parameter included in that pattern. The model regards the parameter as its latent variable and includes an encoder, which estimates the latent variable from the fundamental frequency pattern in the voice signal, and a decoder, which reconstructs the fundamental frequency pattern in the voice signal from the latent variable.
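The encoder/decoder arrangement in the abstract can be illustrated with a toy sketch. This is not the patented implementation. It assumes synthetic data, and it swaps the deep networks for plain least-squares linear maps so the example stays short and deterministic. The names `basis`, `fit_linear`, and `apply_linear`, and the sizes `T`, `K`, and `N`, are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy setup: T-frame F0 patterns generated from K parameters.
T, K, N = 50, 4, 500
basis = rng.normal(size=(K, T))   # assumed synthetic generator, not from the patent
params = rng.normal(size=(N, K))  # parameters, treated as the latent variable
f0 = params @ basis + 0.01 * rng.normal(size=(N, T))  # paired F0 patterns


def fit_linear(inputs, targets):
    """Least-squares fit of targets ~ [inputs, 1] @ weights."""
    design = np.hstack([inputs, np.ones((inputs.shape[0], 1))])
    weights, *_ = np.linalg.lstsq(design, targets, rcond=None)
    return weights


def apply_linear(weights, inputs):
    """Apply a fitted linear map (with bias column) to new inputs."""
    design = np.hstack([inputs, np.ones((inputs.shape[0], 1))])
    return design @ weights


# Stand-in for the learning unit: fit both directions from the parallel data.
encoder = fit_linear(f0, params)  # F0 pattern -> estimated parameter
decoder = fit_linear(params, f0)  # parameter -> reconstructed F0 pattern

# Held-out pairs drawn from the same synthetic generator.
test_params = rng.normal(size=(100, K))
test_f0 = test_params @ basis + 0.01 * rng.normal(size=(100, T))

est_params = apply_linear(encoder, test_f0)   # estimate the latent variable
recon_f0 = apply_linear(decoder, est_params)  # reconstruct the pattern from it

param_rmse = float(np.sqrt(np.mean((est_params - test_params) ** 2)))
recon_rmse = float(np.sqrt(np.mean((recon_f0 - test_f0) ** 2)))
print(f"parameter RMSE: {param_rmse:.4f}, reconstruction RMSE: {recon_rmse:.4f}")
```

Because the toy generator is linear, both maps recover their targets almost exactly. A real fundamental frequency pattern depends on its parameters nonlinearly, which is presumably why the abstract calls for a deep generation model rather than a linear fit.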
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.