Speech synthesis method and apparatus, computer device and readable medium
US10825444B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 7, 2018 |
| Grant date | Nov 3, 2020 |
| Priority date | — |
| Expiry date | Mar 27, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L13/047
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present disclosure provides a speech synthesis method and apparatus, a computer device and a readable medium. The method comprises: when problematic speech appears in speech splicing and synthesis, predicting a time length of a state of each phoneme corresponding to a target text corresponding to the problematic speech and a base frequency of each frame, according to pre-trained time length predicting model and base frequency predicting model; according to the time length of the state of each phoneme corresponding to the target text and the base frequency of each frame, using a pre-trained speech synthesis model to synthesize speech corresponding to the target text; wherein the time length predicting model, the base frequency predicting model and the speech synthesis model are all obtained by training based on a speech library resulting from speech splicing and synthesis. The technical solution of the present disclosure may avoid complementarily recording language materials and re-building a library, effectively shorten the time for repair of the problematic speech, and save the repair costs of the problematic problem; it may be ensured that naturalness and continuity of the sy…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.