Patent · US Active

Speech synthesis method and apparatus, computer device and readable medium

US10825444B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 6
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 7, 2018
Grant date: Nov 3, 2020
Priority date
Expiry date: Mar 27, 2039

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L13/047
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The present disclosure provides a speech synthesis method and apparatus, a computer device and a readable medium. The method comprises: when problematic speech appears in speech splicing and synthesis, predicting, according to a pre-trained time length predicting model and a pre-trained base frequency predicting model, a time length of a state of each phoneme corresponding to the target text of the problematic speech and a base frequency of each frame; and, according to the time length of the state of each phoneme corresponding to the target text and the base frequency of each frame, using a pre-trained speech synthesis model to synthesize speech corresponding to the target text; wherein the time length predicting model, the base frequency predicting model and the speech synthesis model are all obtained by training based on a speech library resulting from speech splicing and synthesis. The technical solution of the present disclosure may avoid recording supplementary language materials and rebuilding the library, effectively shorten the time needed to repair the problematic speech, and reduce the repair costs of the problematic speech; it may be ensured that naturalness and continuity of the sy…
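
The data flow described in the abstract can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the patented implementation: the function names, the three model interfaces, the frame shift, the sample rate, and the number of states per phoneme are all hypothetical, and the stub models stand in for models that the abstract says are trained on the speech library resulting from speech splicing and synthesis.

```python
import math
from typing import Callable, List, Sequence

# Assumed constants; the abstract does not specify any of these values.
FRAME_SHIFT_MS = 5        # hypothetical frame shift in milliseconds
SAMPLE_RATE = 16000       # hypothetical output sample rate in Hz
STATES_PER_PHONEME = 5    # hypothetical number of states per phoneme

# Hypothetical interfaces for the three pre-trained models.
DurationModel = Callable[[Sequence[str]], List[List[int]]]
F0Model = Callable[[Sequence[str], List[List[int]]], List[float]]
SynthesisModel = Callable[[Sequence[str], List[List[int]], List[float]], List[float]]


def repair_problematic_speech(
    phonemes: Sequence[str],
    duration_model: DurationModel,
    f0_model: F0Model,
    synthesis_model: SynthesisModel,
) -> List[float]:
    """Re-synthesize the target text of a problematic spliced utterance."""
    # Step 1: predict the time length (in frames) of each state of each phoneme.
    state_durations = duration_model(phonemes)
    # Step 2: predict the base frequency (Hz) of each frame.
    f0_per_frame = f0_model(phonemes, state_durations)
    total_frames = sum(sum(states) for states in state_durations)
    if len(f0_per_frame) != total_frames:
        raise ValueError("base frequency track must have one value per frame")
    # Step 3: synthesize speech from the predicted durations and base frequency.
    return synthesis_model(phonemes, state_durations, f0_per_frame)


# Stub models (placeholders for trained models, used only to run the sketch).
def stub_duration_model(phonemes: Sequence[str]) -> List[List[int]]:
    # Every state of every phoneme lasts 2 frames.
    return [[2] * STATES_PER_PHONEME for _ in phonemes]


def stub_f0_model(phonemes: Sequence[str], state_durations: List[List[int]]) -> List[float]:
    # Constant 120 Hz base frequency on every frame.
    total_frames = sum(sum(states) for states in state_durations)
    return [120.0] * total_frames


def stub_synthesis_model(
    phonemes: Sequence[str],
    state_durations: List[List[int]],
    f0_per_frame: List[float],
) -> List[float]:
    # Renders a sine tone that follows the base frequency track; a real
    # synthesis model would generate speech rather than a tone.
    samples_per_frame = SAMPLE_RATE * FRAME_SHIFT_MS // 1000
    samples: List[float] = []
    phase = 0.0
    for f0 in f0_per_frame:
        for _ in range(samples_per_frame):
            phase += 2.0 * math.pi * f0 / SAMPLE_RATE
            samples.append(math.sin(phase))
    return samples


if __name__ == "__main__":
    waveform = repair_problematic_speech(
        ["n", "i", "h", "ao"],
        stub_duration_model,
        stub_f0_model,
        stub_synthesis_model,
    )
    print(len(waveform))
```

Passing the models in as callables keeps the three stages separable, which mirrors the abstract's wording: time length and base frequency are predicted first, and the synthesis model consumes both predictions.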

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.