Patent · US Active

Speech synthesis method and apparatus, computer device and readable medium

US10825444B2 · kind B2 · utility

0Cited by

0References

6Claims

0Family size

Assignee

BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD. · CN

Inventors

Yu Gu · Tama, JP
Xiaohui Sun · Beijing, CN

Key dates

Filing date	Dec 7, 2018
Grant date	Nov 3, 2020
Priority date	—
Expiry date	Mar 27, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L13/047
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The present disclosure provides a speech synthesis method and apparatus, a computer device and a readable medium. The method comprises: when problematic speech appears in speech splicing and synthesis, predicting a time length of a state of each phoneme corresponding to a target text corresponding to the problematic speech and a base frequency of each frame, according to pre-trained time length predicting model and base frequency predicting model; according to the time length of the state of each phoneme corresponding to the target text and the base frequency of each frame, using a pre-trained speech synthesis model to synthesize speech corresponding to the target text; wherein the time length predicting model, the base frequency predicting model and the speech synthesis model are all obtained by training based on a speech library resulting from speech splicing and synthesis. The technical solution of the present disclosure may avoid complementarily recording language materials and re-building a library, effectively shorten the time for repair of the problematic speech, and save the repair costs of the problematic problem; it may be ensured that naturalness and continuity of the sy…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.