Patent · US Active

Inaudible watermark enabled text-to-speech framework

US11138964B2 · kind B2 · utility

2 Cited by

5 References

20 Claims

0 Family size

Assignee

BAIDU USA LLC · US

Inventors

Wei Ping · Sunnyvale, US
Zhenyu Zhong · Tseung Kwan O, CN
Yueqiang Cheng · Sunnyvale, US
Xing Li · Webster, US
Tao Wei · London, GB

Key dates

Filing date	Oct 21, 2019
Grant date	Oct 5, 2021
Priority date	—
Expiry date	Oct 21, 2039

Classification

Technology area (CPC G)	Physics
CPC primary	G10L25/30
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

According to various embodiments, an end-to-end text-to-speech (TTS) framework can integrate a watermarking process into the training of the TTS framework, which enables watermarks to be imperceptible within a synthesized/cloned audio segment generated by the TTS framework. The watermarks added in such a manner are statistically undetectable to prevent unauthorized removal. According to an exemplary method of training the TTS framework, a TTS neural network model and a watermarking neural network model in the TTS framework are trained in an end-to-end manner, with the watermarking being part of the optimization process of the TTS framework. During the training, neuron values of the TTS neural network model are adjusted based on training data to prepare one or more spaces for adding a watermark in a synthesized audio segment to be generated by the TTS framework.
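
The joint optimization described in the abstract can be illustrated with a toy sketch. This is not the patented architecture: the linear "synthesizer" `W`, the linear "detector" `d`, the weighting `lam`, the synthetic data, and the function names are all assumptions made for illustration. It only shows the general idea of one objective that combines an audio reconstruction term with a watermark recovery term, so that both sets of parameters are updated together.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def joint_loss(audio, target, logits, bits, lam):
    # Reconstruction term (audio quality) plus weighted watermark term.
    rec = np.mean((audio - target) ** 2)
    p = np.clip(sigmoid(logits), 1e-9, 1 - 1e-9)
    wm = -np.mean(bits * np.log(p) + (1 - bits) * np.log(1 - p))
    return rec + lam * wm

def train_jointly(steps=500, lr=0.1, lam=0.5, seed=0):
    # Hypothetical toy data: n utterances, k input features, m audio samples.
    rng = np.random.default_rng(seed)
    n, k, m = 64, 4, 8
    feats = rng.normal(size=(n, k))
    bits = rng.integers(0, 2, size=n).astype(float)   # watermark bit per utterance
    target = feats @ (0.5 * rng.normal(size=(k, m)))  # reference "audio"
    X = np.column_stack([feats, bits])                # synthesizer sees the bit too

    W = 0.1 * rng.normal(size=(k + 1, m))  # stand-in for the TTS model
    d = 0.1 * rng.normal(size=m)           # stand-in for the watermark detector
    history = []
    for _ in range(steps):
        audio = X @ W
        logits = audio @ d
        history.append(joint_loss(audio, target, logits, bits, lam))

        # Gradients of the single combined loss with respect to both models.
        g_logits = lam * (sigmoid(logits) - bits) / n
        g_audio = 2.0 * (audio - target) / (n * m) + np.outer(g_logits, d)
        W -= lr * (X.T @ g_audio)
        d -= lr * (audio.T @ g_logits)
    return W, d, history
```

Calling `W, d, history = train_jointly()` returns the two parameter sets and the per-step loss. Because the watermark term back-propagates into `W`, the synthesizer itself is shaped to carry the bit, rather than having a watermark added to finished audio afterwards.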

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.