Patent · US Active

Inaudible watermark enabled text-to-speech framework

US11138964B2 · kind B2 · utility

2Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 21, 2019
Grant dateOct 5, 2021
Priority date
Expiry dateOct 21, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

According to various embodiments, an end-to-end TTS framework can integrate a watermarking process into the training of the TTS framework, which enables watermarks to be imperceptible within a synthesized/cloned audio segment generated by the TTS framework. The watermarks added in such a matter are statistically undetectable to prevent authorized removal. According to an exemplary method of training the TTS framework, a TTS neural network model and a watermarking neural network mode in the TTS framework are trained in an end to end manner, with the watermarking being part of the optimization process of the TTS framework. During the training, neuron values of the TTS neural network model are adjusted based on training data to prepare one or more spaces for adding a watermark in a synthesized audio segment to be generated by the TTS framework.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.