Inaudible watermark enabled text-to-speech framework
US11138964B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 21, 2019 |
| Grant date | Oct 5, 2021 |
| Priority date | — |
| Expiry date | Oct 21, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/30
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
According to various embodiments, an end-to-end TTS framework can integrate a watermarking process into the training of the TTS framework, which enables watermarks to be imperceptible within a synthesized/cloned audio segment generated by the TTS framework. The watermarks added in such a manner are statistically undetectable to prevent unauthorized removal. According to an exemplary method of training the TTS framework, a TTS neural network model and a watermarking neural network model in the TTS framework are trained in an end-to-end manner, with the watermarking being part of the optimization process of the TTS framework. During the training, neuron values of the TTS neural network model are adjusted based on training data to prepare one or more spaces for adding a watermark in a synthesized audio segment to be generated by the TTS framework.
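
The joint optimization described in the abstract can be illustrated with a toy sketch. This is not the patented method: the two "networks" are replaced by tiny linear stand-ins, the detector is a simple key correlation, and gradients are estimated by finite differences instead of backpropagation. All names here (`KEY`, `TARGET`, `synthesize`, `embed`, `detect`, `joint_loss`, `train`) and the values of `lam`, `margin`, and `lr` are assumptions made for illustration. The only point it demonstrates is that one loss covers both audio fidelity and watermark recoverability, so the synthesis parameters and the watermark parameters are updated together.

```python
# Toy sketch only: linear stand-ins for the TTS and watermarking networks.
KEY = [1, -1, 1, 1, -1, -1, 1, -1]                     # hypothetical secret +/-1 pattern
TARGET = [0.5, -0.3, 0.8, 0.1, -0.6, 0.4, -0.2, 0.7]   # toy reference audio samples
TEXT = [1.0] * len(TARGET)                             # toy text features
N = len(TARGET)

def synthesize(tts_w):
    """Stand-in TTS model: one learned weight per output sample."""
    return [w * x for w, x in zip(tts_w, TEXT)]

def embed(audio, wm_gain, bit):
    """Stand-in watermark model: add a learned, keyed perturbation for one bit."""
    return [a + g * bit * k for a, g, k in zip(audio, wm_gain, KEY)]

def detect(audio):
    """Correlate with the key; the sign of the score is the decoded bit."""
    return sum(a * k for a, k in zip(audio, KEY)) / N

def joint_loss(params, lam=20.0, margin=0.05):
    """Fidelity loss plus a watermark-decoding penalty, averaged over both bits."""
    tts_w, wm_gain = params[:N], params[N:]
    clean = synthesize(tts_w)
    total = 0.0
    for bit in (1, -1):
        marked = embed(clean, wm_gain, bit)
        fidelity = sum((m - t) ** 2 for m, t in zip(marked, TARGET)) / N
        miss = max(0.0, margin - bit * detect(marked))  # zero once the bit decodes with margin
        total += 0.5 * (fidelity + lam * miss ** 2)
    return total

def train(steps=500, lr=0.2, h=1e-5):
    """Gradient descent on the joint loss using central finite differences."""
    params = [0.0] * (2 * N)  # TTS weights followed by watermark gains
    for _ in range(steps):
        grad = []
        for i in range(len(params)):
            up = params[:i] + [params[i] + h] + params[i + 1:]
            down = params[:i] + [params[i] - h] + params[i + 1:]
            grad.append((joint_loss(up) - joint_loss(down)) / (2 * h))
        params = [p - lr * g for p, g in zip(params, grad)]
    return params[:N], params[N:]

tts_w, wm_gain = train()
clean = synthesize(tts_w)
score_pos = detect(embed(clean, wm_gain, 1))
score_neg = detect(embed(clean, wm_gain, -1))
print("score for bit +1:", score_pos)
print("score for bit -1:", score_neg)
```

In this toy setting the synthesis weights themselves shift during training so that the unmarked output correlates less with the key than the reference audio does, which loosely mirrors the abstract's statement that the TTS model's values are adjusted to prepare space for the watermark. The sketch does not attempt to model imperceptibility or statistical undetectability.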
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.