Systems and methods for reconstructing voice packets using natural language generation during signal loss
US12334048B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 12, 2022 |
| Grant date | Jun 17, 2025 |
| Priority date | — |
| Expiry date | Aug 25, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/18
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A device may receive and convert audio data to text data in real-time, and may detect a network fluctuation that causes missing voice packets. The device may process partial text and context of the text data, with a model, to generate a new phrase, and may generate a response phoneme for the new phrase. The device may utilize a text embedding model to generate a text embedding for the response phoneme, and may process the audio data, with the model, to generate a target voice sequence. The device may utilize an audio embedding model to generate an audio embedding for the target voice sequence, and may combine the text embedding and the audio embedding to generate an embedding input vector. The device may process the embedding input vector, with an audio synthesis model, to generate a final voice response, and may provide the audio data and the final voice response.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.