Patent · US Active

Systems and methods for reconstructing voice packets using natural language generation during signal loss

US12334048B2 · kind B2 · utility

0Cited by

6References

20Claims

0Family size

Assignee

Verizon Patent and Licensing Inc. · US

Inventors

Saurabh Tahiliani · Noida, IN
Subham Biswas · Ashti, IN

Key dates

Filing date	Oct 12, 2022
Grant date	Jun 17, 2025
Priority date	—
Expiry date	Aug 25, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/18
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A device may receive and convert audio data to text data in real-time, and may detect a network fluctuation that causes missing voice packets. The device may process partial text and context of the text data, with a model, to generate a new phrase, and may generate a response phoneme for the new phrase. The device may utilize a text embedding model to generate a text embedding for the response phoneme, and may process the audio data, with the model, to generate a target voice sequence. The device may utilize an audio embedding model to generate an audio embedding for the target voice sequence, and may combine the text embedding and the audio embedding to generate an embedding input vector. The device may process the embedding input vector, with an audio synthesis model, to generate a final voice response, and may provide the audio data and the final voice response.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.