Patent · US Active

Systems and methods for reconstructing voice packets using natural language generation during signal loss

US12334048B2 · kind B2 · utility

Cited by: 0
References: 6
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Oct 12, 2022
Grant date: Jun 17, 2025
Priority date
Expiry date: Aug 25, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L25/18
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A device may receive and convert audio data to text data in real-time, and may detect a network fluctuation that causes missing voice packets. The device may process partial text and context of the text data, with a model, to generate a new phrase, and may generate a response phoneme for the new phrase. The device may utilize a text embedding model to generate a text embedding for the response phoneme, and may process the audio data, with the model, to generate a target voice sequence. The device may utilize an audio embedding model to generate an audio embedding for the target voice sequence, and may combine the text embedding and the audio embedding to generate an embedding input vector. The device may process the embedding input vector, with an audio synthesis model, to generate a final voice response, and may provide the audio data and the final voice response.
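
The data flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name here (`VoicePacket`, `find_missing`, `combine_embeddings`, `Models`, `reconstruct`) is hypothetical, the models are passed in as plain callables, and the speech-to-text step is assumed to have already produced the partial text. Two details are assumptions the abstract does not state: missing packets are detected from gaps in a per-packet sequence number, and the text and audio embeddings are combined by concatenation.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Vector = List[float]


@dataclass
class VoicePacket:
    seq: int              # hypothetical per-packet sequence number
    samples: List[float]  # decoded audio samples carried by the packet


def find_missing(packets: Sequence[VoicePacket]) -> List[int]:
    """Return sequence numbers absent between the first and last received packet."""
    if not packets:
        return []
    seen = {p.seq for p in packets}
    return [s for s in range(min(seen), max(seen) + 1) if s not in seen]


def combine_embeddings(text_emb: Vector, audio_emb: Vector) -> Vector:
    """Build the embedding input vector (assumption: plain concatenation)."""
    return list(text_emb) + list(audio_emb)


@dataclass
class Models:
    """Stand-ins for the models named in the abstract, supplied by the caller."""
    complete_phrase: Callable[[str, str], str]          # (partial text, context) -> new phrase
    to_phonemes: Callable[[str], List[str]]             # new phrase -> response phonemes
    embed_text: Callable[[List[str]], Vector]           # phonemes -> text embedding
    target_voice: Callable[[List[float]], List[float]]  # audio data -> target voice sequence
    embed_audio: Callable[[List[float]], Vector]        # voice sequence -> audio embedding
    synthesize: Callable[[Vector], List[float]]         # input vector -> final voice response


def reconstruct(partial_text: str, context: str,
                audio: List[float], models: Models) -> List[float]:
    """Run the steps in the order the abstract lists them."""
    phrase = models.complete_phrase(partial_text, context)
    phonemes = models.to_phonemes(phrase)
    text_emb = models.embed_text(phonemes)
    voice_seq = models.target_voice(audio)
    audio_emb = models.embed_audio(voice_seq)
    return models.synthesize(combine_embeddings(text_emb, audio_emb))
```

In this sketch a caller would invoke `reconstruct` only when `find_missing` returns a non-empty list, then provide the synthesized response together with the audio data that was received.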

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.