Methods and systems for enhancing the detection of synthetic voice data
US12131750B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 10, 2024 |
| Grant date | Oct 29, 2024 |
| Priority date | — |
| Expiry date | May 10, 2044 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2025/783
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for enhancing detection of synthetic voice data is provided that includes converting, by an electronic device, monophonic voice data into stereophonic voice data. The stereophonic voice data includes a first channel signal and a second channel signal. Moreover, the method includes decomposing, by a trained machine learning model, the stereophonic voice data into a mid-signal and a side signal. The method also includes determining artifacts indicative of synthetic generation in the structured and secondary artifacts, calculating, based on the determined artifacts, a probability score reflecting the likelihood the monophonic voice data was synthetically generated, and comparing the probability score against a threshold value. When the probability score satisfies the threshold value, there is a high likelihood that the monophonic voice data includes synthetic artifacts, and an alert is generated indicating the monophonic voice data is potentially fraudulent.
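As a rough illustration of two steps named in the abstract, the sketch below splits a two-channel signal into a mid signal and a side signal, then compares a probability score against a threshold. It rests on several assumptions that are not taken from the patent. The abstract assigns the decomposition to a trained machine learning model, whereas this sketch uses the conventional sum/difference arithmetic from audio engineering. The function names `mid_side` and `should_alert`, the default threshold of 0.5, and the reading of "satisfies" as "greater than or equal to" are all hypothetical. The artifact detection and score calculation are not shown.

```python
from typing import List, Sequence, Tuple


def mid_side(left: Sequence[float], right: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Split two channel signals into mid and side signals.

    Assumption: conventional sum/difference mid/side arithmetic,
    not the trained model described in the abstract.
    """
    if len(left) != len(right):
        raise ValueError("both channels must have the same number of samples")
    mid = [(l + r) / 2 for l, r in zip(left, right)]    # content common to both channels
    side = [(l - r) / 2 for l, r in zip(left, right)]   # content that differs between channels
    return mid, side


def should_alert(probability_score: float, threshold: float = 0.5) -> bool:
    """Return True when the score satisfies the threshold.

    Assumption: "satisfies" means score >= threshold; 0.5 is a placeholder value.
    """
    return probability_score >= threshold


# Hypothetical two-sample stereo signal.
mid, side = mid_side([1.0, 0.5], [0.0, 0.5])
```

With this arithmetic the original channels can be recovered as `mid + side` and `mid - side`, so no information is lost in the split.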
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.