Endpointing in speech processing
US12211517B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 15, 2021 |
| Grant date | Jan 28, 2025 |
| Priority date | — |
| Expiry date | Mar 12, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2025/783
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speech-processing system may determine potential endpoints in a user's speech. Such endpoint prediction may include determining a potential endpoint in a stream of audio data, and may additionally including determining an endpoint score representing a likelihood that the potential endpoint represents an end of speech representing a complete user input. When the potential endpoint has been determined, the system may publish a transcript of speech that preceded the potential endpoint, and send it to downstream components. The system may continue to transcribe audio data and determine additional potential endpoints while the downstream components process the transcript. The downstream components may determine whether the transcript is complete; e.g., represents the entirety of the user input. Final endpoint determinations may be made based on the results of the downstream processing including automatic speech recognition, natural language understanding, etc.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.