Efficient character counting for variable length encoding formats
US12425046B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 6, 2023 |
| Grant date | Sep 23, 2025 |
| Priority date | — |
| Expiry date | Mar 27, 2044 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H03M7/705
- WIPO field: Basic communication processes
- WIPO sector: Electrical engineering
Abstract
Technologies and solutions are provided for determining the number of characters in encoded data, particularly for encoding formats that have variable byte lengths. Bytes in the encoding format can have different types, including at least one type that represents a continuation byte. That is, rather than all data for a character being held in a single byte, a character's data can be encoded using two or more bytes. The number of continuation bytes can be counted and subtracted from the total number of bytes in a data set to determine the number of characters in the data set. Optionally, the validity of the data set with respect to an encoding format can be determined prior to, or concurrently with, determining the number of characters in the data set. SIMD techniques can be used with the character counting and validation processes to improve their performance.
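The counting rule in the abstract can be illustrated with a minimal sketch. It assumes UTF-8 as the variable-length encoding (the abstract does not name a specific format), where continuation bytes have the bit pattern `10xxxxxx`. The function name `count_characters` and its `validate` parameter are hypothetical, chosen for illustration; this is a scalar version and not the patented SIMD implementation.

```python
def count_characters(data: bytes, validate: bool = False) -> int:
    """Count characters (code points) in UTF-8 encoded data.

    Assumption: the data is UTF-8. If validate is True, the data is
    checked first and UnicodeDecodeError is raised when it is invalid.
    """
    if validate:
        # Strict decoding rejects malformed UTF-8 sequences.
        data.decode("utf-8")

    # A UTF-8 continuation byte has its top two bits set to 10.
    continuation_bytes = sum(1 for b in data if (b & 0xC0) == 0x80)

    # Every character contributes exactly one non-continuation byte,
    # so characters = total bytes - continuation bytes.
    return len(data) - continuation_bytes


if __name__ == "__main__":
    text = "naïve café 日本語"
    encoded = text.encode("utf-8")
    print(len(encoded), count_characters(encoded, validate=True))
```

The subtraction works because each character has exactly one leading (non-continuation) byte, so no decoding of the individual characters is needed. A vectorized version would presumably apply the same mask-and-compare to many bytes at once.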
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.