Patent · US Active

Text formatter

US11809804B2 · kind B2 · utility

4Cited by
16References
23Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 26, 2021
Grant dateNov 7, 2023
Priority date
Expiry dateJun 24, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods, systems, and computer programs are presented for formatting raw text. One method includes an operation for accessing raw text comprising words corresponding to one or more sentences. The raw text is lowercase text without any punctuation. Further, the method includes operations for creating a plurality of sub-words corresponding to the raw text, and for generating, by a machine-learning (ML) model, an output for each sub-word based on the created sub-words. The output for each sub-word indicates a formatting operation for the corresponding sub-word. The method further includes an operation for generating, based on the formatting operations in the outputs for the sub-words, formatted text corresponding to the raw text. The formatted text is text with correct grammar, proper punctuation, and proper capitalization according to a meaning of words spoken by a speaker associated with the raw text.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.