Patent · US Active

Method and system of adding punctuation and establishing language model using a punctuation weighting applied to chinese speech recognized text

US9811517B2 · kind B2 · utility

1Cited by
5References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 6, 2014
Grant dateNov 7, 2017
Priority date
Expiry dateDec 28, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/26
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method of processing information content based on a Chinese language model is performed at a computer, the method including: identifying a plurality of expressions in the information content extracted from a speech input through speech recognition that is queued to be processed; dividing the expressions into a plurality of characteristic units according to semantic features and predetermined characteristics associated with each characteristic unit, each including a subset of the expressions and the predetermined characteristics at least including a respective integer number of expressions that are included in the characteristic unit; extracting, from the Chinese language model, a plurality of probabilities for punctuation marks associated with each characteristic unit; and in accordance with the probabilities, associating a respective punctuation mark with each characteristic unit included in the information content. The method further comprises adding punctuation marks based on a weight determined for each punctuation mark.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.