Method and system of adding punctuation and establishing language model using a punctuation weighting applied to chinese speech recognized text
US9811517B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 6, 2014 |
| Grant date | Nov 7, 2017 |
| Priority date | — |
| Expiry date | Dec 28, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/26
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method of processing information content based on a Chinese language model is performed at a computer, the method including: identifying a plurality of expressions in the information content extracted from a speech input through speech recognition that is queued to be processed; dividing the expressions into a plurality of characteristic units according to semantic features and predetermined characteristics associated with each characteristic unit, each including a subset of the expressions and the predetermined characteristics at least including a respective integer number of expressions that are included in the characteristic unit; extracting, from the Chinese language model, a plurality of probabilities for punctuation marks associated with each characteristic unit; and in accordance with the probabilities, associating a respective punctuation mark with each characteristic unit included in the information content. The method further comprises adding punctuation marks based on a weight determined for each punctuation mark.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.