Patent · US Expired

Handwriting and speech recognizer using neural network with separate start and continuation output scores

US6393395B1 · kind B1 · utility

43Cited by

24References

36Claims

0Family size

Assignee

Microsoft Corporation · US

Inventors

Angshuman Guha · Los Gatos, US
Patrick Haluptzok · Sammamish, US
James A. Pittman · Issaquah, US

Key dates

Filing date	Jan 7, 1999
Grant date	May 21, 2002
Priority date	—
Expiry date	Jan 7, 2019

Classification

Technology area (CPC G)Physics
CPC primaryG06V30/10
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method and system for recognizing user input information including cursive handwriting and spoken words. A time-delayed neural network having an improved architecture is trained at the word level with an improved method, which, along with preprocessing improvements, results in a recognizer with greater recognition accuracy. Preprocessing is performed on the input data and, for example, may include resampling the data with sample points based on the second derivative to focus the recognizer on areas of the input data where the slope change per time is greatest. The input data is segmented, featurized and fed to the time-delayed neural network which outputs a matrix of character scores per segment. The neural network architecture outputs a separate score for the start and the continuation of a character. A dynamic time warp (DTW) is run against dictionary words to find the most probable path through the output matrix for that word, and each word is assigned a score based on the least costly path that can be traversed through the output matrix. The word (or words) with the overall lowest score (or scores) are returned. A DTW is similarly used in training, whereby the sample ink only…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.