Patent · US Active

Language modeling for conversational understanding domains using semantic web resources

US9679558B2 · kind B2 · utility

9Cited by

2References

20Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Murat Akbacak · Palo Alto, US
Dilek Hakkani-Tur · Los Altos, US
Gokhan Tur · Morristown, US
Larry Paul Heck · Los Altos, US
Benoit Dumoulin · Salaberry-de-Valleyfield, CA

Key dates

Filing date	May 15, 2014
Grant date	Jun 13, 2017
Priority date	—
Expiry date	Jul 30, 2034

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/183
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and methods are provided for training language models using in-domain-like data collected automatically from one or more data sources. The data sources (such as text data or user-interactional data) are mined for specific types of data, including data related to style, content, and probability of relevance, which are then used for language model training. In one embodiment, a language model is trained from features extracted from a knowledge graph modified into a probabilistic graph, where entity popularities are represented and the popularity information is obtained from data sources related to the knowledge. Embodiments of language models trained from this data are particularly suitable for domain-specific conversational understanding tasks where natural language is used, such as user interaction with a game console or a personal assistant application on personal device.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.