Patent · US Active

Exploiting structured content for unsupervised natural language semantic parsing

US10235358B2 · kind B2 · utility

15Cited by
35References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 21, 2013
Grant dateMar 19, 2019
Priority date
Expiry dateMar 18, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/30
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Structured web pages are accessed and parsed to obtain implicit annotation for natural language understanding tasks. Search queries that hit these structured web pages are automatically mined for information that is used to semantically annotate the queries. The automatically annotated queries may be used for automatically building statistical unsupervised slot filling models without using a semantic annotation guideline. For example, tags that are located on a structured web page that are associated with the search query may be used to annotate the query. The mined search queries may be filtered to create a set of queries that is in a form of a natural language query and/or remove queries that are difficult to parse. A natural language model may be trained using the resulting mined queries. Some queries may be set aside for testing and the model may be adapted using in-domain sentences that are not annotated. The models may be tested using these implicitly annotated natural-language-like queries in an unsupervised fashion.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.