Query-independent entity importance in books
US7958128B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 15, 2010 |
| Grant date | Jun 7, 2011 |
| Priority date | — |
| Expiry date | Jul 15, 2030 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/3331
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A corpus contains information including text from books and metadata about the books. The book texts mention entities of various types, such as location, date, event, and person entities. An entity importance engine analyzes the information in the corpus to identify the entities mentioned therein, and ranks the entities using query-independent importance scores. The importance scores can be based in part on the contexts in which the entities are mentioned by the books. A presentation engine enables searching of the corpus to identify books satisfying a search query. The presentation engine presents information about a designated book. Entities mentioned in the book are presented in a style selected to enhance comprehension and utility. For example, location entities can be presented on a map with markers showing the locations of the entities. The number of entities presented are limited and ordered based on the query-independent importance scores.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.