Reducing false positives in data validation using statistical heuristics
US8959047B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 10, 2012 |
| Grant date | Feb 17, 2015 |
| Priority date | — |
| Expiry date | Jan 24, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N5/02
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
To validate data, a plurality of strings that match a predetermined regular expression is extracted from the data. A validated subset of the strings is identified. To determine whether the validated subset has been falsely validated, it is determined whether the validated subset satisfies each of one or more predetermined criteria relative to the plurality of strings. In one embodiment, the subset is determined to be falsely validated if at least one of the criteria is satisfied. In another embodiment, the subset is determined to be falsely validated if all of the criteria are satisfied. The data are released only if the subset is determined to be falsely validated.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.