In this video, the speaker begins by discussing the Web of Data and Linked Data Principles, then switches focus to the challenges of publishing and maintaining Linked Data datasets over time. This includes data quality issues, such as incompleteness, redundancy, inconsistency, and incorrectness. Difficulty employing entity-relationship model.
Keywords: Entity resolution, Digital preservation, Similarity functions, Web of Data
Author: Christophides, Vassilis
Date created: 2013-11-05 05:00:00.000
Time required: P1H
Educational use: professionalDevelopment
Educational audience: generalPublic
Interactivity type: expositive
- Cleans a dataset by finding and correcting errors, removing duplicates and unwanted data.
- Knows the "five stars" of Open Data: put data on the Web, preferably in a structured and preferably non-proprietary format, using URIs to name things, and link to other data.
- Understands that Linked Data (2006) extended the notion of a web of documents (the Web) to a notion of a web of finer-grained data (the Linked Data cloud).
- Uses available resources for named entity recognition, extraction, and reconciliation.