The project Linked Data for Professional Education (LD4PE), funded between 2014 and 2017 by the Institute of Museum and Library Studies (IMLS) and lead by the University of Washington Information School, developed a web-based exploratorium to support structured discovery of online learning resources about Linked Data. The project produced this website, which has been converted into a static site for preservation, and a Linked Data Competency Index. DCMI will keep this site online on a "best effort" basis as long as resources permit. The Internet Archive's Wayback Machine should be regarded as the source of archival copies for the long term.

Free Your Metadata: Named entity extraction

A brief tutorial explaining how to enrich a dataset even when many fields (notoriously description) contain unstructured text. To capture this potentially interesting information in machine-processable format, named entity recognition can be used via an extension to Open Refine (formerly Google Refine). This tutorial builds on two previous ones which explain how to clean and reconcile
an example dataset (from the Powerhouse Museum) to a specific controlled vocabulary (in the example, the Library of Congress Subject Headings). The textual walk-through demonstrates how to install the extension, open a new project, and perform the extraction of named entities.

URL: http://freeyourmetadata.org/named-entity-extraction/
Keywords: Open Refine, Google Refine, Named-entity extraction
Author: Verborgh, Ruben
Publisher: MaSTIC
Date created: 2016-01-01 05:00:00.000
Language: http://id.loc.gov/vocabulary/iso639-2/eng
Time required: P20M
Interactivity type: active

Competencies

Uses available resources for named entity recognition, extraction, and reconciliation.

Comments are closed.

Sign in or Join

Log in

Free Your Metadata: Named entity extraction