Linked Data 2017

 

Overview

Linked Data and other Semantic Web technologies are used in a variety of organisations for projects large and small. They are a key feature in both government open data initiatives and enterprise-wide data integration systems for internal use. In this course, you'll learn about the building blocks of the Semantic Web and how to use them, including how to model your data in RDF, integrate with third-party and OpenData sources, and how to enter and run SPARQL queries. You'll hear about how these technologies are used today, and have a chance to try them out in the hands-on portions of the classes.

This course is chaired by Kal Ahmed and taught by Andy Seaborne, Cathy Dolbear, Jen Williams, Kal Ahmed, and Stuart Williams.

Classes for 2017

The Linked Data course runs on and .

An Introduction to Linked Data

Taught by Jen Williams.

Linked Data is a set of best practises that defines how to structure data so that it can be interlinked and published on the Web. By using the Web as the publishing platform, data can distributed across innumerate points, while still allowing navigation and discovery through the use of URIs in the data itself. Data models are embedded into the structure of the Linked Data itself, and follow the same best practises by being defined as Linked Data Vocabularies.

The principles of Linked Data allows developers to publish data without the need to develop custom APIs, and offers the ability for data publishers to adopt data standards to improve interoperability between disparate sources of data.

This "Introduction to Linked Data" course will cover all of the basics around the structure of the data itself, the use of URIs, popular vocabularies and how to use them in sample datasets.

Delivering Linked Data

Taught by Kal Ahmed.

The infrastructure of the world wide web can do more than deliver documents for people to read off of their screens: it can also deliver data for applications to use. The principles of Linked Data have laid a foundation that has made it possible for governments, media, and e-commerce retailers to publish data on the web without depending on custom-built APIs. This class will show you how to take advantage of these principles to consume available data and to publish it yourself.

Among other things, we'll learn about popular sets of linked data that you can use, how to create links between datasets, how to mint good URIs, HTTP issues, and how to take gradual steps toward good linked data publishing.

Introduction to SPARQL

Taught by Andy Seaborne.

SPARQL is the standard W3C query language for semantic web applications. It brings together the features of a number of RDF query languages into one method for extracting information from data represented in RDF, whether small datasets or large.

This session will provide a solid grounding in SPARQL. After demonstrating how powerful some very simple SPARQL queries can be, we will take a practical approach to looking at the key features of SPARQL 1.0 and 1.1, and then explore the principles underpinning the SPARQL query language.

Hands-on SPARQL

Taught by Andy Seaborne, Jen Williams, and Kal Ahmed.

In this session students will get a chance to try their hand at writing some SPARQL queries. This session will help to consolidate the fundamentals of SPARQL query as well as introducing some new tips and tricks for being more effective with the SPARQL that you write.

For this session students will need a laptop with a wi-fi connection and a browser capable of running modern Javascript applications (e.g. Chrome or a recent version of Firefox).

From CSV to RDF

Taught by Kal Ahmed.

CSV (comma-separated values) files are one of the simplest ways we have of sharing data. From databases and spreadsheets to content management systems and CRMs, many applications are capable of producing CSV data.

This session will focus on how we can create and publish Linked Data from this popular data format. Using the free publishing service DataDock.io we will show how easy it is to map simple CSV data to an ontology and publish it. We will then go on to show how to make use of common tools and open-source products such as OpenRefine to get more creative with formatting and structuring data for publication.

RDF Modelling

Taught by Stuart Williams.

Modelling the world in RDF involves identifying the important entity types within some domain of interest, their properties and relationships. In this session we will workshop the development of an RDF data model for a familiar domain.

We’ll look at the open world nature of RDF models and their focus on making statements about things in the world. We’ll build up a data model as we go along, questioning its purpose and considering the questions (possibly in the form of SPARQL queries) we’d like to be able to answer from data expressed using the model. We’ll take a quick look at expressing our evolving model in RDFS and OWL. Finally we’ll contrast the model we develop in the session with one that’s already been developed for the same domain.

Linked Data In Practice

Taught by Cathy Dolbear.

In this session, guest speakers from industry will give brief talks on how they are using Linked Data in practice.

Experiments in Using Linked Data to verify Knowledge Graphs in Publishing

This talk presents the use case at Oxford University Press of using knowledge graphs for named entity recognition, and how we are trialling the use of Linked Open Data sources such as DBpedia and Geonames to verify and disambiguate the knowledge extracted from our content data. Cathy will describe how we can move from simple subject categorisation to a richer, more nuanced description of the people, places, organisations and other things mentioned in a text, based on a separation of concerns between the content XML, the RDF metadata layer, and the entity knowledge graph.

Linked Data Quiz and Panel Session

Taught by Andy Seaborne, Cathy Dolbear, Jen Williams, Kal Ahmed, and Stuart Williams.

Test what you have learnt with this fun quick-fire multiple-choice battle! Points are scored for being the fastest to pick the correct answer, but beware sometimes there is more than one right choice!

To round off the Linked Data track, all of our presenters and guest speakers will form a panel to discuss the practical aspects to publishing and consuming Linked Data. This year the focus is "Collaborating and Competing". The discussion will focus on the question of how publishers can collaborate on publishing Linked Data while still competing in an open marketplace.

This is an opportunity to participate in a lively discussion and to put your own questions to the panel and draw on their collective wisdom and experience.