Semantic Statistics for Social, Behavioural, and Economic Sciences: Leveraging the DDI Model for the Linked Data Web

October 15-19, 2012

Course Instructors:

Richard Cyganiak (DERI - Digital Enterprise Research Institute, Galway, Ireland)
Arofan Gregory (ODaF - Open Data Foundation, Tucson, Arizona, USA)
Wendy L. Thomas (MPC - Minnesota Population Center, Minneapolis, Minnesota, USA)
Joachim Wackerow (GESIS - Leibniz Institute for the Social Sciences, Mannheim, Germany)

Goals

The previous year's workshop resulted in the creation of a draft RDF vocabulary for the discovery of microdata (unit-record data), based on the Data Documentation Initiative (DDI) model.  One goal will be to build on this model, finalizing it and possibly expanding it to cover a broader set of use cases. A second DDI-based vocabulary was drafted, focusing on an extension of SKOS to describe official classifications used by government agencies and statistical producers - this also should be finalized as a critical set of metadata. DDI further provides mechanisms for addressing some problematic issues within the Web of Linked Data such as provenance, ownership, and versioning, and these themes could be explored. The existing outline and draft of a best practice paper on the publication of microdata and the related metadata into the Linked Data Web will be discussed and may be put forward as a standard for use with data in this domain for dissemination of the Web. Core knowledge on the DDI model and Semantic Web Technologies will be taught.

Description of the workshop

The movement towards more open access to data is being fueled by government initiatives as well as the research community. Statistical data and metadata is already being standardized within the Linked Data Web with the Data Cube vocabulary, based on the SDMX model. There is no equivalent for the discovery and possible use of microdata. In addition, microdata are often confidential, and this aspect of the problem is one which will be a point of discussion in the workshop - how best to advertise the existence of data which cannot be openly exposed? Other aspects of the problem such as quality and documentation issues and provenance need similarly to be addressed.

The Semantic Web and DDI experts approach these issues from different perspectives. By sharing our perspectives, and learning from each other’s experience the goal of the workshop is to develop a best practice for the publication of microdata and related metadata into the Linked Data Web, which might be put forward as a standard for use with data in this domain for dissemination on the Web.

This workshop will examine the metadata model of the Data Documentation Initiative (DDI) used in the Social, Behavioural, and Economic (SBE) sciences, and design an implementation of that model using the Semantic Web standards (RDF, OWL, etc.). Invited participants will represent the user community (data librarians, archivists, researchers, and data producers), DDI experts, and experts in the Semantic Web technologies and standards.

The demand for discovery of both aggregate statistics and the underlying data is strong, and growing through open government initiatives and the efforts of many data producers, data archives, and research centers. Further, Linked Data technologies are becoming increasingly popular within universities, as the basis for tools which can be used to assist research and teaching. Working together to lay out best practices on the publication of microdata and related metadata into the Linked Data Web will benefit both communities and assist researchers in gaining access to digital resources.

Requirements

The workshop is open to anyone interested in the topic. The workshop is held in English.

Maximum number of participants (including instructors): 25

Location

Location

The workshop will take place at Schloss Dagstuhl - Leibniz Center for Informatics, Wadern, GermanyThe location provides an intense working atmosphere in a relaxing environment. Further information about the venue and practical information can be found here (329 KB).

The workshop will be start on Monday October 15th at 9:00 a.m. and will be end on Friday October 19th at approx. 3:00 p.m. Participants are strongly advised to arrive at Dagstuhl on Sunday October 14th. Staying until Saturday October 20th is possible.

About the Instructors

About the Instructors

Richard Cyganiak is working at the Linked Data Research Centre (LiDRC) of DERI.

Arofan Gregory (XML standards expert), Wendy L. Thomas (chair of TIC), and Joachim Wackerow (vice-chair of TIC) are active in the Technical Implementation Committee (TIC) of the DDI Alliance. The workshop is organized in cooperation with the DDI Alliance.

Participation fee

Participation fee

The workshop fee is 300 Euro (reduced fees are possible - please contact us for details). In order to benefit from the reduced fee, a student ID/certificate of enrolment that is valid for the time of the workshop is required. When signing up for the workshop, please let us know if you qualify for the reduced fee. We will then ask you to mail, fax or email a copy/scan of your student ID/certificate of enrolment.

The workshop invoice will be send via letter mail about four weeks before the workshop. It can be paid by bank transfer or by credit card (Master Card or VISA). Usually, we will send the invoice to your private address; in case your institution pays the fee, please provide the billing address in the registration form in the dedicated fields. We will then send the invoice to your institutional address. Further details concerning the payment will be provided after your application was accepted.

Registration information

Registration information

To apply for the workshop, please fill in the online registration form. Please make sure to fill in all required fields. If you do not received an email confirming your application details within 12 hours of applying, please contact us.

The training course is limited to 25 participants. Applications will be processed by application date. This also applies to the waiting list in case the course is booked out when you apply.