amt-caa2018

Example of AMT for CAA 2018.

View project on GitHub

Taming Ambiguity - Dealing with doubts in archaeological datasets using LOD

CAA 2018, Session S33: Guaranteeing data quality in archaeological Linked Open Data

Florian Thiery1,3 AND Dr. Allard Mees2,3

1 Florian Thiery M.Sc. thiery@rgzm.de ORCID iD iconorcid.org/0000-0002-3246-3531

2 Dr. Allard Mees FSA mees@rgzm.de

3 Römisch-Germanisches Zentralmuseum Mainz, Leibniz-Forschungsinstitut für Archäologie rgzm.de, Kompetenzbereich Wissenschaftliche IT

keywords: Linked Open Data, data quality controlled vocabulary NAVIS ship database Samian Research

The Linked Data Cloud is full of controlled resources, which in fact quickly run out of control. Firstly, each resource collection, e.g. a thesaurus, is cooking its own soup related to its research context. Secondly, conceptualisation of LOD assumes standardised data, but in reality, only generic concepts or real instances exist. Thirdly, archaeological items are usually related to generic instances in the LOD cloud, based on their object orientated nature. Describing these relations by modelling archaeological assumptions causes ambiguities which have to be tamed to guarantee data quality for reuse, demonstrated in three examples:

(1) Trying to link a triangular “lateen sail” into the LOD cloud, a problem known from the NAVIS II ancient ship depictions database, reveals that each repository has completely different “hidden assumptions” in its hierarchies with respect to their scientific domain. Are the usually SKOS based relations able to solve the challenge to model also the degree of doubt? Are different methods required?

(2) Trying to map pot fragments of Gaulish Terra Sigillata to historically defined concepts of “types” and “service families”, or aligning these typologies, ends up in modelling doubtful assumptions. Can metrics be used to define “type” definitions? How can an already existing online research community help?

(3) Setting up a meta-index for aligning various distributed databases at the RGZM by keywords and linking into the LOD Cloud is subject to uncertainties within the modelling of relations. As a result, this process includes dealing with the above mentioned ambiguity challenges.

Presentation

DOI

Live Demo

Demo

info

info info

info info info info