Taming Ambiguity - Dealing with doubts in archaeological datasets using LOD
CAA 2018, Session S33: Guaranteeing data quality in archaeological Linked Open Data
Florian Thiery1,3 AND Dr. Allard Mees2,3
1 Florian Thiery M.Sc. thiery@rgzm.de
orcid.org/0000-0002-3246-3531
2 Dr. Allard Mees FSA mees@rgzm.de
3 Römisch-Germanisches Zentralmuseum Mainz, Leibniz-Forschungsinstitut für Archäologie rgzm.de
, Kompetenzbereich Wissenschaftliche IT
keywords: Linked Open Data
, data quality
controlled vocabulary
NAVIS ship database
Samian Research
The Linked Data Cloud is full of controlled resources, which in fact quickly run out of control. Firstly, each resource collection, e.g. a thesaurus, is cooking its own soup related to its research context. Secondly, conceptualisation of LOD assumes standardised data, but in reality, only generic concepts or real instances exist. Thirdly, archaeological items are usually related to generic instances in the LOD cloud, based on their object orientated nature. Describing these relations by modelling archaeological assumptions causes ambiguities which have to be tamed to guarantee data quality for reuse, demonstrated in three examples:
(1) Trying to link a triangular “lateen sail” into the LOD cloud, a problem known from the NAVIS II ancient ship depictions database, reveals that each repository has completely different “hidden assumptions” in its hierarchies with respect to their scientific domain. Are the usually SKOS based relations able to solve the challenge to model also the degree of doubt? Are different methods required?
(2) Trying to map pot fragments of Gaulish Terra Sigillata to historically defined concepts of “types” and “service families”, or aligning these typologies, ends up in modelling doubtful assumptions. Can metrics be used to define “type” definitions? How can an already existing online research community help?
(3) Setting up a meta-index for aligning various distributed databases at the RGZM by keywords and linking into the LOD Cloud is subject to uncertainties within the modelling of relations. As a result, this process includes dealing with the above mentioned ambiguity challenges.