Show simple item record

dc.contributor.advisorSzlichta, Jaroslaw
dc.contributor.authorKeller, Alexander
dc.date.accessioned2017-08-03T19:59:40Z
dc.date.accessioned2022-03-29T17:39:20Z
dc.date.available2017-08-03T19:59:40Z
dc.date.available2022-03-29T17:39:20Z
dc.date.issued2017-04-01
dc.identifier.urihttps://hdl.handle.net/10155/792
dc.description.abstractPoor data quality has become a pervasive issue due to the increasing complexity and size of modern datasets. Functional dependencies have been used in existing cleaning solutions to model syntactic equivalence. They are not able to model semantic equivelence, however. We advance the state of data quality constraints by defining, discovering, and cleaning Ontology Functional Dependencies. We define their theoretical foundations, including sound and complete axioms, and linear inference procedure. We develop algorithms for data verification, constraint discovery, data cleaning, ontology versus data inconsistency identification, and optimizations to each. Our experimental evaluation shows the scalability and accuracy of our algorithms. We show that ontology FDs are useful to capture domain attribute relationships, and can significantly reduce the number of false positive errors in data cleaning techniques that rely on traditional FDs.en
dc.description.sponsorshipUniversity of Ontario Institute of Technologyen
dc.language.isoenen
dc.subjectConstraintsen
dc.subjectDataen
dc.subjectQualityen
dc.subjectCleaningen
dc.subjectDiscoveryen
dc.titleData curation with ontology functional dependencesen
dc.typeThesisen
dc.degree.levelMaster of Science (MSc)en
dc.degree.disciplineComputer Scienceen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record