• No results found

Entity Reconciliation using Object Similarity Summary

N/A
N/A
Protected

Academic year: 2021

Share "Entity Reconciliation using Object Similarity Summary"

Copied!
1
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

73

Summary

Entity Reconciliation using Object Similarity

Case ‘Matching person entities without the existence a common identifier’ For research purposes a unified and integral view on entities in the field of police and justice is of crucial importance. For several reasons, e.g. enforcement of privacy, linking databases on primary and foreign keys is not always possible or desired. We have developed an approach that focuses on reconciliation in these situations. Our approach is based on exploiting a set of overlapping and related attributes. An attribute in this set does not uniquely identify an entity, but discriminates entities to a certain extent (i.e., the selectivity factor is not too large).

To perform reconciliations, we have combined schema information and the content of databases with available domain knowledge of experts. Schema in-formation of different databases is used to determine what parts of a schema pertain to the same real-world entity. The content of the databases and avail-able domain knowledge are used to define similarity functions. These functions are used to decide whether tuples in different databases refer to the same real-world entity or not.

We have implemented our approach, resulting in a prototype called EROS. We have applied EROS on two databases in the field of police and justice. It appears that our approach can be marked as quite effective, since more than 93% of the tuples have been correctly reconciliated.

For the time-being EROS is able to process only two databases at a time. Ex-tending EROS as such that it is able to process more than two databases at a time in an efficient way is a topic for further research. Extending the knowledge system of EROS with more rules is another topic for further research. To what extent our approach can be generalized is also a topic that needs attention.

Referenties

GERELATEERDE DOCUMENTEN

Hassel (1951) describes some experiments, in which visual acuity, determined by means of Landolt-rings, is compared for 'daylight'-fluorescent lamps and incandescent

Previous research suggests that not all newcomers are directly accepted within an existing team (Moreland & Levine, 1982; Choi & Thompson, 2005). The objective

Mais, c’est précisément dans ce genre de contrôle que l’introduction d’un niveau de sécurité devient très délicat étant donné qu’il est impossible de

Szajnberg, Skrinjaric, and Moore 1989 studied a small sample of eight mono- and dizygotic twins and found a concordance of 63%; three of the four monozygotic twin pairs 75%

Unless the original article in the bibliographic database is clearly known to be a retracted, researchers may not know that the paper has been withdrawn and the

Interlocking is a mechanism what uses the roughness of the surrounded tissue for adhesion, instead of the surface free energy what is the main adhesion mechanism used by

• The final author version and the galley proof are versions of the publication after peer review.. • The final published version features the final layout of the paper including

Aangezien er echter geen duidelijk dateerbare sporen noch structuren aangetroffen werden, lijkt een vervolgonderzoek niet aangewezen te zijn. Bij het archeologisch