Preview of "Digital citizen empowerment: A systematic literature review of theories and development models" by Swapnil Sharma, page 38 of 202


Ivan Bedini, Feroz Farazi, David Leoni, Juan Pane, Ivan Tankoyeu, Stefano Leucci

Figure 3: Dataset Selection step of ODR pipeline

Figure 4: An example of a simplified schema matching
Once the dataset schema has been determined, the attribute value validation step lets the user
adapt the dataset to the schema, exploiting OpenRefine's data cleansing capabilities.
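
As an illustration of what validating attribute values against a schema can look like, the following is a minimal sketch and not the OpenDataRise or OpenRefine implementation. The schema, the attribute names and the sample rows are assumptions made for this example only.

```python
# Hypothetical target schema: attribute name -> parser that raises ValueError
# when a raw cell value does not fit the expected type.
SCHEMA = {
    "name": str,
    "birth_year": int,
}


def validate_rows(rows, schema=SCHEMA):
    """Parse every row against the schema.

    Returns (clean_rows, errors), where errors is a list of
    (row_index, attribute, raw_value) for cells that failed to parse.
    Rows with at least one failing cell are left out of clean_rows.
    """
    clean, errors = [], []
    for i, row in enumerate(rows):
        parsed, ok = {}, True
        for attr, parse in schema.items():
            raw = row.get(attr)
            try:
                # A missing cell is None, so .strip() raises AttributeError.
                parsed[attr] = parse(raw.strip())
            except (ValueError, AttributeError):
                errors.append((i, attr, raw))
                ok = False
        if ok:
            clean.append(parsed)
    return clean, errors


# Illustrative rows only; the second one needs manual cleansing.
rows = [
    {"name": "Dante Alighieri", "birth_year": "1265"},
    {"name": "Giovanni Boccaccio", "birth_year": "c. 1313"},
]
clean, errors = validate_rows(rows)
```

The cells listed in `errors` are the ones a user would then fix by hand before moving on to the next step.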
The subsequent attribute value disambiguation step employs Natural Language Processing
techniques to enrich the dataset content by linking names to known entities (such as Dante
Alighieri, Florence) and words to concepts (such as male, city). Figure 5 shows a screenshot of a
long text that has been automatically enriched. OpenDataRise highlights in red the elements that still
require manual intervention from the user.
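
The linking behaviour described above can be sketched with a plain dictionary lookup. This is a toy stand-in for the Natural Language Processing techniques mentioned in the text, not the method the framework actually uses; the lookup tables, the identifier strings and the `needs_review` flag are all hypothetical.

```python
# Hypothetical lookup tables standing in for an entity base and a concept
# vocabulary. The identifiers on the right are invented for this example.
KNOWN_ENTITIES = {
    "dante alighieri": "entity:Dante_Alighieri",
    "florence": "entity:Florence",
}
KNOWN_CONCEPTS = {
    "male": "concept:male",
    "city": "concept:city",
}


def disambiguate(value):
    """Link a cell value to a known entity or concept, if one matches.

    Values that match nothing are flagged with needs_review=True, which
    mirrors the elements left for manual intervention by the user.
    """
    key = value.strip().lower()
    link = KNOWN_ENTITIES.get(key) or KNOWN_CONCEPTS.get(key)
    return {"value": value, "link": link, "needs_review": link is None}


enriched = [disambiguate(v) for v in ["Dante Alighieri", "city", "Ravenna"]]
```

Here "Ravenna" is absent from both tables, so it comes back unlinked and flagged for review.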
In the entity alignment step, the framework treats the rows of the dataset as entities, i.e. real
instances. The goal of this step is to schedule changes to the entity storage, to be committed in the
next step. Such changes can be either updates to existing matching entities or the creation of new
entities with values from the source dataset.
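
A minimal sketch of such a scheduling step is given below, assuming that a row matches a stored entity when their names are identical. The matching rule, the in-memory store and the function name `plan_changes` are assumptions for illustration; the source does not specify how matching entities are found.

```python
def plan_changes(rows, store, key="name"):
    """Schedule, without applying, one change per dataset row.

    store maps an entity id to a dict of attribute values. A row whose
    `key` value equals that of a stored entity yields an "update" of that
    entity; any other row yields a "create". Nothing is written to the
    store here: committing the plan belongs to the next step.
    """
    # Index existing entities by the matching attribute.
    index = {entity[key]: entity_id for entity_id, entity in store.items()}
    plan = []
    for row in rows:
        entity_id = index.get(row[key])
        if entity_id is None:
            plan.append(("create", None, row))
        else:
            plan.append(("update", entity_id, row))
    return plan


# Illustrative data only.
store = {1: {"name": "Florence", "type": "city"}}
rows = [
    {"name": "Florence", "country": "Italy"},
    {"name": "Ravenna", "country": "Italy"},
]
plan = plan_changes(rows, store)
```

Keeping the plan separate from the commit lets the user inspect the scheduled updates and creations before the entity storage is modified.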

CC: Creative Commons License, 2014.