Automatic RDF-ization of big data semi-structured datasets
Journal Title: MASKANA - Year 2016, Vol 7, Issue 3
Abstract
Linked data adoption continues to grow at a considerable pace in many fields. However, some of the most important datasets remain underexploited for two main reasons: the huge volume of the datasets and the lack of methods for automatic conversion to RDF. This paper presents an automatic approach that tackles both problems by leveraging recent Big Data tools and a program for automatic conversion from a relational model to RDF. The process can be summarized in three steps: 1) bulk transfer of data from different sources to Hive/HDFS; 2) transformation of the data on Hive to RDF using D2RQ; and 3) storage of the resulting RDF in CumulusRDF. By using these Big Data tools, the platform can handle large amounts of data from different sources, which may be structured or semi-structured. Moreover, since the RDF data are stored in CumulusRDF in the final step, users or applications can consume the resulting data by means of web services or SPARQL queries. Finally, an evaluation in the hydro-meteorological domain demonstrates the soundness of the approach.
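To make the second step more concrete, the following is a minimal sketch of the general idea behind mapping a relational row to RDF triples, serialized as N-Triples. It is not D2RQ's implementation or API; the base URI, the table and column names, and the function `row_to_ntriples` are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Union

Value = Union[str, int, float]

# Hypothetical base URI; a real deployment would use its own namespace.
BASE = "http://example.org/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD = "http://www.w3.org/2001/XMLSchema#"


def escape_literal(text: str) -> str:
    """Escape the characters that N-Triples requires inside a quoted literal."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def row_to_ntriples(table: str, key: str, row: Dict[str, Value]) -> List[str]:
    """Turn one relational row into N-Triples lines.

    Assumed mapping (illustrative only): the subject URI is built from the
    table name and the primary-key value, and each remaining column becomes
    one predicate whose object is a literal.
    """
    subject = f"<{BASE}{table}/{row[key]}>"
    triples = [f"{subject} <{RDF_TYPE}> <{BASE}vocab/{table}> ."]
    for column, value in row.items():
        if column == key:
            continue  # the key is already encoded in the subject URI
        predicate = f"<{BASE}vocab/{table}_{column}>"
        if isinstance(value, int):
            obj = f'"{value}"^^<{XSD}integer>'
        elif isinstance(value, float):
            obj = f'"{value!r}"^^<{XSD}double>'
        else:
            obj = f'"{escape_literal(str(value))}"'
        triples.append(f"{subject} {predicate} {obj} .")
    return triples


if __name__ == "__main__":
    # Hypothetical hydro-meteorological observation row.
    sample = {"id": 7, "station": "Station A", "rainfall_mm": 12.5}
    for line in row_to_ntriples("observation", "id", sample):
        print(line)
```

In the pipeline described above, a mapping of this kind would be applied to the tables held in Hive, and the resulting triples would then be loaded into the RDF store, where they can be queried with SPARQL.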
Authors and Affiliations
Ronald Gualán, Renán Freire, Andrés Tello, Mauricio Espinoza, Víctor Saquicela
Relationship between positive seroconversion to Neospora caninum and reproductive problems and neonatal mortality in Holstein cows
Bovine neosporosis is a parasitic disease caused by the protozoan Neospora caninum (NC). It is considered one of the main causes of abortion in cattle, especially in dairy cattle, and is ...
Mining from a conflicting to a collaborative activity: Review of literature
This article states that the confrontational attitude between local communities pushed by lobbying groups, eventually with the support of local governments, and mining companies can be turned into a corporate communica...
Evaluation of mercury vapor emissions in artisanal amalgamation processes: the case of Ponce Enríquez Canton, Azuay Province
Small-scale mining practiced in several localities of Ponce Enríquez Canton, Azuay Province, Ecuador, commonly uses the mercury amalgamation process, in which the mercury adheres to the particles of ...
Modeling of reservoir operation for the integrated irrigation project in the Macul River basin
An irrigation project has been planned in the Macul River basin, Los Ríos Province, for the development of agricultural activities, which represent the main source of income in the region. This system ...
Integration and massive storage of hydro-meteorological data combining big data & semantic web technologies
Ecuador contains an immense collection of hydro-meteorological data, informing us via standards how to locate and invoke them. If we want to make such data easier to understand and use, we need to store them in a commo...