Data Abstraction and Virtualization

Access and import data sources into the DataPorts Platform

Data virtualization is a data integration technique that provides access to information through a virtualized service layer, regardless of where the data sources are located. It allows applications to access data from a variety of heterogeneous sources through a single endpoint, providing a unified, abstracted, and encapsulated view of information for query purposes, while also transforming and processing the data to prepare it for consumption. A significant challenge in data virtualization is managing the different types of storage systems (e.g., key-value, document, or relational databases) that all need to be integrated. In addition, data-intensive applications that use a virtualized data source still expect certain quality of service guarantees from the system, such as performance and availability. The Data Abstraction and Virtualization (DAV) component addresses these challenges and contributes to the data interoperability of the platform. It also focuses on fulfilling the project’s requirements related to data quality.
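To make the idea of a single access point over heterogeneous sources concrete, here is a minimal sketch. It is not DAV's implementation: the `VirtualLayer` class, its `register` and `query` methods, and the in-memory stand-ins for a key-value store and a relational table are all hypothetical names chosen for illustration.

```python
from typing import Any, Callable, Dict, Iterable, List

Record = Dict[str, Any]


class VirtualLayer:
    """Hypothetical single access point over several heterogeneous sources."""

    def __init__(self) -> None:
        # Each source is registered as a reader that yields records
        # already mapped to a common dictionary shape.
        self._sources: Dict[str, Callable[[], Iterable[Record]]] = {}

    def register(self, name: str, reader: Callable[[], Iterable[Record]]) -> None:
        self._sources[name] = reader

    def query(self, predicate: Callable[[Record], bool]) -> List[Record]:
        """Run one query across all sources and tag each hit with its origin."""
        results: List[Record] = []
        for name, reader in self._sources.items():
            for record in reader():
                if predicate(record):
                    results.append({**record, "_source": name})
        return results


# Stand-ins for two different storage systems (assumed data, for illustration).
key_value_store = {"s1": 21.5, "s2": 19.0}
relational_rows = [("s3", 23.1), ("s4", 18.2)]

layer = VirtualLayer()
layer.register(
    "kv", lambda: ({"sensor": k, "temp": v} for k, v in key_value_store.items())
)
layer.register(
    "sql", lambda: ({"sensor": s, "temp": t} for s, t in relational_rows)
)

# The caller sees one endpoint and does not need to know where records live.
warm = layer.query(lambda r: r["temp"] > 20)
```

The design choice sketched here is that each source supplies its own adapter to a common record shape, so the query side stays unaware of the underlying storage type.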

Features

In a nutshell, the Data Abstraction and Virtualization (DAV) component is responsible for:

Correctly preparing data input from different sources inside the generic DataPorts architecture
Maintaining metadata from all feeds
Exporting the “cleaned” and processed datasets through exposed RESTful APIs, thus making them available to any eventual client

Persistent data streams (that is, data that has already been collected and stored) are the primary source of load for DAV.
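The three responsibilities above (preparing input, maintaining metadata, exporting cleaned datasets) can be sketched roughly as follows. This is an illustrative assumption rather than DAV's actual code or API: the `FeedCatalog` class, the `clean` rule (dropping records with missing required fields), the metadata fields, and the sample feed are all hypothetical.

```python
from datetime import datetime, timezone
from typing import Any, Dict, List

Record = Dict[str, Any]


def clean(records: List[Record], required: List[str]) -> List[Record]:
    """Keep only records in which every required field is present and not None."""
    return [r for r in records if all(r.get(f) is not None for f in required)]


class FeedCatalog:
    """Hypothetical in-memory store of cleaned datasets and per-feed metadata."""

    def __init__(self) -> None:
        self.datasets: Dict[str, List[Record]] = {}
        self.metadata: Dict[str, Dict[str, Any]] = {}

    def ingest(self, feed: str, records: List[Record], required: List[str]) -> Dict[str, Any]:
        cleaned = clean(records, required)
        self.datasets[feed] = cleaned
        self.metadata[feed] = {
            "received": len(records),
            "kept": len(cleaned),
            "ingested_at": datetime.now(timezone.utc).isoformat(),
        }
        return self.metadata[feed]

    def export(self, feed: str) -> Dict[str, Any]:
        # Shape of a payload that a REST handler could return to a client.
        return {
            "feed": feed,
            "metadata": self.metadata[feed],
            "data": self.datasets[feed],
        }


catalog = FeedCatalog()
meta = catalog.ingest(
    "port-calls",  # assumed feed name
    [
        {"vessel": "A", "eta": "2021-01-01"},
        {"vessel": "B", "eta": None},  # dropped: missing value
        {"vessel": None},              # dropped: missing fields
    ],
    required=["vessel", "eta"],
)
payload = catalog.export("port-calls")
```

In a real deployment the `export` payload would be served by an HTTP endpoint; the sketch leaves the web layer out to stay self-contained.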

Demo | Screenshots

DAV is designed to run in the background: it accepts incoming datasets, filters them, stores them, and then forwards them to clients. It has no UI or command interface of any kind, so a demo is not applicable.

Credits

Achilleas Marinakis - achmarin@mail.ntua.gr
Anastasios (Tasos) Nikolakopoulos - tasosnikolakop@mail.ntua.gr
