Federated Analytics

We study analytical query processing across a variety of underlying database systems. This includes the following projects:

PipeGen

PipeGen Logo PipeGen allows users to automatically create an efficient connection between pairs of database systems. PipeGen targets data analytics workloads on shared-nothing engines, and supports scenarios where users seek to perform different parts of an analysis in different DBMSs or want to combine and analyze data stored in different systems. The systems may be colocated in the same cluster or may be in different clusters.

Project Website
 
Read the Paper
 
Get the Code