
Think of us as "GitHub for data pipelines" or "decentralized Databricks".
We take powerful data infrastructure used by millions of tech companies and extend it to work across organizational boundaries:
We make real-time data as composable as software libraries on GitHub and crowd-source data cleaning and integration to the global community.