How to build a modern, scalable data platform to power your analytics and data science projects. Table of Contents: The…
Big Data
Modern Unified Data Architecture
Today, most business value is derived from the analysis of data and products powered by data, rather than the software…
Lambda and Kappa Architectures in Brief
When working with data engineering solutions in the real world, the main problem faced by data engineers in general is…
6 SQL Data Warehouse Solutions For Big Data Analysts (With Their Pros And Cons)
Are you confused about which SQL query tool is best for your organization? In this technically dynamic world where data…
The challenges of running Druid at large scale, and future directions, part 2
In the previous post I described how the Druid time-series data store is used at Metamarkets and discussed some of the major challenges that we face…
The challenges of running Druid at large scale, and future directions, part 1
Druid is a time-series data store. I’m a Druid committer and one of the people at Metamarkets who operate the largest known Druid…
Big data architectures
A big data architecture is designed to handle the ingestion, processing, and analysis of data that is too large or…
Engineering Data Analytics with Presto and Apache Parquet at Uber
From determining the most convenient rider pickup points to predicting the fastest routes, Uber uses data-driven analytics to create seamless…