Kafka Internals

🚧 This post is a work in progress, but feel free to explore what’s here so far. Stay tuned for more! 14 years of Apache Kafka! Kafka is the de facto standard for event streaming, just like AWS S...

Spatial Index: R Trees

If you have been following the Spatial Index Series, it started with the need for multi-dimensional indexes and an introduction to space-filling curves, followed by a deep dive into grid systems (G...

Spatial Index: Tessellation

Brewing! This post is a continuation of Spatial Index: Grid Systems, where we will set the foundation for tessellation and delve into the details of Uber H3. 0. Foundation Tessellation or tiling is the...

Spatial Index: Grid Systems

This post is a continuation of Stomping Grounds: Spatial Indexes, but don’t worry if you missed the first part; you’ll still find plenty of new insights right here. 3. Geohash Geohash: Invented in...

Spatial Index: Space-Filling Curves

0. Overview Spatial data has grown (and is still growing) rapidly thanks to web services tracking where and when users do things. Most applications add location tags and often allow users to check in specific ...

Real-time insights: Telemetry Pipeline

0. Overview 0.1. Architecture A telemetry pipeline is a system that collects, ingests, processes, stores, and analyzes telemetry data (metrics, logs, traces) from various sources in real-time or n...

Debezium: PostgreSQL Change Data Capture

Figure 1: Debezium Postgres Connector 1. Goal Set up Debezium to capture row-level changes in the schemas of a PostgreSQL database and publish to Kafka topic(s). The high-level architecture is u...

Postgres as a Graph Database

Neo4j vs PostgreSQL Have you ever come across the need to store a graph in a relational database because using/onboarding a graph database for a small use-case is overkill? Before jumping into u...

Why You Should Reinvent the Wheel

Reinvent the Wheel! As I get slightly older and slightly wiser, it’s more and more apparent to me that we humans live a life full of contradiction, are rarely self-satisfied, and thrive on lea...

Distributed Model Training

Porco Rosso. Distributed Training Deep learning is a subset of machine learning, a branch of artificial intelligence that learns to perform tasks through experience. Deep learning algorithms are well suited...

Anatomy of a System Design Interview

Complex is Simple. What’s better than going over a real system design interview question (interviewed in June 2022)? 1. Requirements Review the end-to-end design below. You need to design Your ...