DEV Community

# dataengineering

Posts

Apache Data Lakehouse Weekly: June 9 to 16, 2026

Comments
24 min read
41/60 Days System Design Questions

2
Comments 4
1 min read
Why Breaking Things Apart Eventually Brings Them Together: Modularization in ETL

Comments
4 min read
Top 12 Spark Interview Problems for Data Engineers, With Answers

Comments
10 min read
I Built a Mini Message Broker in Pure Python and Finally Understood How Kafka Moves Millions of Events

1
Comments
6 min read
Why Audit Trails Matter in ClickHouse®: Building Accountability, Compliance, and Security

2
Comments 1
4 min read
Creating HIPAA-Safe Synthetic Patient Data for Healthcare App Testing

Comments
4 min read
From Metadata to Knowledge Discovery: Why I Am Not Starting With a Chatbot

Comments
3 min read
Linux Fundamentals for Data Engineering: A Practical Hands-On Guide

Comments 1
5 min read
Day 19 of 100 Days of ClickHouse®: Managing Users and Roles with RBAC

2
Comments
4 min read
Building Confidence Scoring for Email Open Tracking (Engineering Notes)

Comments
2 min read
You can do WHAT with a Kafka proxy?

Comments
4 min read
Iceduck: A Local Data Lakehouse Stack for Learning (No Cloud Needed)

Comments
1 min read
I Built a B-Tree in Pure Python and Finally Understood Why Postgres Uses It for Every Index

1
Comments
6 min read
The Data Engineer Roadmap for 2026 (in an AI-Native World)

Comments
7 min read