DEV Community

# dataengineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
QN : Ingest and transform data in a lakehouse

QN : Ingest and transform data in a lakehouse

Comments
2 min read
Apache Kafka Explained: A Practical Beginner Guide for Data Engineers

Apache Kafka Explained: A Practical Beginner Guide for Data Engineers

3
Comments
9 min read
Honest Memory: What Production Accuracy Data Actually Shows About AI Agent Memory

Honest Memory: What Production Accuracy Data Actually Shows About AI Agent Memory

Comments
4 min read
Building on Brazilian public data: a developer's field guide (CNPJ, CEP, Congress, BACEN)

Building on Brazilian public data: a developer's field guide (CNPJ, CEP, Congress, BACEN)

Comments
2 min read
I let an AI agent set up my entire Kafka platform. Here's what actually happened.

I let an AI agent set up my entire Kafka platform. Here's what actually happened.

Comments
4 min read
LINUX FUNDAMENTALS FOR DATA ENGINEERING

LINUX FUNDAMENTALS FOR DATA ENGINEERING

Comments
5 min read
Advanced Kubernetes Patterns for Data Engineers

Advanced Kubernetes Patterns for Data Engineers

5
Comments
1 min read
Extract data from Databases into DuckLake

Extract data from Databases into DuckLake

Comments
4 min read
How I forced Python standard libraries to process and serialize production server crashes into Parquet locally

How I forced Python standard libraries to process and serialize production server crashes into Parquet locally

Comments
1 min read
From Clean CSVs to Production‑Shaped Data: A Practical Guide for Academic ML and Data Engineering

From Clean CSVs to Production‑Shaped Data: A Practical Guide for Academic ML and Data Engineering

Comments
5 min read
I Built a Write-Ahead Log in Pure Python and Finally Understood How Databases Survive Crashes

I Built a Write-Ahead Log in Pure Python and Finally Understood How Databases Survive Crashes

Comments
7 min read
Architecture Over Alerts: How We Cut BigQuery Costs by 57%($12M) for a Fortune 500

Architecture Over Alerts: How We Cut BigQuery Costs by 57%($12M) for a Fortune 500

Comments
4 min read
I Built a Columnar File Format in Pure Python — a tiny, readable Parquet

I Built a Columnar File Format in Pure Python — a tiny, readable Parquet

Comments
6 min read
Running Apache Airflow + Docker for Free Using GitHub Codespaces

Running Apache Airflow + Docker for Free Using GitHub Codespaces

Comments
1 min read
Why Big Tech is Migrating from Traditional Databases to NewSQL

Why Big Tech is Migrating from Traditional Databases to NewSQL

1
Comments
1 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.