The Modern Data Engineer Part 6: Hard Deletes and the Ghost Row Problem
Incremental pipelines built on updated_at watermarks cannot detect hard deletes. The result is a ghost row in your SCD Type 2 model and quietly wrong analytics downstream.
You may not have guessed it, my name is Bram. One of the things I do is architect data platforms and share opinionated takes on how they should actually work. Here you'll find real insights for real problems, not the usual vendor narrative. Whether you're scaling platforms, making strategic decisions, or questioning your current approach, there's something here worth your time. Because data engineering is a craft.
Building scalable foundations using Cloud Native principles and modern data stacks.
Custom pipeline development and high-throughput stream processing systems.
Bridging the gap between raw data and business decisions through governance and AI readiness.
An opinionated blog designed to challenge decision makers, inspire data professionals, and spark meaningful discussions on the future of data engineering.
Incremental pipelines built on updated_at watermarks cannot detect hard deletes. The result is a ghost row in your SCD Type 2 model and quietly wrong analytics downstream.
SAFe wraps agile vocabulary around a plan-driven framework and sells the result as scaled agility. What it actually scales is the appearance of agility, not the substance. For data teams, the damage is structural and compounding.
Governance only scales when metadata is produced by delivery workflows, not manual stewardship.
For consulting inquiries or architectural engagements, please visit us at Xyntrel.