https://www.startdataengineering.com/post/design-patterns/ Data Pipeline Design Patterns – #1. Data flow patternsStart Data EngineeringDec 11, 2022 “This post will cover the typical data flow design patterns. We will learn about the pros
What I Read: ELT Schedules, Root Cause Analysis
https://www.montecarlodata.com/how-elt-schedules-can-improve-root-cause-analysis-for-data-engineers/ How ELT Schedules Can Improve Root Cause Analysis For Data EngineersRyan KearnsUpdated December 9, 2022 “In this article, Ryan Kearns… discusses the limitations of segmentation analysis when it comes
What I Read: Realtime ML Pipelines
https://medium.com/@nparsons08/challenges-of-building-realtime-ml-pipelines-4782181425c7 Challenges of Building Realtime ML PipelinesNick ParsonsNov 18 “…as companies start introducing realtime into their ML pipelines, they are finding themselves having to weigh the trade-offs between performance, cost,
What I Read: Data Pipeline Smoke Tests
https://dagster.io/blog/smoke-test-data-pipeline The Unreasonable Effectiveness of Data Pipeline Smoke TestsSandy RyzaOctober 19, 2022 “Data practitioners waste time writing unit tests to catch bugs they could have caught with smoke tests.”