https://earthmover.io/blog/cloud-native-dataloader Cloud native data loaders for machine learning using Zarr and XarrayJoe HammanMarch 14, 2024 “…how to efficiently train machine learning models where the inputs are multiple multi-dimensional arrays (a.k.a.
What I Read: How fast process CSV file
https://datapythonista.me/blog/how-fast-can-we-process-a-csv-file How fast can we process a CSV fileMarc GarciaThu 22 February 2024 “…we’ll see in this blog post how to process a CSV file as fast as possible.”
What I Read: SQL order
https://lukianovihor.medium.com/sql-order-of-query-execution-8c7cd926400 SQL — order of query executionIhor LukianovSep 24, 2023 “To maximize your query’s speed on any SQL engine, it’s essential to have an understanding of the SQL execution order.”
What I Read: Database Disassembly
https://materializedview.io/p/databases-are-falling-apart Databases Are Falling Apart: Database Disassembly and Its ImplicationsChris RiccominiJan 29, 2024 “Why are engineers taking databases apart and putting them back together, again?”
What I Read: Unify Batch and ML Systems
https://www.kdnuggets.com/2023/09/hopsworks-unify-batch-ml-systems-feature-training-inference-pipelines Unify Batch and ML Systems with Feature/Training/Inference PipelinesBy Jim Dowling, Co-Founder & CEO, HopsworksSeptember 27, 2023 “This article introduces a unified architectural pattern for building both Batch and Real-Time