https://earthmover.io/blog/cloud-native-dataloader Cloud native data loaders for machine learning using Zarr and XarrayJoe HammanMarch 14, 2024 “…how to efficiently train machine learning models where the inputs are multiple multi-dimensional arrays (a.k.a.
What I Read: High-Quality Human Data
https://lilianweng.github.io/posts/2024-02-05-human-data-quality/Thinking about High-Quality Human DataLilian WengFebruary 5, 2024 “High-quality data is the fuel for modern data deep learning model training.”
What I Read: Unify Batch and ML Systems
https://www.kdnuggets.com/2023/09/hopsworks-unify-batch-ml-systems-feature-training-inference-pipelines Unify Batch and ML Systems with Feature/Training/Inference PipelinesBy Jim Dowling, Co-Founder & CEO, HopsworksSeptember 27, 2023 “This article introduces a unified architectural pattern for building both Batch and Real-Time