https://lilianweng.github.io/posts/2024-02-05-human-data-quality/Thinking about High-Quality Human DataLilian WengFebruary 5, 2024 “High-quality data is the fuel for modern data deep learning model training.”
What I Read: Disagreement Modelling
https://koaning.io/posts/large-disagreement-models/ Large Disagreement ModellingVincent D. Warmerdam2023-05-26 “So instead of fully relying on a large language model, how might we use it effectively in existing pipelines?”
What I Read: Bootstrapping Labels
https://thegradient.pub/bootstrapping-labels-via-_-supervision-human-in-the-loop/ Bootstrapping Labels via _ Supervision & Human-In-The-LoopEugene Yan05.Mar.2022 “Collecting training labels is a seldom discussed art…. In this write-up, we’ll discuss semi, active, and weakly supervised learning, and see
What I Read: Production with Deep Semi-Supervised Learning
https://towardsdatascience.com/from-research-to-production-with-deep-semi-supervised-learning-7caaedc39093 From Research to Production with Deep Semi-Supervised LearningVarun NairSep 25 “Semi-supervised learning (SSL), a subfield that combines both supervised and unsupervised learning, has grown in popularity in the deep