labeling – Andrew Fairless, Ph.D.

What I Read: High-Quality Human Data

By Andrew Fairless on April 17, 2024March 1, 2024

https://lilianweng.github.io/posts/2024-02-05-human-data-quality/Thinking about High-Quality Human DataLilian WengFebruary 5, 2024 “High-quality data is the fuel for modern data deep learning model training.”

What I Read: Disagreement Modelling

By Andrew Fairless on August 21, 2023July 9, 2023

https://koaning.io/posts/large-disagreement-models/ Large Disagreement ModellingVincent D. Warmerdam2023-05-26 “So instead of fully relying on a large language model, how might we use it effectively in existing pipelines?”

What I Read: Weak Supervision

By Andrew Fairless on July 11, 2022June 27, 2022

https://www.kdnuggets.com/2022/05/weak-supervision-modeling-explained.html Weak Supervision Modeling, ExplainedBy Frederic Sala, Computer Sciences Department at the University of Winsconsin-MadisonMay 27, 2022 “Weak supervision is a way to obtain labels for training data points forContinue readingWhat I Read: Weak Supervision

What I Read: Bootstrapping Labels

By Andrew Fairless on April 27, 2022March 12, 2022

https://thegradient.pub/bootstrapping-labels-via-_-supervision-human-in-the-loop/ Bootstrapping Labels via _ Supervision & Human-In-The-LoopEugene Yan05.Mar.2022 “Collecting training labels is a seldom discussed art…. In this write-up, we’ll discuss semi, active, and weakly supervised learning, and seeContinue readingWhat I Read: Bootstrapping Labels

What I Read: Systems for Machine Learning

By Andrew Fairless on September 14, 2021August 20, 2021

https://thegradient.pub/systems-for-machine-learning/ Systems for Machine LearningKabir Nagrecha14.Aug.2021 “Machine learning’s increasing importance to real-world applications brought awareness of a new field focused on ML in practice – machine learning systems (or, asContinue readingWhat I Read: Systems for Machine Learning

What I Read: Dataset Curation for NLP Projects

By Andrew Fairless on June 15, 2021May 30, 2021

https://www.kdnuggets.com/2021/05/4-tips-dataset-curation-nlp-projects.html 4 Tips for Dataset Curation for NLP ProjectsBy Paul Barba, Chief Scientist, Lexalytics. “After many years of painfully learned lessons from managing and implementing AI and ML projects, I’veContinue readingWhat I Read: Dataset Curation for NLP Projects

What I Read: Reproducing Deep Double Descent

By Andrew Fairless on February 7, 2021January 11, 2021

https://hippocampus-garden.com/double_descent/ Reproducing Deep Double DescentShion HondaJune 13, 2020 “…you’ve probably heard of bias-variance trade-off… But, do you know that there is a continuation of the U-shaped test risk curve? Surprisingly,Continue readingWhat I Read: Reproducing Deep Double Descent

What I Read: Deep Double Descent: Where Bigger Models and More Data Hurt

By Andrew Fairless on February 6, 2021January 11, 2021

https://arxiv.org/abs/1912.02292 Deep Double Descent: Where Bigger Models and More Data HurtPreetum Nakkiran, Gal Kaplun, Yamini Bansal, Tristan Yang, Boaz Barak, Ilya Sutskever “We show that a variety of modern deepContinue readingWhat I Read: Deep Double Descent: Where Bigger Models and More Data Hurt

What I Read: “Less than one”-shot learning

By Andrew Fairless on February 4, 2021January 10, 2021

https://www.technologyreview.com/2020/10/16/1010566/ai-machine-learning-with-tiny-data/ Artificial intelligence/Machine learningA radical new technique lets AI learn with practically no data“Less than one”-shot learning can teach a model to identify more objects than the number of examplesContinue readingWhat I Read: “Less than one”-shot learning

What I Read: Production with Deep Semi-Supervised Learning

By Andrew Fairless on January 28, 2021January 2, 2021

https://towardsdatascience.com/from-research-to-production-with-deep-semi-supervised-learning-7caaedc39093 From Research to Production with Deep Semi-Supervised LearningVarun NairSep 25 “Semi-supervised learning (SSL), a subfield that combines both supervised and unsupervised learning, has grown in popularity in the deepContinue readingWhat I Read: Production with Deep Semi-Supervised Learning

Tag: labeling