machine learning – Page 4 – Andrew Fairless, Ph.D.

What I Read: embedding models

By Andrew Fairless on January 6, 2025September 29, 2024

https://unstructured.io/blog/understanding-embedding-models-make-an-informed-choice-for-your-rag Understanding embedding models: make an informed choice for your RAGMaria KhalusovaAug 13, 2024 “How do you choose a suitable embedding model for your RAG application?”

What I Read: Toy Models of Superposition

By Andrew Fairless on December 19, 2024September 29, 2024

https://transformer-circuits.pub/2022/toy_model/index.html Toy Models of SuperpositionNelson Elhage, Tristan Hume, Catherine Olsson, Nicholas Schiefer, Tom Henighan, Shauna Kravec, Zac Hatfield-Dodds, Robert Lasenby, Dawn Drain, Carol Chen, Roger Grosse, Sam McCandlish, Jared Kaplan,Continue readingWhat I Read: Toy Models of Superposition

What I Read: Is SHAP doomed?

By Andrew Fairless on December 18, 2024September 3, 2024

https://mindfulmodeler.substack.com/p/shedding-light-on-impossibility-theorems Shedding light on “Impossibility Theorems for Feature Attribution”: Is SHAP doomed?Christoph MolnarJun 18, 2024 “tl;dr: Don’t use SHAP for counterfactual questions or any questions about “slightly changing feature valuesContinue readingWhat I Read: Is SHAP doomed?

What I Watch: How LLMs store facts

By Andrew Fairless on December 16, 2024September 3, 2024

How might LLMs store facts | Chapter 7, Deep Learning3Blue1BrownAug 31, 2024 “Unpacking the multilayer perceptrons in a transformer, and how they may store facts”

What I Read: Fine-tuning

By Andrew Fairless on December 11, 2024September 3, 2024

https://openpipe.ai/blog/fine-tuning-best-practices-series-introduction-and-chapter-1-training-data Fine-tuning Best Practices Series Introduction and Chapter 1: Training DataReid MayoAug 1, 2024 “We’ll explore how to choose the best data, common methods for collecting it, and common methodsContinue readingWhat I Read: Fine-tuning

What I Read: passively learned, causality

By Andrew Fairless on December 4, 2024August 26, 2024

What can be passively learned about causality?Simons InstituteAndrew Lampinen (Google DeepMind)Jun 25, 2024 “What could language models learn about causality and experimentation from their passive training?”

What I Read: Evaluating LLM-Evaluators

By Andrew Fairless on December 3, 2024August 26, 2024

https://eugeneyan.com/writing/llm-evaluators Evaluating the Effectiveness of LLM-Evaluators (aka LLM-as-Judge)Eugene Yan “After reading this, you’ll gain an intuition on how to apply, evaluate, and operate LLM-evaluators. We’ll learn when to apply (i)Continue readingWhat I Read: Evaluating LLM-Evaluators

What I Read: sparsity, PyTorch, Hadamard product

By Andrew Fairless on December 2, 2024August 26, 2024

https://alexshtf.github.io/2024/07/07/HadamardParameterization.html Alex ShtoffFun with sparsity in PyTorch via Hadamard product parametrizationJul 7, 2024 “The beauty of sparsity inducing regularization is that we let our optimizer discover the sparsity patterns, insteadContinue readingWhat I Read: sparsity, PyTorch, Hadamard product

What I Read: tilted loss

By Andrew Fairless on November 25, 2024August 26, 2024

https://alexshtf.github.io/2024/06/14/Untilting.html Alex ShtoffUntilting the tilted lossJun 14, 2024 “Typically in machine learning we train a model by minimizing the average loss…. The parameter t can be thought as a kindContinue readingWhat I Read: tilted loss

What I Read: Classifying pdfs

By Andrew Fairless on November 21, 2024August 26, 2024

https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html Classifying all of the pdfs on the internetSantiago Pedroza2024-08-18 “How would you classify all the pdfs in the internet? Well, that is what I tried doing this time.”

Tag: machine learning