linear algebra – Andrew Fairless, Ph.D.

What I Read: cosine similarity

By Andrew Fairless on April 27, 2025January 28, 2025

https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity Don’t use cosine similarity carelesslyPiotr Migdał14 Jan 2025 “…we’ll see that blindly applying cosine similarity to vectors can lead us astray. While embeddings do capture similarities, they often reflectContinue readingWhat I Read: cosine similarity

What I Read: Autoencoders, Interpretability

By Andrew Fairless on March 25, 2025December 1, 2024

https://adamkarvonen.github.io/machine_learning/2024/06/11/sae-intuitions.html An Intuitive Explanation of Sparse Autoencoders for LLM InterpretabilityAdam KarvonenJun 11, 2024 “Sparse Autoencoders (SAEs) have recently become popular for interpretability of machine learning models…”

What I Read: Shapes, Matrix Multiplications

By Andrew Fairless on February 27, 2025November 16, 2024

https://www.thonking.ai/p/what-shapes-do-matrix-multiplications What Shapes Do Matrix Multiplications Like?Horace HeApr 01, 2024 “It has become tribal knowledge that the particular shapes chosen for matmuls has a surprisingly large effect on their performance.Continue readingWhat I Read: Shapes, Matrix Multiplications

What I Read: Gaussians

By Andrew Fairless on February 13, 2025October 25, 2024

https://gestalt.ink/gaussians Understanding Gaussians “The Gaussian distribution, or normal distribution is a key subject in statistics, machine learning, physics, and pretty much any other field that deals with data and probability.”

What I Read: Mamba, State Space

By Andrew Fairless on February 11, 2025October 25, 2024

https://newsletter.maartengrootendorst.com/p/a-visual-guide-to-mamba-and-state A Visual Guide to Mamba and State Space ModelsMaarten GrootendorstFeb 19, 2024 “To further improve LLMs, new architectures are developed that might even outperform the Transformer architecture. One ofContinue readingWhat I Read: Mamba, State Space

What I Read: cosine similarity

By Andrew Fairless on February 5, 2025October 25, 2024

https://tomhazledine.com/cosine-similarity-alternatives Alternatives to cosine similarityTom Hazledine9/20/24 8:00 PM “Cosine similarity is the recommended way to compare vectors, but what other distance functions are there? And are any of them better?”

What I Read: Transformers Inference Optimization

By Andrew Fairless on January 27, 2025October 19, 2024

https://astralord.github.io/posts/transformer-inference-optimization-toolset Transformers Inference Optimization ToolsetAleksandr SamarinOct 1, 2024 “Large Language Models are pushing the boundaries of artificial intelligence, but their immense size poses significant computational challenges. As these models grow,Continue readingWhat I Read: Transformers Inference Optimization

What I Read: Sphering Transform

By Andrew Fairless on January 7, 2025September 29, 2024

Detecting Data Differences Using the Sphering Transform Detecting Data Differences Using the Sphering TransformBy Nina ZumelAugust 20, 2023 “Why do we want to sphere-transform our data? One reason is thatContinue readingWhat I Read: Sphering Transform

What I Read: embedding models

By Andrew Fairless on January 6, 2025September 29, 2024

https://unstructured.io/blog/understanding-embedding-models-make-an-informed-choice-for-your-rag Understanding embedding models: make an informed choice for your RAGMaria KhalusovaAug 13, 2024 “How do you choose a suitable embedding model for your RAG application?”

What I Read: Toy Models of Superposition

By Andrew Fairless on December 19, 2024September 29, 2024

https://transformer-circuits.pub/2022/toy_model/index.html Toy Models of SuperpositionNelson Elhage, Tristan Hume, Catherine Olsson, Nicholas Schiefer, Tom Henighan, Shauna Kravec, Zac Hatfield-Dodds, Robert Lasenby, Dawn Drain, Carol Chen, Roger Grosse, Sam McCandlish, Jared Kaplan,Continue readingWhat I Read: Toy Models of Superposition

Tag: linear algebra