machine learning – Andrew Fairless, Ph.D.

What I Read: RL, PPO, GRPO

By Andrew Fairless on May 26, 2025 | February 22, 2025

https://yugeten.github.io/posts/2025/01/ppogrpo
A vision researcher’s guide to some RL stuff: PPO & GRPO
Yuge (Jimmy) Shi
January 31, 2025
“This is a deep dive into Proximal Policy Optimization (PPO), which is one of…”

What I Read: group relative policy optimization

By Andrew Fairless on May 22, 2025 | February 22, 2025

https://superb-makemake-3a4.notion.site/group-relative-policy-optimization-GRPO-18c41736f0fd806eb39dc35031758885
group relative policy optimization (GRPO)
Apoorv Nandan
Jan 31, 2025
“GRPO became popular primarily due to the success of deepseek r1, which used this algorithm to train reasoning capabilities into their…”

What I Read: Reasoning LLMs

By Andrew Fairless on May 21, 2025 | February 22, 2025

https://magazine.sebastianraschka.com/p/understanding-reasoning-llms
Understanding Reasoning LLMs
Sebastian Raschka, PhD
Feb 05, 2025
“This article describes the four main approaches to building reasoning models, or how we can enhance LLMs with reasoning capabilities.”

What I Read: VAE

By Andrew Fairless on May 7, 2025 | February 2, 2025

https://www.rehansheikh.com/blog/vae
What the F*** is a VAE?
Rehan Sheikh
January 23, 2025
“A disentangled variational autoencoder aims for each latent dimension… to correspond to a single factor of variation in your dataset.”

What I Read: memorization, novelty

By Andrew Fairless on May 1, 2025 | February 1, 2025

https://blog.kjamistan.com/how-memorization-happens-novelty.html
How memorization happens: Novelty
09 December 2024
“…repeated text and images incentivize training data memorization, but that’s not the only training data that machine learning models memorize. Let’s take a…”

What I Read: Adaptive LLMs

By Andrew Fairless on April 30, 2025 | January 30, 2025

https://sakana.ai/transformer-squared
Transformer²: Self-Adaptive LLMs
sakana.ai
January 15, 2025
“Imagine a machine learning system that could adjust its own weights dynamically to thrive in unfamiliar settings, essentially illustrating a system that evolves as…”

What I Read: Tensor Dimensions, Transformers

By Andrew Fairless on April 29, 2025 | January 28, 2025

https://huggingface.co/blog/not-lain/tensor-dims
Mastering Tensor Dimensions in Transformers
Hafedh Hichri
January 12, 2025
“Most generative AI models are built using a decoder-only architecture. In this blog post, we’ll explore a simple text generation model,…”

What I Read: pitfalls, building AI

By Andrew Fairless on April 28, 2025 | January 28, 2025

https://huyenchip.com//2025/01/16/ai-engineering-pitfalls.html
Common pitfalls when building generative AI applications
Chip Huyen
Jan 16, 2025
“As we’re still in the early days of building applications with foundation models, it’s normal to make mistakes. This…”

What I Read: cosine similarity

By Andrew Fairless on April 27, 2025 | January 28, 2025

https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity
Don’t use cosine similarity carelessly
Piotr Migdał
14 Jan 2025
“…we’ll see that blindly applying cosine similarity to vectors can lead us astray. While embeddings do capture similarities, they often reflect…”

What I Read: AI, HCI

By Andrew Fairless on April 24, 2025 | January 28, 2025

https://ianarawjo.medium.com/what-ai-engineers-can-learn-from-qualitative-research-methods-in-hci-5b29b9b7465a
What AI engineers can learn from qualitative research methods in HCI
Ian Arawjo
Jan 9, 2025
“Meet inductive coding and grounded theory, the new bread-and-butter of LLMOps”

Tag: machine learning