loss – Andrew Fairless, Ph.D.

What I Read: RL, PPO, GRPO

By Andrew Fairless on May 26, 2025 / February 22, 2025

https://yugeten.github.io/posts/2025/01/ppogrpo
A vision researcher’s guide to some RL stuff: PPO & GRPO
Yuge (Jimmy) Shi
January 31, 2025
“This is a deep dive into Proximal Policy Optimization (PPO), which is one of…”

What I Read: group relative policy optimization

By Andrew Fairless on May 22, 2025 / February 22, 2025

https://superb-makemake-3a4.notion.site/group-relative-policy-optimization-GRPO-18c41736f0fd806eb39dc35031758885
group relative policy optimization (GRPO)
Apoorv Nandan
Jan 31, 2025
“GRPO became popular primarily due to the success of deepseek r1, which used this algorithm to train reasoning capabilities into their…”

What I Read: VAE

By Andrew Fairless on May 7, 2025 / February 2, 2025

https://www.rehansheikh.com/blog/vae
What the F*** is a VAE?
Rehan Sheikh
January 23, 2025
“A disentangled variational autoencoder aims for each latent dimension… to correspond to a single factor of variation in your dataset.”

What I Read: tilted loss

By Andrew Fairless on November 25, 2024 / August 26, 2024

https://alexshtf.github.io/2024/06/14/Untilting.html
Alex Shtoff
Untilting the tilted loss
Jun 14, 2024
“Typically in machine learning we train a model by minimizing the average loss…. The parameter t can be thought as a kind…”

What I Read: KL All You Need

By Andrew Fairless on August 21, 2024 / June 4, 2024

https://blog.alexalemi.com/kl-is-all-you-need.html
KL is All You Need
Alexander A. Alemi
2024-01-08
“…the core of essentially all modern machine learning methods is a single universal objective: Kullback-Leibler (KL) divergence minimization…. Understand KL, understand the…”

What I Read: Matryoshka Embedding

By Andrew Fairless on July 17, 2024 / May 6, 2024

https://huggingface.co/blog/matryoshka
Introduction to Matryoshka Embedding Models
Tom Aarsen, Joshua, Omar Sanseviero
February 23, 2024
“…Kusupati et al. (2022) were inspired to create embedding models whose embeddings could reasonably be shrunk without suffering too much…”

What I Read: Adversarial Attacks on LLMs

By Andrew Fairless on February 6, 2024 / December 19, 2023

https://lilianweng.github.io/posts/2023-10-25-adv-attack-llm/
Adversarial Attacks on LLMs
Lilian Weng
October 25, 2023
“Adversarial attacks are inputs that trigger the model to output something undesired.”

What I Read: LLMs

By Andrew Fairless on September 7, 2023 / August 1, 2023

https://willthompson.name/what-we-know-about-llms-primer
What We Know About LLMs (Primer)
Will Thompson (Twitter)
July 23, 2023
“…it is worth reflecting on what we concretely know about LLMs at this point in time and how these…”

What I Read: Perspectives on diffusion

By Andrew Fairless on August 30, 2023 / July 30, 2023

https://sander.ai/2023/07/20/perspectives.html
Perspectives on diffusion
Sander Dieleman
July 20, 2023
“Diffusion models appear to come in many shapes and forms…. these various perspectives each reveal new connections and are a breeding ground for…”

What I Read: Applying BERT to Speech

By Andrew Fairless on June 27, 2022

https://thegradient.pub/an-illustrated-tour-of-applying-bert-to-speech-data/
An Illustrated Tour of Applying BERT to Speech Data
Jonathan Boigne
10.May.2022
“Could we replace the text input in BERT with a speech sequence, mask a part of it, and similarly…”

Tag: loss