https://blog.alexalemi.com/kl-is-all-you-need.html
KL is All You Need
Alexander A. Alemi
2024-01-08
“…the core of essentially all modern machine learning methods is a single universal objective: Kullback-Leibler (KL) divergence minimization…. Understand KL, understand the
What I Read: Diffusion models, new theoretical perspective
https://www.chenyang.co/diffusion.html
Diffusion models from scratch, from a new theoretical perspective
Chenyang Yuan
2023
“This tutorial aims to introduce diffusion models from an optimization perspective…”
What I Read: To Understand Transformers, Focus on Attention
https://drscotthawley.github.io/blog/posts/Transformers1-Attention.html
To Understand Transformers, Focus on Attention
Scott H. Hawley
August 21, 2023
“To Understand Transformers, Focus on Attention”