optimization – Andrew Fairless, Ph.D.

What I Read: Reinforcement Learning

By Andrew Fairless on June 12, 2025March 10, 2025

The Interface Between Reinforcement Learning Theory and Language Model Post-Training The Interface Between Reinforcement Learning Theory and Language Model Post-TrainingAkshay Krishnamurthy, Audrey HuangMarch 5, 2025 “Even though existing RLHF methods…Continue readingWhat I Read: Reinforcement Learning

What I Read: group relative policy optimization

By Andrew Fairless on May 22, 2025February 22, 2025

https://superb-makemake-3a4.notion.site/group-relative-policy-optimization-GRPO-18c41736f0fd806eb39dc35031758885 group relative policy optimization (GRPO)Apoorv NandanJan 31, 2025 “GRPO became popular primarily due to the success of deepseek r1, which used this algorithm to train reasoning capabilities into theirContinue readingWhat I Read: group relative policy optimization

What I Read: optimizing softmax

By Andrew Fairless on April 16, 2025January 11, 2025

https://maharshi.bearblog.dev/optimizing-softmax-cuda Learning CUDA by optimizing softmax: A worklogMaharshi Pandya04 Jan, 2025 “Optimizing softmax, especially in the context of GPU programming with CUDA, presents many opportunities for learning.”

What I Read: Polars vs pandas

By Andrew Fairless on March 19, 2025November 20, 2024

https://labs.quansight.org/blog/dataframe-group-by The Polars vs pandas difference nobody is talking aboutMarcoGorelliNovember 11, 2024 “We’ll then take a look at elementary aggregations with both the pandas and Polars APIs. Finally, we’ll lookContinue readingWhat I Read: Polars vs pandas

What I Read: Multi Objective Optimisation

By Andrew Fairless on February 6, 2025October 25, 2024

https://blog.flipkart.tech/multi-objective-optimisation-in-suggestions-ranking-flipkart-49099b951eae?gi=04415d605535 Multi Objective Optimisation in Suggestions Ranking @ FlipkartPranjal SanjanwalaApr 19, 2024 “…we aim to provide a perfectly tailored set of suggestions for that user at that point in time.Continue readingWhat I Read: Multi Objective Optimisation

What I Read: sparsity, PyTorch, Hadamard product

By Andrew Fairless on December 2, 2024August 26, 2024

https://alexshtf.github.io/2024/07/07/HadamardParameterization.html Alex ShtoffFun with sparsity in PyTorch via Hadamard product parametrizationJul 7, 2024 “The beauty of sparsity inducing regularization is that we let our optimizer discover the sparsity patterns, insteadContinue readingWhat I Read: sparsity, PyTorch, Hadamard product

What I Read: tilted loss

By Andrew Fairless on November 25, 2024August 26, 2024

https://alexshtf.github.io/2024/06/14/Untilting.html Alex ShtoffUntilting the tilted lossJun 14, 2024 “Typically in machine learning we train a model by minimizing the average loss…. The parameter t can be thought as a kindContinue readingWhat I Read: tilted loss

What I Read: predicate pushdown

By Andrew Fairless on May 29, 2024March 23, 2024

https://pola.rs/posts/predicate-pushdown-query-optimizer/ The power of predicate pushdownPolars development teamTue, 19 Mar 2024 “…we will dive into the query optimizer and explain how one of the most important optimization rules works: predicateContinue readingWhat I Read: predicate pushdown

What I Read: reliance on AI-assisted decisions

By Andrew Fairless on May 15, 2024March 11, 2024

https://statmodeling.stat.columbia.edu/2024/03/06/defining-optimal-reliance-on-model-predictions-in-ai-assisted-decisions/Defining optimal reliance on model predictions in AI-assisted decisionsJessica Hullman3/6/24 12:31 PM “…AI-assisted decision task is of interest as organizations deploy predictive models to assist human decision-making in domains likeContinue readingWhat I Read: reliance on AI-assisted decisions

What I Read: diffusion distillation

By Andrew Fairless on May 13, 2024March 11, 2024

https://sander.ai/2024/02/28/paradox.html The paradox of diffusion distillationSander DielemanFebruary 28, 2024 “…let’s take a closer look at the various ways in which the number of sampling steps required to get good resultsContinue readingWhat I Read: diffusion distillation

Tag: optimization