What I Read: Building on evaluation quicksand
https://www.interconnects.ai/p/building-on-evaluation-quicksand Building on evaluation quicksand | Nathan Lambert | Oct 16, 2024 “In my article on ‘Big Tech’s LLM evals are just marketing,’ I didn’t uncover the deeper reasons as to why we can’t fully…”
What I Read: Transformers Inference Optimization
https://astralord.github.io/posts/transformer-inference-optimization-toolset Transformers Inference Optimization Toolset | Aleksandr Samarin | Oct 1, 2024 “Large Language Models are pushing the boundaries of artificial intelligence, but their immense size poses significant computational challenges. As these models grow, …”
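One optimization posts like this typically survey is KV caching. As a rough, hedged sketch (pure NumPy, single head, my own hypothetical names, not code from the post):

```python
# Minimal illustration of KV caching during autoregressive decoding:
# keep keys/values from earlier steps instead of recomputing them.
import numpy as np

def attend(q, K, V):
    # Scaled dot-product attention for a single query vector.
    scores = K @ q / np.sqrt(q.shape[-1])
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ V

rng = np.random.default_rng(0)
d = 16                          # head dimension (illustrative)
K_cache = np.empty((0, d))      # cached keys from earlier steps
V_cache = np.empty((0, d))      # cached values from earlier steps

for step in range(4):           # toy decoding loop
    q, k, v = rng.normal(size=(3, d))
    # Append only the new key/value pair; the cache makes each step
    # cost O(sequence length) instead of recomputing all projections.
    K_cache = np.vstack([K_cache, k])
    V_cache = np.vstack([V_cache, v])
    out = attend(q, K_cache, V_cache)
    print(step, out.shape)      # (16,) each step
```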
What I Read: LLMs, 2024
https://simonwillison.net/2024/Dec/31/llms-in-2024 Things we learned about LLMs in 2024 | Simon Willison | Dec 31, 2024 “A lot has happened in the world of Large Language Models over the course of 2024.”
What I Read: embedding models
https://unstructured.io/blog/understanding-embedding-models-make-an-informed-choice-for-your-rag Understanding embedding models: make an informed choice for your RAG | Maria Khalusova | Aug 13, 2024 “How do you choose a suitable embedding model for your RAG application?”
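For context, the kind of comparison the post motivates looks roughly like this (model name and texts are my illustrative assumptions, not the post’s code):

```python
# Embed a query and candidate passages, then rank by cosine similarity,
# which is the basic retrieval step an embedding model powers in RAG.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # small general-purpose model

passages = [
    "Embeddings map text to dense vectors for semantic search.",
    "The capital of France is Paris.",
]
query = "How do embedding models support retrieval?"

# Normalized embeddings make dot product equal to cosine similarity.
p_emb = model.encode(passages, normalize_embeddings=True)
q_emb = model.encode(query, normalize_embeddings=True)

scores = util.cos_sim(q_emb, p_emb)  # shape (1, len(passages))
print(scores)  # the retrieval-related passage should score higher
```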
What I Watch: How LLMs store facts
How might LLMs store facts | Chapter 7, Deep Learning | 3Blue1Brown | Aug 31, 2024 “Unpacking the multilayer perceptrons in a transformer, and how they may store facts”
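The block the video unpacks is the transformer’s MLP (feed-forward) layer. A minimal sketch in PyTorch, with illustrative dimensions and my own comments on the video’s interpretation:

```python
# The transformer MLP block: expand, apply a nonlinearity, project back.
import torch
import torch.nn as nn

class MLP(nn.Module):
    def __init__(self, d_model: int = 64, d_hidden: int = 256):
        super().__init__()
        self.up = nn.Linear(d_model, d_hidden)    # rows act like learned "questions"
        self.act = nn.GELU()
        self.down = nn.Linear(d_hidden, d_model)  # columns act like learned "answers"

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # One reading from the video: a hidden unit fires when the residual
        # stream matches its direction, then writes a stored vector back,
        # which is one way a fact could be encoded in the weights.
        return self.down(self.act(self.up(x)))

x = torch.randn(1, 10, 64)   # (batch, tokens, d_model)
print(MLP()(x).shape)        # torch.Size([1, 10, 64])
```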