What I Read: 1-bit LLMs, 1.58 Bits
"The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" by Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong…
https://arxiv.org/abs/2402.17764

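The "1.58 bits" in the title comes from the paper's ternary weights: each parameter takes one of three values, {-1, 0, +1}, and a three-way choice carries log2(3) ≈ 1.58 bits of information. A quick arithmetic check (my sketch, not code from the paper):

```python
import math

# BitNet b1.58 restricts every weight to the ternary set {-1, 0, +1}.
# The information content of a uniform 3-way choice is log2(3) bits.
bits_per_weight = math.log2(3)
print(round(bits_per_weight, 2))  # → 1.58
```
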
What I Read: diffusion distillation
"The paradox of diffusion distillation" by Sander Dieleman, February 28, 2024
https://sander.ai/2024/02/28/paradox.html
"…let’s take a closer look at the various ways in which the number of sampling steps required to get good results…"

What I Read: Mamba, Easy Way
"Mamba: The Easy Way" by Jack Cook, February 23, 2024
https://jackcook.com/2024/02/23/mamba.html
"Mamba appears to outperform similarly-sized Transformers while scaling linearly with sequence length…. If… you’re looking for a higher-level overview of Mamba’s big…"

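The "scaling linearly with sequence length" claim contrasts with self-attention, whose cost grows quadratically because every token attends to every other token. A toy cost model (illustrative only; the function names and the fixed state size `d` are my own, not from the post):

```python
# Rough per-layer operation counts as a function of sequence length L,
# with a fixed per-token feature/state size d.
def attention_ops(L, d=64):
    return L * L * d  # each of L tokens attends to all L tokens: O(L^2)

def ssm_scan_ops(L, d=64):
    return L * d  # one fixed-size recurrent state update per token: O(L)

# The cost ratio grows with L, so the gap widens on long sequences.
for L in (1_000, 10_000):
    print(L, attention_ops(L) // ssm_scan_ops(L))
```
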
What I Read: Mamba
"Mamba No. 5 (A Little Bit Of…)" by James Chen, February 12, 2024
https://jameschen.io/jekyll/update/2024/02/12/mamba.html
"…I attempt to provide a walkthrough of the essence of the Mamba state space model architecture, occasionally sacrificing some…"

What I Read: Structured State Space Sequence Models
"Beyond Transformers: Structured State Space Sequence Models" by Chetan Nichkawde, January 22, 2024
https://cnichkawde.github.io/statespacesequencemodels.html
"A new paradigm is rapidly evolving within the realm of sequence modeling that presents a marked advancement over the…"

What I Read: Forgetting Can Help AI Learn
"How Selective Forgetting Can Help AI Learn Better" by Amos Zeeberg, February 28, 2024
https://www.quantamagazine.org/how-selective-forgetting-can-help-ai-learn-better-20240228/
"Erasing key information during training results in machine learning models that can learn new languages faster and more…"

What I Read: Diffusion Model theory
"Building Diffusion Model’s theory from ground up" by Ayan Das, May 7, 2024
https://ayandas.me/2024/blog/diffusion-theory-from-scratch/
"…we will go back and revisit the ‘fundamental ingredients’ behind the SDE formulation, and show how the idea can…"

What I Read: Compound AI Systems
"The Shift from Models to Compound AI Systems" by Matei Zaharia, Omar Khattab, Lingjiao Chen, Jared Quincy Davis, Heather Miller, Chris Potts, James Zou, Michael Carbin, Jonathan Frankle, Naveen Rao, Ali…
https://bair.berkeley.edu/blog/2024/02/18/compound-ai-systems/

What I Read: Scaling ChatGPT, Engineering Challenges
"Scaling ChatGPT: Five Real-World Engineering Challenges" by Gergely Orosz, February 20, 2024
https://newsletter.pragmaticengineer.com/p/scaling-chatgpt
"Just one year after its launch, ChatGPT had more than 100M weekly users. In order to meet this explosive demand,…"