natural language processing – Andrew Fairless, Ph.D.

What I Read: LLMs, medicine

By Andrew Fairless on June 11, 2025March 10, 2025

https://www.tanishq.ai/blog/posts/llm-medical-evals LLMs in medicine: evaluations, advances, and the futureTanishq Mathew AbrahamMarch 4, 2025 “Large Language Models (LLMs) have shown significant potential for medical applications yet many challenges remain.”

What I Read: memorization, novelty

By Andrew Fairless on May 1, 2025February 1, 2025

https://blog.kjamistan.com/how-memorization-happens-novelty.html How memorization happens: Novelty09 Dezember 2024 “…repeated text and images incentivize training data memorization, but that’s not the only training data that machine learning models memorize. Let’s take aContinue readingWhat I Read: memorization, novelty

What I Read: Tensor Dimensions, Transformers

By Andrew Fairless on April 29, 2025January 28, 2025

https://huggingface.co/blog/not-lain/tensor-dims Mastering Tensor Dimensions in TransformersHafedh HichriJanuary 12, 2025 “Most generative AI models are built using a decoder-only architecture. In this blog post, we’ll explore a simple text generation model,Continue readingWhat I Read: Tensor Dimensions, Transformers

What I Read: cosine similarity

By Andrew Fairless on April 27, 2025January 28, 2025

https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity Don’t use cosine similarity carelesslyPiotr Migdał14 Jan 2025 “…we’ll see that blindly applying cosine similarity to vectors can lead us astray. While embeddings do capture similarities, they often reflectContinue readingWhat I Read: cosine similarity

What I Read: impossible languages

By Andrew Fairless on April 23, 2025January 18, 2025

https://www.quantamagazine.org/can-ai-models-show-us-how-people-learn-impossible-languages-point-a-way-20250113/ Can AI Models Show Us How People Learn? Impossible Languages Point a Way.Ben Brubaker1/13/25 11:00 AM “Certain grammatical rules never appear in any known language. By constructing artificial languagesContinue readingWhat I Read: impossible languages

What I Read: Autoencoders, Interpretability

By Andrew Fairless on March 25, 2025December 1, 2024

https://adamkarvonen.github.io/machine_learning/2024/06/11/sae-intuitions.html An Intuitive Explanation of Sparse Autoencoders for LLM InterpretabilityAdam KarvonenJun 11, 2024 “Sparse Autoencoders (SAEs) have recently become popular for interpretability of machine learning models…”

What I Read: Multimodal LLMs

By Andrew Fairless on March 6, 2025November 17, 2024

https://magazine.sebastianraschka.com/p/understanding-multimodal-llms Understanding Multimodal LLMsSebastian Raschka, PhDNov 03, 2024 “In this article, I aim to explain how multimodal LLMs function. Additionally, I will review and summarize roughly a dozen other recentContinue readingWhat I Read: Multimodal LLMs

What I Read: LLMs, School Math

By Andrew Fairless on March 5, 2025November 16, 2024

https://towardsdatascience.com/understanding-llms-from-scratch-using-middle-school-math-e602d27ec876?gi=551c5bfd7f21 Understanding LLMs from Scratch Using Middle School MathRohit PatelOct 19, 2024 “In this article, we talk about how Large Language Models (LLMs) work, from scratch — assuming only thatContinue readingWhat I Read: LLMs, School Math

What I Read: Debate, AI

By Andrew Fairless on March 3, 2025November 16, 2024

Debate May Help AI Models Converge on Truth Debate May Help AI Models Converge on TruthStephen OrnesNovember 8, 2024 “Letting AI systems argue with each other may help expose whenContinue readingWhat I Read: Debate, AI

What I Read: LLM judge

By Andrew Fairless on February 25, 2025November 9, 2024

https://hamel.dev/blog/posts/llm-judge Creating a LLM-as-a-Judge That Drives Business ResultsHamel HusainOctober 29, 2024 “Earlier this year, I wrote Your AI product needs evals. Many of you asked, “How do I get startedContinue readingWhat I Read: LLM judge

Tag: natural language processing