token – Andrew Fairless, Ph.D.

What I Read: memorization, novelty

By Andrew Fairless on May 1, 2025February 1, 2025

https://blog.kjamistan.com/how-memorization-happens-novelty.html How memorization happens: Novelty09 Dezember 2024 “…repeated text and images incentivize training data memorization, but that’s not the only training data that machine learning models memorize. Let’s take aContinue readingWhat I Read: memorization, novelty

What I Read: Tensor Dimensions, Transformers

By Andrew Fairless on April 29, 2025January 28, 2025

https://huggingface.co/blog/not-lain/tensor-dims Mastering Tensor Dimensions in TransformersHafedh HichriJanuary 12, 2025 “Most generative AI models are built using a decoder-only architecture. In this blog post, we’ll explore a simple text generation model,Continue readingWhat I Read: Tensor Dimensions, Transformers

What I Read: Adversarial Attacks on LLMs

By Andrew Fairless on February 6, 2024December 19, 2023

https://lilianweng.github.io/posts/2023-10-25-adv-attack-llm/ Adversarial Attacks on LLMsLilian WengOctober 25, 2023 “Adversarial attacks are inputs that trigger the model to output something undesired.”

Tag: token