https://huggingface.co/blog/clefourrier/llm-evaluation
Let’s talk about LLM evaluation
Clémentine Fourrier, May 23, 2024
“There are, to my knowledge, at the moment, 3 main ways to do evaluation: automated benchmarking, using humans as judges, and
What I Read: Attention, transformers
Attention in transformers, visually explained | Chapter 6, Deep Learning
3Blue1Brown
“Demystifying attention, the key mechanism inside transformers and LLMs.”
What I Read: Mamba Explained
https://thegradient.pub/mamba-explained
Mamba Explained
Kola Ayonrinde, 27.Mar.2024
“Mamba promises similar performance (and crucially similar scaling laws) as the Transformer whilst being feasible at long sequence lengths (say 1 million tokens).”