https://mindfulmodeler.substack.com/p/shedding-light-on-impossibility-theorems Shedding light on “Impossibility Theorems for Feature Attribution”: Is SHAP doomed?Christoph MolnarJun 18, 2024 “tl;dr: Don’t use SHAP for counterfactual questions or any questions about “slightly changing feature values
What I Watch: How LLMs store facts
How might LLMs store facts | Chapter 7, Deep Learning3Blue1BrownAug 31, 2024 “Unpacking the multilayer perceptrons in a transformer, and how they may store facts”
What I Watch: compare high dimensional vectors
A new way to compare high dimensional vectorsTunadorableAug 26, 2024 “Surpassing Cosine Similarity for Multidimensional Comparisons: Dimension Insensitive Euclidean Metric (DIEM)”
What I Read: passively learned, causality
What can be passively learned about causality?Simons InstituteAndrew Lampinen (Google DeepMind)Jun 25, 2024 “What could language models learn about causality and experimentation from their passive training?”