https://planetbanatt.net/articles/modelmerging.html Model Merging and YouEryk BanattAugust 2024 “Model Merging is a weird and experimental technique which lets you take two models and combine them together to get a new model.”
What I Read: optimizing softmax
https://maharshi.bearblog.dev/optimizing-softmax-cuda Learning CUDA by optimizing softmax: A worklogMaharshi Pandya04 Jan, 2025 “Optimizing softmax, especially in the context of GPU programming with CUDA, presents many opportunities for learning.”
What I Read: ML, Go
https://eli.thegreenplace.net/2024/gomlx-ml-in-go-without-python/#footnote-reference-2 GoMLX: ML in Go without PythonEli BenderskyNovember 22, 2024 at 07:00 “GoMLX is a relatively new Go package for ML that deserves some attention.”
What I Read: Autoencoders, Interpretability
https://adamkarvonen.github.io/machine_learning/2024/06/11/sae-intuitions.html An Intuitive Explanation of Sparse Autoencoders for LLM InterpretabilityAdam KarvonenJun 11, 2024 “Sparse Autoencoders (SAEs) have recently become popular for interpretability of machine learning models…”
What I Read: Mathematics, ML
https://thegradient.pub/shape-symmetry-structure Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning ResearchHenry Kvinge16.Nov.2024 “What is the Role of Mathematics in Modern Machine Learning?”