https://lilianweng.github.io/posts/2024-11-28-reward-hacking Reward Hacking in Reinforcement LearningLilian WengNovember 28, 2024 “Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or ambiguities in the reward function to achieve high rewards,
What I Read: data engineering
https://javisantana.com/2024/11/30/learnings-after-4-years-data-eng.html Learnings after 4 years working with +50 companies on data engineering projectsJavi Santana “I like to call it “high performance data engineering”…. Some practical learnings, in no particular order…”
What I Read: Autoencoders, Interpretability
https://adamkarvonen.github.io/machine_learning/2024/06/11/sae-intuitions.html An Intuitive Explanation of Sparse Autoencoders for LLM InterpretabilityAdam KarvonenJun 11, 2024 “Sparse Autoencoders (SAEs) have recently become popular for interpretability of machine learning models…”
What I Read: Mathematics, ML
https://thegradient.pub/shape-symmetry-structure Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning ResearchHenry Kvinge16.Nov.2024 “What is the Role of Mathematics in Modern Machine Learning?”
What I Read: Replacements
https://tech.instacart.com/how-instacart-uses-machine-learning-to-suggest-replacements-for-out-of-stock-products-8f80d03bb5af?gi=a743b3b54c9f How Instacart Uses Machine Learning to Suggest Replacements for Out-of-Stock ProductsAhsaas BajajNov 7, 2024 “You’ve carefully chosen each item, but then you’re notified that some products might not be