https://distill.pub/2020/circuits/weight-banding/ Weight BandingMichael PetrovChelsea VossLudwig SchubertNick CammarataGabriel GohChris OlahApril 8, 2021DOI 10.23915/distill.00024.009 “Open up any ImageNet conv net and look at the weights in the last layer. You’ll find a
What I Read: Branch Specialization
https://distill.pub/2020/circuits/branch-specialization/ Branch SpecializationChelsea VossGabriel GohNick CammarataMichael PetrovLudwig SchubertChris OlahApril 5, 2021DOI 10.23915/distill.00024.008 “Branch specialization occurs when neural network layers are split up into branches. The neurons and circuits tend to
What I Read: Visualizing Weights
https://distill.pub/2020/circuits/visualizing-weights/ Visualizing WeightsChelsea VossNick CammarataGabriel GohMichael PetrovLudwig SchubertBen EganSwee Kiat LimChris OlahFeb. 4, 2021DOI 10.23915/distill.00024.007 “The problem of understanding a neural network is a little bit like reverse engineering a
What I Read: Why machine learning struggles with causality
https://bdtechtalks.com/2021/03/15/machine-learning-causality/ Why machine learning struggles with causalityBen DicksonMarch 15, 2021 “Why do machine learning models fail at generalizing beyond their narrow domains and training data?”
What I Read: Computer Scientist Who Tackles Inequality
https://www.quantamagazine.org/rediet-abebe-tackles-inequality-with-computer-science-20210401/ A Computer Scientist Who Tackles Inequality Through AlgorithmsRediet Abebe uses the tools of theoretical computer science to understand pressing social problems — and try to fix them.Rachel CrowellContributing WriterApril
What I Read: Deep Learning Recommendation Models
https://www.kdnuggets.com/2021/04/deep-learning-recommendation-models-dlrm-deep-dive.html Deep Learning Recommendation Models (DLRM): A Deep DiveBy Nishant Kumar, Data Science Professional. “This deep dive article presents the architecture and deployment issues experienced with the deep learning recommendation