https://horace.io/brrr_intro.html Making Deep Learning Go Brrrr From First PrinciplesHorace He “So, you want to improve the performance of your deep learning model. How might you approach such a task?”
What I Read: Researchers Build AI That Builds AI
https://www.quantamagazine.org/researchers-build-ai-that-builds-ai-20220125/ Researchers Build AI That Builds AIAnil AnanthaswamyJanuary 25, 2022“By using hypernetworks, researchers can now preemptively fine-tune artificial neural networks, saving some of the time and expense of training.”
What I Read: Deep Learning Optimization Theory
https://towardsdatascience.com/deep-learning-optimization-theory-introduction-148b3504b20f?gi=ff0bd10cc9fe Deep Learning Optimization Theory — IntroductionUnderstanding the theory of optimization in deep learning is crucial to enable progress. This post introduces the experimental and theoretical approaches to studying it.Omri
What I Read: Limits Discovered in Quest for Optimal Solutions
https://www.quantamagazine.org/surprising-limits-discovered-in-quest-for-optimal-solutions-20211101/ Surprising Limits Discovered in Quest for Optimal SolutionsMax G. LevyContributing WriterNovember 1, 2021 “Algorithms that zero in on solutions to optimization problems are the beating heart of machine reasoning.
What I Read: Computer Scientists Discover Limits of Major Research Algorithm
https://www.quantamagazine.org/computer-scientists-discover-limits-of-major-research-algorithm-20210817/ Computer Scientists Discover Limits of Major Research AlgorithmNick ThiemeAugust 17, 2021 “Many aspects of modern applied research rely on a crucial algorithm called gradient descent…. researchers have never fully