https://thegradient.pub/othello/ Do Large Language Models learn world models or just surface statistics?Kenneth Li21.Jan.2023 “Large Language Models (LLM) are on fire…. How do these models achieve this kind of performance? Do
What I Read: Machines Learn, Teach Basics
https://www.quantamagazine.org/machines-learn-better-if-we-teach-them-the-basics-20230201/ Machines Learn Better if We Teach Them the BasicsMax G. LevyFebruary 1, 2023 “A wave of research improves reinforcement learning algorithms by pre-training them as if they were human.”
What I Read: Transformers Training
https://www.borealisai.com/research-blogs/tutorial-17-transformers-iii-training/ Tutorial #17: Transformers III Training08/06/2021P. Xu, S. Prince “…we discuss challenges with transformer training dynamics and introduce some of the tricks that practitioners use to get transformers to converge.”