https://arxiv.org/abs/2402.17764
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong
What I Read: SwAV method
https://m0nads.wordpress.com/2021/01/08/the-swav-method/
The SwAV method
Antonio Ferraioli
“The SwAV method is a clustering-based online method whose goal is to learn visual features in an online fashion without supervision.”
What I Read: Revisiting Sutton’s Bitter Lesson for AI
https://blog.exxactcorp.com/compute-goes-brrr-revisiting-suttons-bitter-lesson-artificial-intelligence/
Compute Goes Brrr: Revisiting Sutton’s Bitter Lesson for Artificial Intelligence
Marketing, October 27, 2020
“The main driver of AI progress, according to Sutton, is the increasing availability of compute