https://gregorygundersen.com/blog/2020/04/11/moments/ Understanding MomentsGregory Gundersen11 April 2020 “Why are a distribution’s moments called “moments”? How does the equation for a moment capture the shape of a distribution? Why do we typically
What I Read: Distributed Training, Finetuning
https://sumanthrh.com/post/distributed-and-efficient-finetuning/ Everything about Distributed Training and Efficient FinetuningSumanth R HegdeLast updated on Oct 13, 2023 “practical guidelines and gotchas with multi-GPU and multi-node training”
What I Read: Retrieval Augmented Generation at scale
https://medium.com/@neum_ai/retrieval-augmented-generation-at-scale-building-a-distributed-system-for-synchronizing-and-eaa29162521 Retrieval Augmented Generation at scale — Building a distributed system for synchronizing and ingesting billions of text embeddingsNeum AISep 28 “…getting a Retrieval Augmented Generation (RAG) application started is
What I Read: AI System Beats Chess Puzzles
https://www.quantamagazine.org/google-deepmind-trains-artificial-brainstorming-in-chess-ai-20231115/ AI System Beats Chess Puzzles With ‘Artificial Brainstorming’Stephen OrnesNovember 15, 2023 “By bringing together disparate approaches, machines can reach a new level of creative problem-solving.”
What I Read: Tiny Language Models
https://www.quantamagazine.org/tiny-language-models-thrive-with-gpt-4-as-a-teacher-20231005/ Tiny Language Models Come of AgeBen Brubaker10/5/23 10:50 AM “To better understand how neural networks learn to simulate writing, researchers trained simpler versions on synthetic children’s stories.”