https://huggingface.co/blog/bloom-megatron-deepspeed The Technology Behind BLOOM TrainingStas BekmanPublished July 14, 2022. “…training ever larger language models has become the norm… the hidden knowledge about how to train such models rarely gets
What I Read: Graph Neural Networks, Differential Geometry, Algebraic Topology
https://towardsdatascience.com/graph-neural-networks-through-the-lens-of-differential-geometry-and-algebraic-topology-3a7c3c22d5f Graph Neural Networks through the lens of Differential Geometry and Algebraic TopologyMichael Bronstein “Differential geometry and algebraic topology are not encountered very frequently in mainstream machine learning… tools from
What I Read: Deep Learning Recommendation Models
https://www.kdnuggets.com/2021/04/deep-learning-recommendation-models-dlrm-deep-dive.html Deep Learning Recommendation Models (DLRM): A Deep DiveBy Nishant Kumar, Data Science Professional. “This deep dive article presents the architecture and deployment issues experienced with the deep learning recommendation