https://huggingface.co/blog/bloom-megatron-deepspeed The Technology Behind BLOOM TrainingStas BekmanPublished July 14, 2022. “…training ever larger language models has become the norm… the hidden knowledge about how to train such models rarely gets
Data, Science, and Tinkering