https://imbue.com/research/70b-infrastructure
From bare metal to a 70B model: infrastructure set-up and scripts
The Imbue Team
June 25, 2024
“…we trained a 70B parameter model from scratch on our own infrastructure that outperformed zero-shot GPT-4o on reasoning-related tasks. Today, we’re sharing an end-to-end guide for setting up the required infrastructure…”