https://huggingface.co/blog/not-lain/tensor-dims
Mastering Tensor Dimensions in Transformers
Hafedh Hichri
January 12, 2025
“Most generative AI models are built using a decoder-only architecture. In this blog post, we’ll explore a simple text generation model, as illustrated below.”