https://huggingface.co/blog/optimize-llm
Optimizing your LLM in production
September 15, 2023
Patrick von Platen
“…efficient LLM deployment…. pros and cons of adopting lower precision, provide a comprehensive exploration of the latest attention algorithms, and
What I Read: LLM-based Products
https://eugeneyan.com/writing/llm-patterns/
Patterns for Building LLM-based Systems & Products
eugeneyan
“This post is about practical patterns for integrating large language models (LLMs) into systems and products.”