https://drivendata.co/blog/ccds-v2 Cookiecutter Data Science V2by Peter Bull, Jay Qi, Chris KucharczykWed 22 May 2024 “…in the past 5 years, a lot has changed in data science tooling and MLOps. CookiecutterContinue readingWhat I Read: Cookiecutter Data Science V2
https://learnbybuilding.ai/tutorials/guiding-llm-output-with-dspy-assertions-and-suggestions Guiding LLM Output with DSPy Assertions and SuggestionsBill Chambers “Assertions in DSPy allow you to define strict rules and constraints that the LLM’s output must (or maybe that youContinue readingWhat I Read: LLM, DSPy Assertions and Suggestions
https://www.aidancooper.co.uk/how-to-beat-proprietary-llms How to Beat Proprietary LLMs With Smaller Open Source ModelsAidan CooperApr 26, 2024 “…we explore the unique advantages of open source LLMs, and how you can leverage them toContinue readingWhat I Read: LLMs, Open Source
https://www.milesmcbain.com/posts/data-analysis-reuse Patterns and anti-patterns of data analysis reuseMiles McBainMarch 10, 2024 “When I look at that function signature I hear the screams of matrices being ground to a fine powderContinue readingWhat I Read: anti-patterns, data reuse
https://martinfowler.com/articles/productize-data-sci-notebooks.html Don’t put data science notebooks into productionDavid Johnston18 November 2020 “…the discussion of how to productionize data science notebooks is missing the point. The goal should be to empowerContinue readingWhat I Read: Don’t put notebooks into production
https://www.aidancooper.co.uk/constrained-decoding A Guide to Structured Generation Using Constrained DecodingApr 8, 2024 “…there are techniques that ensure language models only return outputs that conform to your requirements. This article serves asContinue readingWhat I Read: Structured Generation, Constrained Decoding
https://pythonspeed.com/articles/slow-numba The wrong way to speed up your code with Numbaby Itamar Turner-Trauring21 Mar 2024 “If your NumPy-based code is too slow, you can sometimes use Numba to speed itContinue readingWhat I Read: speed up code, Numba
https://modal.com/blog/serverless-http Lambda on hard mode: Inside Modal’s web infrastructureEric ZhangMarch 14, 2024 “At Modal, we built an HTTP and WebSocket stack on our platform. In other words, your serverless functionsContinue readingWhat I Read: Modal’s web infrastructure
https://bair.berkeley.edu/blog/2024/02/18/compound-ai-systems/ The Shift from Models to Compound AI SystemsMatei Zaharia, Omar Khattab, Lingjiao Chen, Jared Quincy Davis, Heather Miller, Chris Potts, James Zou, Michael Carbin, Jonathan Frankle, Naveen Rao, AliContinue readingWhat I Read: Compound AI Systems
https://newsletter.pragmaticengineer.com/p/scaling-chatgpt Scaling ChatGPT: Five Real-World Engineering ChallengesGergely OroszFeb 20, 2024 “Just one year after its launch, ChatGPT had more than 100M weekly users. In order to meet this explosive demand,Continue readingWhat I Read: Scaling ChatGPT, Engineering Challenges