What I Read: Sliding Window Attention, Longformer
https://amaarora.github.io/posts/2024-07-04%20SWA.html Sliding Window Attention, Longformer – The Long-Document Transformer, by Aman Arora, July 4, 2024. “…we will take a deep dive into Sliding Window Attention (SWA) that was introduced as part of the Longformer architecture.”
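To make the idea concrete (this is not the post's own code), here is a minimal NumPy sketch of sliding window attention: each query position attends only to keys within a fixed window on either side, so cost grows as O(n·w) instead of O(n²). The window size and tensor shapes are invented for the example, and the full n×n score matrix is formed only for clarity; a real implementation avoids materializing it.

```python
import numpy as np

def sliding_window_attention(q, k, v, window=2):
    """Sketch of Longformer-style sliding window attention:
    position i attends only to positions j with |i - j| <= window."""
    n, d = q.shape
    scores = q @ k.T / np.sqrt(d)  # (n, n); built densely only for illustration
    # Mask out everything outside the local window.
    idx = np.arange(n)
    mask = np.abs(idx[:, None] - idx[None, :]) > window
    scores[mask] = -np.inf
    # Row-wise softmax; masked entries contribute zero weight.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
q = k = v = rng.normal(size=(8, 16))
out = sliding_window_attention(q, k, v, window=2)
print(out.shape)  # (8, 16)
```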
What I Read: What’s Fair, What’s Hard
https://www.quantamagazine.org/the-question-of-whats-fair-illuminates-the-question-of-whats-hard-20240624 The Question of What’s Fair Illuminates the Question of What’s Hard, by Lakshmi Chandrasekaran, June 24, 2024. “Computational complexity theorists have discovered a surprising new way to understand what makes certain problems hard.”
What I Read: Detecting hallucinations, LLMs, semantic entropy
https://oatml.cs.ox.ac.uk/blog/2024/06/19/detecting_hallucinations_2024.html Detecting hallucinations in large language models using semantic entropy, by Sebastian Farquhar, Jannik Kossen, Lorenz Kuhn, Yarin Gal, June 19, 2024. “We show how one can use uncertainty to detect confabulations.”
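A rough Python sketch of the method's shape: sample several answers, cluster them by meaning, and take the entropy of the cluster distribution; high entropy signals confabulation. The `equivalent` predicate here is a stand-in for the paper's bidirectional-entailment clustering (done with an NLI model), and the toy answers are invented.

```python
import math

def semantic_entropy(samples, equivalent):
    """Cluster sampled answers into meaning-equivalent groups,
    then return the entropy of the cluster distribution."""
    clusters = []
    for s in samples:
        for c in clusters:
            if equivalent(s, c[0]):  # same meaning as this cluster?
                c.append(s)
                break
        else:
            clusters.append([s])  # new meaning: start a new cluster
    n = len(samples)
    probs = [len(c) / n for c in clusters]
    return -sum(p * math.log(p) for p in probs)

# Toy equivalence: case-insensitive match (a real system would use
# bidirectional NLI entailment between answers).
answers = ["Paris", "paris", "Lyon", "Paris", "Marseille"]
print(semantic_entropy(answers, lambda a, b: a.lower() == b.lower()))
```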
What I Read: AI Engineers, Search
https://softwaredoug.com/blog/2024/06/25/what-ai-engineers-need-to-know-search What AI Engineers Should Know about Search, by Doug Turnbull, June 25, 2024. “Things AI Engineers Should Know about Search”