https://adamdrake.com/command-line-tools-can-be-235x-faster-than-your-hadoop-cluster.html
Command-line Tools can be 235x Faster than your Hadoop Cluster
Adam Drake
“One especially under-used approach for data processing is using standard shell tools and commands. The benefits of this
What I Read: notebooks, McDonalds of code
https://yobibyte.github.io/notebooks.html
notebooks are McDonalds of code
Vitaly Kurin
“I later saw the extensive use of notebooks everywhere, even in production, and I don’t laugh anymore. I’m scared and sad.”
What I Read: Summarization, LLMs
https://cameronrwolfe.substack.com/p/summarization-and-the-evolution-of
Summarization and the Evolution of LLMs
Cameron R. Wolfe, Ph.D., Jun 03, 2024
“How research on abstractive summarization changed language models forever…”
What I Read: Logarithms, Heteroskedasticity
https://allendowney.substack.com/p/logarithms-and-heteroskedasticity
Logarithms and Heteroskedasticity
Allen Downey, May 26, 2024
“A log transform is neither necessary nor sufficient to fix heteroskedasticity”