https://www.interconnects.ai/p/building-on-evaluation-quicksand
Building on evaluation quicksand
Nathan Lambert
Oct 16, 2024
“In my article on “Big Tech’s LLM evals are just marketing,” I didn’t uncover the deeper reasons as to why can’t fully believe these evaluations.”