https://openpipe.ai/blog/fine-tuning-best-practices-series-introduction-and-chapter-1-training-data Fine-tuning Best Practices Series Introduction and Chapter 1: Training DataReid MayoAug 1, 2024 “We’ll explore how to choose the best data, common methods for collecting it, and common methods
What I Read: passively learned, causality
What can be passively learned about causality?Simons InstituteAndrew Lampinen (Google DeepMind)Jun 25, 2024 “What could language models learn about causality and experimentation from their passive training?”
What I Read: Classifying pdfs
https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html Classifying all of the pdfs on the internetSantiago Pedroza2024-08-18 “How would you classify all the pdfs in the internet? Well, that is what I tried doing this time.”