https://sergeylevine.substack.com/p/offline-rl-and-large-language-models
Offline RL and Large Language Models
Sergey Levine, Dec 4
“What if the purpose of a language model should not be to generate text at all, at least not directly? …
What I Read: Learning to Imitate
https://ai.stanford.edu/blog/learning-to-imitate/
Learning to Imitate
Divyansh Garg, November 1, 2022
“A key aspect of human learning is imitation…. How can we enable our artificial agents to similarly acquire such fast learning ability?”