reinforcement learning – Page 3 – Andrew Fairless, Ph.D.

What I Read: Competitive Machine Learning

By Andrew Fairless on May 9, 2023March 30, 2023

https://mlcontests.com/state-of-competitive-machine-learning-2022/ The State of Competitive Machine Learning2022 Edition “We summarise the state of the competitive landscape and analyse the 200+ competitions that took place in 2022. Plus a deep diveContinue readingWhat I Read: Competitive Machine Learning

What I Read: Teach Computers Math

By Andrew Fairless on April 6, 2023February 22, 2023

https://www.quantamagazine.org/to-teach-computers-math-researchers-merge-ai-approaches-20230215/ To Teach Computers Math, Researchers Merge AI ApproachesKevin HartnettFebruary 15, 2023 “Large language models still struggle with basic reasoning tasks. Two new papers that apply machine learning to mathContinue readingWhat I Read: Teach Computers Math

What I Read: Machines Learn, Teach Basics

By Andrew Fairless on March 21, 2023February 6, 2023

https://www.quantamagazine.org/machines-learn-better-if-we-teach-them-the-basics-20230201/ Machines Learn Better if We Teach Them the BasicsMax G. LevyFebruary 1, 2023 “A wave of research improves reinforcement learning algorithms by pre-training them as if they were human.”

What I Read: AI, Human Values

By Andrew Fairless on February 22, 2023December 16, 2022

https://www.quantamagazine.org/what-does-it-mean-to-align-ai-with-human-values-20221213/ What Does It Mean to Align AI With Human Values?Melanie MitchellDecember 13, 2022 “Making sure our machines understand the intent behind our instructions is an important problem that requiresContinue readingWhat I Read: AI, Human Values

What I Read: Offline RL, Large Language Models

By Andrew Fairless on February 21, 2023December 16, 2022

https://sergeylevine.substack.com/p/offline-rl-and-large-language-models Offline RL and Large Language ModelsSergey LevineDec 4 “What if the purpose of a language model should not be to generate text at all, at least not directly? …Continue readingWhat I Read: Offline RL, Large Language Models

What I Read: Causal Confounds, Sequential Decision

By Andrew Fairless on February 6, 2023December 4, 2022

https://blog.ml.cmu.edu/2022/11/28/causal-confounds-in-sequential-decision-making/ Causal Confounds in Sequential Decision MakingGokul SwamyNovember 28, 2022 “Using techniques from causal inference, we derive provably correct and scalable algorithms for sequential decision making in these sorts ofContinue readingWhat I Read: Causal Confounds, Sequential Decision

What I Read: Matrix Multiplication

By Andrew Fairless on January 30, 2023December 4, 2022

https://www.quantamagazine.org/ai-reveals-new-possibilities-in-matrix-multiplication-20221123/ AI Reveals New Possibilities in Matrix MultiplicationBen BrubakerNovember 23, 2022 “Inspired by the results of a game-playing neural network, mathematicians have been making unexpected advances on an age-old mathContinue readingWhat I Read: Matrix Multiplication

What I Read: Learning to Imitate

By Andrew Fairless on January 17, 2023December 4, 2022

https://ai.stanford.edu/blog/learning-to-imitate/ Learning to ImitateDivyansh GargNovember 1, 2022 “A key aspect of human learning is imitation…. How can we enable our artificial agents to similarly acquire such fast learning ability?”

What I Read: The Farama Foundation

By Andrew Fairless on December 22, 2022November 5, 2022

https://farama.org/Announcing-The-Farama-Foundation Announcing The Farama FoundationThe future of open source reinforcement learning25 October 2022 “This means that the barrier to reinforcement learning seeing widespread deployment is a tooling problem…. Our grandContinue readingWhat I Read: The Farama Foundation

What I Read: Pre-Trained Models, Robotics

By Andrew Fairless on December 21, 2022November 5, 2022

https://sergeylevine.substack.com/p/general-purpose-pre-trained-models General-Purpose Pre-Trained Models in RoboticsCan we (pre-) train policies to control any robot for any task?Sergey LevineOct 16 “In robotics, learning policies from a small amount of data isContinue readingWhat I Read: Pre-Trained Models, Robotics

Tag: reinforcement learning