policy gradient – Andrew Fairless, Ph.D.


Andrew Fairless, Ph.D.

Data, Science, and Tinkering


Tag: policy gradient


What I Read: How Generally Capable Agents Trained

By Andrew Fairless on September 23, 2021 (September 4, 2021)

https://www.lesswrong.com/posts/DreKBuMvK7fdESmSJ/how-deepmind-s-generally-capable-agents-were-trained

How DeepMind’s Generally Capable Agents Were Trained
by 1a3orn, 20th Aug 2021

“One of DeepMind’s latest papers… explains how DeepMind produced agents that can successfully play games as complex as hide-and-seek

