Tired of ChatGPT screenshots? Miss the old days of watching RL agents walk around doing weird stuff?
Look no further: I'm excited to share my @DeepMind internship project, where we develop a method to stabilize optimization in constrained RL.
Link: arxiv.org/abs/2302.01275
🧵




