RL News
Subscribe
Sign in
Home
Archive
About
Pre-training Offline RL on Wikipedia; Where to put your robot's eyes
Hello! Today (February 1st) was my birthday, and I spent a great deal of time pondering how to spend this year. In the end, I had two broad goals for…
Feb 2, 2022
•
Ryan Lee
RL Weekly 44: Reinforcement Learning with Videos and Automatic Data Augmentation
Dear readers,Last week, I received a couple of emails indicating that one issue every two weeks would work best for them. A big thank you to those who…
Dec 16, 2020
•
Ryan Lee
RL Weekly 43: Revisiting Experience Replay, On-Policy Methods, and Rainbow
Dear readers,Yesterday, DeepMind’s AlphaFold 2 made a major breakthrough for the protein folding problem. It is an exciting work that shows just how…
Dec 1, 2020
•
Ryan Lee
Most Popular
View all
RL Weekly 40: Catastrophic Interference and Policy Evaluation Networks
Mar 30, 2020
•
Ryan Lee
RL Weekly 35: Escaping Local Optimas in Distance-based Rewards and Choosing the Best Teacher
Nov 21, 2019
•
Ryan Lee
RL Weekly 38: Clipped objective is not why PPO works, and the Trap of Saliency maps
Dec 24, 2019
•
Ryan Lee
RL Weekly 44: Reinforcement Learning with Videos and Automatic Data Augmentation
Dec 16, 2020
•
Ryan Lee
RL Weekly 39: Intrinsic Motivation for Cooperation and Amortized Q-Learning
Feb 21, 2020
•
Ryan Lee
RL Weekly 42: Special Issue on NeurIPS 2020 Competitions
Jul 3, 2020
•
Ryan Lee
Latest
Top
RL Weekly 42: Special Issue on NeurIPS 2020 Competitions
Dear readers,This is a special issue of RL Weekly! The NeurIPS 2020 competitions have been announced, and there are four (4) competitions that…
Jul 3, 2020
•
Ryan Lee
RL Weekly 41: Adversarial Policies, Image Augmentation, and Self-Supervised Exploration with World Models
Dear readers,In this issue, we look at adversarial policy learning, image augmentation in RL, and self-supervised exploration through world models.I…
Jun 22, 2020
•
Ryan Lee
RL Weekly 40: Catastrophic Interference and Policy Evaluation Networks
Dear readers,With COVID-19, we are going through difficult times, both as individuals and as members of society. Personally, I had to fly back to Korea…
Mar 30, 2020
•
Ryan Lee
RL Weekly 39: Intrinsic Motivation for Cooperation and Amortized Q-Learning
Dear readers,RL Weekly is back! Sorry for the hiatus, and thank you for your patience. I have been occupied trying to get a summer internship or…
Feb 21, 2020
•
Ryan Lee
RL Weekly 38: Clipped objective is not why PPO works, and the Trap of Saliency maps
Dear readers,Happy holidays! Right after NeurIPS 2019, now the ICLR 2020 results are out! Here are two papers accepted to ICLR 2020 that caught my…
Dec 24, 2019
•
Ryan Lee
RL Weekly 37: Observational Overfitting, Hindsight Credit Assignment, and Procedurally Generated Environment Suite
Dear readers,Happy NeurIPS! This week, I have made my summaries more concise to improve the reading experience. I hope that this change makes the…
Dec 11, 2019
•
Ryan Lee
RL Weekly 36: AlphaZero with a Learned Model achieves SotA in Atari
Dear readers,We have reached 800 subscribers! Thank you so much for subscribing to RL Weekly and sharing it with your friends. I will try my best to…
Nov 28, 2019
•
Ryan Lee
See all
RL News
New results and applications of reinforcement learning
Subscribe
RL News
Subscribe
About
Archive
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts