1

Discovered Policy Optimisation
In this paper we explore the Mirror Learning space by meta-learning a “drift” function.
Discovered Policy Optimisation
Rainbench: Towards Data-Driven Global Precipitation Forecasting from Satellite Imagery
We introduce RainBench, a new multi-modal benchmark dataset for data-driven precipitation forecasting.
Rainbench: Towards Data-Driven Global Precipitation Forecasting from Satellite Imagery
FACMAC: Factored Multi-Agent Centralised Policy Gradients
We propose multi-agent common knowledge reinforcement learning (MACKRL).
FACMAC: Factored Multi-Agent Centralised Policy Gradients
Multi-Agent Common Knowledge Reinforcement Learning
We propose multi-agent common knowledge reinforcement learning (MACKRL).
Multi-Agent Common Knowledge Reinforcement Learning
The Starcraft Multi-Agent Challenge
We propose the StarCraft Multi-Agent Challenge (SMAC) to measure real progress in MARL.
The Starcraft Multi-Agent Challenge