Publications

Communicating via Markov Decision Processes
We propose a perfectly-secure steganography algorithm for arbitrary covertext distributions.
Communicating via Markov Decision Processes
Amortized Rejection Sampling in Universal Probabilistic Programming
In this paper we develop a new and efficient amortized importance sampling estimator for rejection sampling.
Amortized Rejection Sampling in Universal Probabilistic Programming
Discovered Policy Optimisation
In this paper we explore the Mirror Learning space by meta-learning a “drift” function.
Discovered Policy Optimisation
Rainbench: Towards Data-Driven Global Precipitation Forecasting from Satellite Imagery
We introduce RainBench, a new multi-modal benchmark dataset for data-driven precipitation forecasting.
Rainbench: Towards Data-Driven Global Precipitation Forecasting from Satellite Imagery
Multi-Agent Common Knowledge Reinforcement Learning
We propose multi-agent common knowledge reinforcement learning (MACKRL).
Multi-Agent Common Knowledge Reinforcement Learning
FACMAC: Factored Multi-Agent Centralised Policy Gradients
We propose multi-agent common knowledge reinforcement learning (MACKRL).
FACMAC: Factored Multi-Agent Centralised Policy Gradients
The Starcraft Multi-Agent Challenge
We propose the StarCraft Multi-Agent Challenge (SMAC) to measure real progress in MARL.
The Starcraft Multi-Agent Challenge