Cooperative AI

Amortized Rejection Sampling in Universal Probabilistic Programming
In this paper we develop a new and efficient amortized importance sampling estimator for rejection sampling.
Amortized Rejection Sampling in Universal Probabilistic Programming
Communicating via Markov Decision Processes
We propose a perfectly-secure steganography algorithm for arbitrary covertext distributions.
Communicating via Markov Decision Processes
Discovered Policy Optimisation
In this paper we explore the Mirror Learning space by meta-learning a “drift” function.
Discovered Policy Optimisation