Deep Multi-Agent Reinforcement Learning

Amortized Rejection Sampling in Universal Probabilistic Programming
In this paper we develop a new and efficient amortized importance sampling estimator for rejection sampling.
Amortized Rejection Sampling in Universal Probabilistic Programming
Communicating via Markov Decision Processes
We propose a perfectly-secure steganography algorithm for arbitrary covertext distributions.
Communicating via Markov Decision Processes