DM$^2$: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching
Published in AAAI 2023, 2023
Estimating and minimizing the Wasserstein distance to an idealized target distribution to learn a goal-conditioned policy
Recommended citation: Caroline Wang, Ishan Durugkar, Elad Liebman, Peter Stone. (2023). "DM2: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching". AAAI, 2023. https://arxiv.org/abs/2206.00233