Paper # | Authors | Title |
2 | Ram Rachum, Yonatan Nakar, William Tomlinson, Nitay Alon, Reuth Mirsky | Emergent Dominance Hierarchies in Reinforcement Learning Agents |
3 | Simone Drago, Marco Mussi, Marcello Restelli, Alberto Maria Metelli | Intermediate Observations in Factored-Reward Bandits |
4 | Kyle Crandall, Connor Yates, Corbin Wilhelmi | Lyapunov Guarantees for Learned Policies |
5 | Marc Lanctot, John Schultz, Neil Burch, Max Olan Smith, Daniel Hennes, Thomas Anthony, Julien Perolat | Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning |
7 | Pascal Van der Vaart, Neil Yorke-Smith, Matthijs T. J. Spaan | Bayesian Ensembles for Exploration in Deep Q-Learning |
9 | Jérôme Botoko Ekila, Jens Nevens, Lara Verheyen, Katrien Beuls, Paul Van Eecke | Decentralised Emergence of Robust and Adaptive Linguistic Conventions in Populations of Autonomous Agents Grounded in Continuous Worlds |
10 | Hei Yi Mak, Flint Xiaofeng Fan, Luca A Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer | CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening |
11 | Sunghoon Hong, Whiyoung Jung, Deunsol Yoon, Kanghoon Lee, Woohyung Lim | Agent-Oriented Centralized Critic for Asynchronous Multi-Agent Reinforcement Learning |
13 | David Milec, Ondrej Kubicek, Viliam Lisý | Continual Depth-limited Responses for Computing Counter-strategies in Sequential Games |
14 | Rolando Fernandez, Garrett Warnell, Derrik E. Asher, Peter Stone | Multi-Agent Synchronization Tasks |
15 | Nicole Orzan, Erman Acar, Davide Grossi, Roxana Rădulescu | Learning in Public Goods Games with Non-Linear Utilities: a Multi-Objective Approach |
16 | Argha Boksi, Balaraman Ravindran | Inter-agent Transfer Learning in Communication-constrained Settings: A Student Initiated Advising Approach |
17 | Timothy Flavin, Sandip Sen | A Bayesian Approach to Learning Command Hierarchies for Zero-Shot Multi-Agent Coordination |
18 | Brian Burns, Aravind Sundaresan, Pedro Sequeira, Vidyasagar Sadhu | Learning Sensor Control for Information Gain in Dynamic, Partially Observed and Sparsely Sampled Environments |
22 | Alexandra Cimpean, Catholijn M Jonker, Pieter Jules Karel Libin, Ann Nowe | A Group And Individual Aware Framework For Fair Reinforcement Learning |
23 | Bram M. Renting, Holger Hoos, Catholijn M Jonker | Multi-Agent Meeting Scheduling: A Negotiation Perspective |
24 | Arnau Mayoral Macau, Manel Rodriguez-Soto, Maite López-Sánchez, Juan Antonio Rodriguez Aguilar, Enrico Marchesini, Alessandro Farinelli | An approximate process for designing ethical environments with multi-agent reinforcement learning |
25 | Jonathan G. Faris, Conor F. Hayes, Andre R Goncalves, Kayla G. Sprenger, Daniel Faissol, Brenden K. Petersen, Mikel Landajuela, Felipe Leno da Silva | Pareto Front Training For Multi-Objective Symbolic Optimization |
28 | Zun Li, Michael Wellman | A Meta-Game Evaluation Framework for Deep Multiagent Reinforcement Learning |
29 | Jérôme Arjonilla, Tristan Cazenave, Abdallah Saffidine | Enhancing Reinforcement Learning Through Guided Search |
32 | Jérôme Arjonilla, Tristan Cazenave, Abdallah Saffidine | Perfect Information Monte Carlo with postponing reasoning |
33 | Radovan Haluška, Martin Schmid | Learning to Beat ByteRL: Exploitability of Collectible Card Game Agents |