Domains for Open-world Benchmarking

NovelGym and evaluation suites for hybrid planning + learning agents.

NovelGym benchmarksClear metrics & protocols
NovelGym
Description
We design domains and metrics that evaluate novelty handling across open-world scenarios with hybrid planning+RL agents. We develop openAI gym based domains like NovelGym and NovelGridworlds that provide a flexible framework for benchmarking. We also provide clear metrics and protocols for evaluating novelty handling.
Related Publications
  • 2024NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open WorldsAAMAS
  • 2021NovelGridworlds: A Benchmark Environment for Detecting and Adapting to Novelties in Open WorldsAAMAS-ALA Workshop
People
  • Shivam Goel
  • Gyan Tatiya
  • Yichen Wei
  • Panagiotis Lymperopoulos
  • Klara Chura
Shivam Goel