Domains for Open-world Benchmarking

NovelGym and evaluation suites for hybrid planning + learning agents.

NovelGym benchmarks · Clear metrics & protocols

Description

We design domains and metrics that evaluate how hybrid planning + RL agents handle novelty in open-world scenarios. We develop OpenAI Gym-based domains, such as NovelGym and NovelGridworlds, that provide a flexible framework for benchmarking. We also provide clear metrics and protocols for evaluating novelty handling.
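As a rough illustration of what a novelty-handling evaluation protocol can look like, the sketch below runs an agent for a number of episodes, injects a novelty, and compares success rates before and after. It is a toy example and does not use the NovelGym or NovelGridworlds APIs: `CorridorEnv`, `inject_novelty`, `StaticAgent`, `AdaptiveAgent`, and the `pre_success`/`post_success` metric names are all hypothetical, and the environment only imitates the Gym-style `reset`/`step` interface.

```python
class CorridorEnv:
    """Toy 1-D corridor with a Gym-style reset/step interface (hypothetical)."""

    def __init__(self, length=5, max_steps=20):
        self.length = length
        self.max_steps = max_steps
        self.novelty_active = False

    def inject_novelty(self):
        # Hypothetical novelty: action 1 ("move right") stops working
        # and action 2 must be used to move right instead.
        self.novelty_active = True

    def reset(self):
        self.pos = 0
        self.t = 0
        return self.pos

    def step(self, action):
        self.t += 1
        move_right = 2 if self.novelty_active else 1
        if action == move_right:
            self.pos += 1
        elif action == 0:
            self.pos = max(0, self.pos - 1)
        # Any other action is a no-op.
        success = self.pos >= self.length - 1
        done = success or self.t >= self.max_steps
        reward = 1.0 if success else 0.0
        return self.pos, reward, done, {"success": success}


class StaticAgent:
    """Always issues the pre-novelty 'move right' action."""

    def act(self, obs):
        return 1

    def observe(self, prev_obs, obs):
        pass


class AdaptiveAgent:
    """Switches its 'move right' action whenever the position did not change."""

    def __init__(self):
        self.action = 1

    def act(self, obs):
        return self.action

    def observe(self, prev_obs, obs):
        if obs == prev_obs:
            self.action = 2 if self.action == 1 else 1


def run_episode(env, agent):
    """Run one episode and report whether the goal was reached."""
    obs = env.reset()
    done = False
    info = {"success": False}
    while not done:
        action = agent.act(obs)
        next_obs, _reward, done, info = env.step(action)
        agent.observe(obs, next_obs)
        obs = next_obs
    return info["success"]


def evaluate(agent, pre_episodes=10, post_episodes=10):
    """Success rate before and after the novelty is injected."""
    env = CorridorEnv()
    pre = [run_episode(env, agent) for _ in range(pre_episodes)]
    env.inject_novelty()
    post = [run_episode(env, agent) for _ in range(post_episodes)]
    return {
        "pre_success": sum(pre) / len(pre),
        "post_success": sum(post) / len(post),
    }


if __name__ == "__main__":
    print("static  :", evaluate(StaticAgent()))
    print("adaptive:", evaluate(AdaptiveAgent()))
```

Comparing the two agents shows the kind of gap such a protocol is meant to expose: an agent that cannot adapt keeps its pre-novelty performance only until the novelty appears, while an adaptive one recovers. A real benchmark would typically add further measures, such as the number of episodes needed to recover.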

Related Publications

2024. NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds. AAMAS.
2021. NovelGridworlds: A Benchmark Environment for Detecting and Adapting to Novelties in Open Worlds. AAMAS-ALA Workshop.

People

Shivam Goel
Gyan Tatiya
Yichen Wei
Panagiotis Lymperopoulos
Klara Chura
