AI Evaluation Frameworks

This database provides an overview of AI Evaluation Framework / Benchmark projects at the ETH Agentic Systems Lab. Please click on any available project to find more details.

Disclaimer: For the most current and complete information, please contact the respective project leaders directly. This list is not exhaustive and may not reflect all ongoing work. Some projects may be newly added, evolving, or discontinued over time.

AI Evaluation / Benchmark Projects