Gaia Agent Evaluation Runner

Instructions:

  1. Clone this space, then modify the code to define your agent's logic.
  2. Log in to your Hugging Face account using the button below.
  3. Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see results.

Note: Using the HF API can take a few seconds per question.

Questions and Agent Answers

Questions and Agent Answers