GAIA Agent Evaluation Runner
Instructions:
- Clone this space, then modify the code to define your agent's logic.
- Log in to your Hugging Face account using the button below.
- Click 'Run Evaluation & Submit All Answers' to fetch questions, run your agent, submit answers, and see results.
Note: Calls to the HF API can take a few seconds per question.
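As a rough illustration of the "define your agent's logic" and "run your agent" steps, here is a minimal sketch in Python. It assumes the agent is a callable that takes a question string and returns an answer string, and that each question arrives as a dictionary. The names `EchoAgent`, `run_agent`, and the field names `task_id`, `question`, and `submitted_answer` are assumptions made for this example, not something the instructions confirm. Fetching questions and submitting answers over the network are deliberately left out.

```python
from typing import Callable, Dict, List


class EchoAgent:
    """Hypothetical placeholder agent: replace __call__ with your own logic."""

    def __call__(self, question: str) -> str:
        # A real agent would call a model or tools here.
        return f"No answer yet for: {question}"


def run_agent(
    agent: Callable[[str], str],
    questions: List[Dict[str, str]],
) -> List[Dict[str, str]]:
    """Run the agent on each question and collect one answer record per question."""
    answers = []
    for item in questions:
        # Field names are assumed for illustration.
        task_id = item.get("task_id")
        text = item.get("question")
        if not task_id or text is None:
            continue  # skip malformed entries instead of failing the whole run
        try:
            answer = agent(text)
        except Exception as exc:
            # Record the failure so one bad question does not stop the run.
            answer = f"AGENT ERROR: {exc}"
        answers.append({"task_id": task_id, "submitted_answer": answer})
    return answers


# Small in-memory sample standing in for the fetched questions.
sample_questions = [
    {"task_id": "demo-1", "question": "What is 2 + 2?"},
    {"task_id": "demo-2", "question": "Name a prime number."},
]
results = run_agent(EchoAgent(), sample_questions)
```

Catching exceptions per question is a design choice in this sketch: since each question may take a few seconds, it seems preferable to keep partial results rather than lose a whole run to a single error.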
Questions and Agent Answers