AgentOccam-Judge is an open-source agent system that achieves strong performance on web interaction tasks, particularly in the WebArena benchmark.
- Open source implementation
- Web interaction focus
- Strong performance on WebArena
- 45.7% success rate (3rd best open source solution)
- Focus: Web navigation and interaction
- Open source architecture
- Task completion oriented