Skip to content

Latest commit

 

History

History
18 lines (14 loc) · 487 Bytes

File metadata and controls

18 lines (14 loc) · 487 Bytes

AgentOccam-Judge

Overview

AgentOccam-Judge is an open-source agent system that achieves strong performance on web interaction tasks, particularly in the WebArena benchmark.

Key Features

  • Open source implementation
  • Web interaction focus
  • Strong performance on WebArena

Performance

WebArena Results

  • 45.7% success rate (3rd best open source solution)

Technical Details

  • Focus: Web navigation and interaction
  • Open source architecture
  • Task completion oriented