Unified benchmark for evaluating architectural design choices in LLM-based single-agent and multi-agent frameworks across orchestration, memory, planning, specialization, and coordination.
benchmarking multi-agent-systems agent-frameworks llm-agents agent-planning agentic-ai agentic-ai-architecture architectural-evaluation agent-memory-system
-
Updated
Feb 3, 2026 - HTML