Modern LLM agents interact with the web through various architectures - from traditional browser automation to API-based approaches. This project provides implementation and evaluation code to systematically compare their effectiveness across 91 realistic e-commerce scenarios.
Stars
1
Forks
1
Watchers
1
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
Update benchmark JSON path and change model name in main function
62a1522View on GitHubEnhance clarity in RAG and MCP interface examples by refining descriptions and labels
88ef291View on GitHubRefactor query parameter naming for consistency across API and HTML examples
5f94bfbView on GitHubUpdate navigation labels for clarity and consistency in the interface
72e7f00View on GitHubAdd interface examples and styling for RAG and MCP comparisons
0d30c4fView on GitHub