[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
Stars: 53
Forks: 2
Watchers
Open Issues: 1
18 commits
3
54b23ab  add one-step evaluation metric
9b0a707  add demo video
da8d30c
d27ab53  Update README
f08edce
b8b6dff  add open_llm_fuzzy_match
2ee442b  add cogagent implementation & new fuzzy match
8ab58ef  [update] support openai>1.0.0 and openai azure
f1273ba  Update README.md
dffba74
6a123ca
be3ada3
efecaff
f8e75c2  modify run.py
628f4b8  update run.py and datadir