Benchmarked agent execution runtime for Python. Sub-10ms cold starts, real-time streaming, time-travel debugging, and self-growing tool libraries. Compare 3 sandbox backends: Docker (OpenSandbox), MicroVM, and in-process AST.
Stars
8
Forks
2
Watchers
8
Open Issues
3
Overall repository health assessment
No package.json found
This might not be a Node.js project
104
commits
Add Azure AI Foundry provider support and benchmark CLI improvements
02b663cView on GitHubAdd conceptdrift benchmark harness, asset scripts, and gitignore for cache/result
cee3befView on GitHubRefactor benchmark CLI to use pathlib for file path handling
1ba436eView on GitHubEnhance SkillsBench integration and documentation for skill evaluation
73796faView on GitHubUpdate README for clarity on Recursive Language Models and context handling
1440466View on GitHubUpdate project files and documentation for improved clarity and functionality
4a5ba98View on GitHubUpdate authors list in project metadata to remove URL for Mantix AI Research
b38d111View on GitHubUpdate project metadata to include Mantix AI Research as a contributor
fe7e59bView on GitHubUpdate README and benchmark documentation to reflect GPT-5.2 results and version bump
1736fffView on GitHubUpdate README to reflect changes in PTC benchmark performance metrics and insights
76d2f27View on GitHubUpdate GitHub Actions workflow to use actions/upload-artifact@v4
6593fd1View on GitHubUpdate benchmark tests to focus on PTC tasks and enhance integration configurations
3c4c5fcView on GitHubAdd PTC-Bench Interactive Dashboard for Benchmark Visualization
1c23729View on GitHubIntroduce PTC-Bench: A systematic benchmark for Programmatic Tool Calling (PTC) vs Function Calling (FC)
1a0b2feView on GitHub