A library of reinforcement learning components and agents
Stars
4.0k
Forks
536
Watchers
4.0k
Open Issues
98
Overall repository health assessment
No package.json found
This might not be a Node.js project
[pmap] In-line definitions of `jax.device_put_sharded` and `jax.device_put_replicated`.
01beeceView on GitHubResolve unsoundness caught by pytype --strict-none-binding.
882fb08View on GitHubRemove video wrapper to avoid directly calling ffmpeg.
cafbfc5View on GitHubRemove video wrapper to avoid directly calling ffmpeg.
5a1da00View on GitHubRemove video wrapper to avoid directly calling ffmpeg.
15bd3e8View on GitHub[pmap] Handle edge cases in get_from_first_device for jax_pmap_shmap_merge.
e210507View on GitHubReplace unicode escaped characters in ipynb files
05ced10View on GitHub[pmap] Avoid degraded performance under the new `jax.pmap`.
c292597View on GitHubAdd an optional customizable loss callable in the PPONetworks, whose output will be added to total loss.
8ec1a77View on GitHubSilence pytype errors related to improved `jax.nn` type annotations.
6828035View on GitHubAdd Wasserstein Policy Optimization (WPO, http://arxiv.org/pdf/2505.00663) to Acme
ed4e57aView on GitHub134
commits
133
commits
121
commits
83
commits
71
commits
65
commits
22
commits
20
commits
16
commits
14
commits