Found 978 repositories (showing 30)
johnmyleswhite
Code for my book on Multi-Armed Bandit Algorithms
david-cortes
Python implementations of contextual bandits algorithms
st-tech
Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation
SMPyBandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Armed Bandit (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM, etc.). Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
HCDM
Library of contextual bandits algorithms
c-bata
A hyperparameter optimization framework, inspired by Optuna.
alison-carrera
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
alison-carrera
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
WilliamLwj
PyXAB - A Python Library for X-Armed Bandit and Online Blackbox Optimization Algorithms
KKeishiro
Yahoo! news article recommendation system using LinUCB
Unity-Technologies
Demo project using multi-armed bandit algorithm
kfoofw
Bandit algorithms simulations for online learning
raffg
Monte Carlo simulations of several different multi-armed bandit algorithms and a comparison with classical statistical A/B testing
pingles
Multi-armed bandit algorithms in Clojure
tor
Library for Multi-Armed Bandit Algorithms
jldbc
Multi-Armed Bandit algorithms applied to the MovieLens 20M dataset
rayshi14
Hybrid Linear UCB bandit learning algorithm (L. Li, 2010), Python code
sshkhr
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and TensorFlow
j2kun
Python code for the post "Adversarial Bandits and the Exp3 Algorithm"
timnugent
Epsilon-greedy, softmax and LinUCB contextual bandit implementations [recommender systems]
abietti
Scripts for evaluation of contextual bandit algorithms
microsoft
SelfTune is an RL framework that enables systems and service developers to automatically tune various configuration parameters and other heuristics in their codebase over time in deployment, rather than tweaking them manually. It provides an easy-to-use API (Python, C# bindings) and is driven by bandit-style RL and online gradient-descent algorithms.
jwmueller
Bandit algorithms for dynamic pricing of many products
danisola
Bandit algorithms and test framework in Java
mreid
A JavaScript demo of some multi-armed bandit algorithms
SahanaRamnath
Implementation of various multi-armed bandit algorithms on a 10-arm testbed.
Ragnaroek
Bandit Algorithms in Rust
akshaykr
Experimentation for oracle-based contextual bandit algorithms.
thunfischtoast
Contextual bandit algorithm called LinUCB / Linear Upper Confidence Bounds as proposed by Li, Langford and Schapire
niffler92
Bandit algorithms