Bandit algorithms
Updated Oct 12, 2017 - Python
An open-source multi-armed bandit framework for optimizing your website quickly. You'll quickly learn the benefits of several simple algorithms, including the epsilon-Greedy, Softmax, and Upper Confidence Bound (UCB) algorithms, by working through this framework written in Java, which you can easily adapt for deployment on your own website.
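As a minimal sketch of the epsilon-Greedy idea named above (illustrative only, with assumed function names, not this framework's actual API): with probability epsilon pick a random arm to explore, otherwise exploit the arm with the best estimated mean reward.

```python
import random

def select_arm(values, epsilon=0.1):
    """Epsilon-Greedy: explore a random arm with probability epsilon,
    otherwise exploit the arm with the highest estimated value."""
    if random.random() < epsilon:
        return random.randrange(len(values))  # explore
    return max(range(len(values)), key=lambda a: values[a])  # exploit

def update(counts, values, arm, reward):
    """Incremental-mean update of the chosen arm's value estimate."""
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]
```

With epsilon set to 0 the selection is purely greedy, which is a handy sanity check when testing.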
Implementations of basic concepts under the Reinforcement Learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
Implementation scripts of Machine Learning algorithms in Scikit-learn and Keras for complete novices.
Python implementations of epsilon-Greedy, UCB, LinUCB, and LinThompson, plus an offline evaluator.
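The UCB algorithm listed here can be sketched as follows (a hedged illustration of standard UCB1, not this repository's code): play each arm once, then pick the arm maximizing the estimated mean plus an exploration bonus that shrinks as the arm is pulled more.

```python
import math

def ucb1(counts, values):
    """UCB1 arm selection: initialize by trying every arm once, then
    maximize mean value plus the sqrt(2 ln n / n_a) exploration bonus."""
    total = sum(counts)
    for arm, c in enumerate(counts):
        if c == 0:
            return arm  # each arm must be tried at least once
    return max(range(len(values)),
               key=lambda a: values[a] + math.sqrt(2 * math.log(total) / counts[a]))
```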
End-to-end reinforcement-learning-based recommendation system.
Data Intelligence Application project
Implementation of the FeedBack Adaptive Learning (FeedBAL) algorithm for the episodic multi-armed bandit (eMAB) setting.
University of Utah—MKTG 66420 | Taken: Fall 2020
Review project on Information Directed Sampling - MVA MSc
An introduction to multi-armed bandits.
Client that handles the administration of StreamingBandit online, or straight from your desktop. Set up and run streaming (contextual) bandit experiments in your browser.
A library for multi-armed bandits.
Contextual Bandit Engine
Code examples for simple reinforcement learning projects
MABSearch: The Bandit Way of Learning the Learning Rate - A Harmony Between Reinforcement Learning and Gradient Descent
A variety of multi-armed bandit (MAB) algorithms using classic and advanced strategies, including tools for experiments and simulations in stationary and nonstationary environments.
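For the nonstationary setting mentioned above, a common strategy (sketched here under an assumed name, not this library's API) replaces the sample mean with a constant step-size update, so that recent rewards are weighted exponentially more than old ones:

```python
def update_nonstationary(value, reward, alpha=0.1):
    """Exponential recency-weighted average: a constant step size alpha
    lets the estimate track a reward distribution that drifts over time."""
    return value + alpha * (reward - value)
```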