Module: tf_agents.bandits.agents.bernoulli_thompson_sampling_agent

Stay organized with collections Save and categorize content based on your preferences.

An Thompson sampling agent for Bernoulli bandit problems.

Classes

class BernoulliBanditVariableCollection: A collection of variables used by BernoulliThompsonSamplingAgent.

class BernoulliThompsonSamplingAgent: A Thompson Sampling agent for non-contextual Bernoulli bandit problems.