Module: tf_agents.bandits.agents.linear_thompson_sampling_agent

View source on GitHub

Implements the Linear Thompson Sampling bandit algorithm.


"Thompson Sampling for Contextual Bandits with Linear Payoffs", Shipra Agrawal, Navin Goyal, ICML 2013. The actual algorithm implemented is Algorithm 3 from the supplementary material of the paper from <a href=""></a>.


class LinearThompsonSamplingAgent: Linear Thompson Sampling Agent.