Module: tf_agents.bandits.agents.lin_ucb_agent

Implements the Linear UCB bandit algorithm.

Reference:

"A Contextual Bandit Approach to Personalized News Article Recommendation", Lihong Li, Wei Chu, John Langford, Robert Schapire, WWW 2010.
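For orientation, the disjoint variant of Linear UCB described in the referenced paper can be sketched in plain Python. This is an illustrative toy, not the TF-Agents implementation: the class `DisjointLinUCB` and its methods `ucb`, `select_arm`, and `update` are hypothetical names, and the sketch keeps each arm's inverse matrix up to date with a Sherman-Morrison rank-one update rather than inverting the matrix at every step. The exploration weight `alpha` follows the paper's notation.

```python
import math


class DisjointLinUCB:
    """Toy disjoint LinUCB: one ridge-regression model per arm (hypothetical helper)."""

    def __init__(self, num_arms, context_dim, alpha=1.0):
        self.alpha = alpha
        self.dim = context_dim
        # Per-arm inverse design matrix A_a^{-1}, initialised to the identity.
        self.a_inv = [
            [[1.0 if i == j else 0.0 for j in range(context_dim)]
             for i in range(context_dim)]
            for _ in range(num_arms)
        ]
        # Per-arm reward-weighted context sum b_a, initialised to zero.
        self.b = [[0.0] * context_dim for _ in range(num_arms)]

    @staticmethod
    def _matvec(matrix, vector):
        return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

    def ucb(self, arm, x):
        """Estimated reward plus alpha times the confidence width for context x."""
        theta = self._matvec(self.a_inv[arm], self.b[arm])  # A_a^{-1} b_a
        a_inv_x = self._matvec(self.a_inv[arm], x)
        mean = sum(t * xi for t, xi in zip(theta, x))
        variance = sum(xi * v for xi, v in zip(x, a_inv_x))  # x^T A_a^{-1} x
        return mean + self.alpha * math.sqrt(max(variance, 0.0))

    def select_arm(self, x):
        """Return the arm with the highest upper confidence bound (first on ties)."""
        scores = [self.ucb(arm, x) for arm in range(len(self.b))]
        return max(range(len(scores)), key=scores.__getitem__)

    def update(self, arm, x, reward):
        """Apply A_a += x x^T and b_a += reward * x for the chosen arm."""
        a_inv = self.a_inv[arm]
        u = self._matvec(a_inv, x)  # A_a^{-1} x; A_a^{-1} is symmetric
        denom = 1.0 + sum(xi * ui for xi, ui in zip(x, u))
        # Sherman-Morrison: (A + x x^T)^{-1} = A^{-1} - (u u^T) / (1 + x^T u)
        for i in range(self.dim):
            for j in range(self.dim):
                a_inv[i][j] -= u[i] * u[j] / denom
        self.b[arm] = [bi + reward * xi for bi, xi in zip(self.b[arm], x)]
```

A typical loop would call `select_arm(context)`, observe a reward for the chosen arm, and then call `update(arm, context, reward)`. Larger `alpha` values widen the confidence term and so favor exploration.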

Classes

class LinearUCBAgent: An agent implementing the Linear UCB bandit algorithm.