Module: tf_agents.bandits.policies.linear_bandit_policy

Linear Bandit Policy.

LinUCB and Linear Thompson Sampling policies derive from this class.


Classes

class ExplorationStrategy: Possible exploration strategies.

class LinearBanditPolicy: Linear Bandit Policy to be used by LinUCB, LinTS and possibly others.
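
To make the relationship between the two exploration strategies concrete, here is a minimal NumPy sketch of how an optimistic (LinUCB-style) rule and a sampling (Linear Thompson Sampling-style) rule can share one per-arm linear model. It is an illustration under stated assumptions, not the library's implementation: `Strategy`, `choose_arm`, and all parameter names are hypothetical, and the per-arm statistics are assumed to be a covariance matrix (sum of outer products of observed contexts) and a data vector (sum of reward-weighted contexts).

```python
from enum import Enum

import numpy as np


class Strategy(Enum):
    """Hypothetical stand-in for an exploration-strategy enum."""
    OPTIMISTIC = 1  # LinUCB-style: mean estimate plus a confidence bonus
    SAMPLING = 2    # Thompson-style: score with a sampled parameter vector


def choose_arm(x, cov_matrices, data_vectors, strategy,
               alpha=1.0, tikhonov_weight=1.0, rng=None):
    """Return the index of the arm with the highest exploration score.

    x:            context vector, shape (d,)
    cov_matrices: per-arm sums of outer products of contexts, each (d, d)
    data_vectors: per-arm sums of reward * context, each (d,)
    alpha:        exploration scale (assumed name)
    """
    if rng is None:
        rng = np.random.default_rng()
    scores = []
    for cov, b in zip(cov_matrices, data_vectors):
        # Regularize so the matrix is invertible even with no data.
        a_inv = np.linalg.inv(cov + tikhonov_weight * np.eye(len(x)))
        theta = a_inv @ b  # ridge-regression estimate of the arm's weights
        if strategy is Strategy.OPTIMISTIC:
            # Upper confidence bound on the expected reward.
            score = x @ theta + alpha * np.sqrt(x @ a_inv @ x)
        else:
            # Draw weights from a Gaussian centered on the estimate.
            theta_sample = rng.multivariate_normal(theta, alpha**2 * a_inv)
            score = x @ theta_sample
        scores.append(score)
    return int(np.argmax(scores))
```

In this sketch both strategies reuse the same regularized estimate and differ only in how uncertainty enters the score, which is why a single base class can plausibly serve both. A larger `alpha` shifts the optimistic rule toward arms with little data.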